Invited talk by Florian Tramèr
Florian Tramèr
Abstract
When everyone attacks
What happens when many people try to attack or influence AI systems simultaneously? I'll examine this question through three lenses. First, I'll discuss why popular collective defense tools against AI fail to deliver on their promises. Second, I'll discuss some dynamics that may emerge when multiple attackers try to steer an AI system to their benefit. Finally, I'll sketch a more optimistic vision: what if, instead of attacking AI systems, we deployed AI that genuinely represents our interests and acts as an intermediary between ourselves and digital content?