Skip to yearly menu bar Skip to main content


Language Agents as Hackers: Evaluating Cybersecurity Skills with Capture the Flag

John Yang ⋅ Akshara Prabhakar ⋅ Shunyu Yao ⋅ Kexin Pei ⋅ Karthik Narasimhan

Abstract

Video

Chat is not available.