Skip to yearly menu bar Skip to main content


Benchmarking Large Language Models as AI Research Agents

Qian Huang ⋅ Jian Vora ⋅ Percy Liang ⋅ Jure Leskovec

Abstract

Chat is not available.