41 Results
Workshop
|
Sun 15:30 |
Reva Schwartz: Real World Matters: What Actually Happens When People Use AI? The NIST Assessing Risks and Impacts of AI (ARIA) Program Reva Schwartz |
|
Workshop
|
FEABench: Evaluating Language Models on Real World Physics Reasoning Ability Nayantara Mudur · Hao Cui · Subhashini Venugopalan · Paul Raccuglia · Michael Brenner · Peter Norgaard |
||
Workshop
|
Track 1: Robust Offline Learning via Adversarial World Models Uljad Berdica · Kelvin Li · Michael Beukman · Alexander D. Goldie · Perla Maiolino · Jakob Foerster |
||
Poster
|
Wed 16:30 |
SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation Zeyao Ma · Bohan Zhang · Jing Zhang · Jifan Yu · Xiaokang Zhang · Xiaohan Zhang · Sijia Luo · Xi Wang · Jie Tang |
|
Poster
|
Fri 16:30 |
Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving Theodore Tsesmelis · Luca Palmieri · Marina Khoroshiltseva · Adeela Islam · Gur Elkin · Ofir I Shahar · Gianluca Scarpellini · Stefano Fiorini · Yaniv Ohayon · Nadav Alali · Sinem Aslan · Pietro Morerio · Sebastiano Vascon · Elena Gravina · Maria Napolitano · Giuseppe Scarpati · Gabriel Zuchtriegel · Alexandra Spühler · Michel Fuchs · Stuart James · Ohad Ben-Shahar · Marcello Pelillo · Alessio Del Bue |