3 Results
Workshop | Regress, Don’t Guess – A Regression-like Loss on Number Tokens for Language Models | Jonas Zausinger · Lars Pennig · Kacper Chlodny · Vincent Limbach · Anna Ketteler · Thorben Prein · Vishwa Mohan Singh · Michael Danziger · Jannis Born
Poster | Wed 16:30 | Evaluating Numerical Reasoning in Text-to-Image Models | Ivana Kajić · Olivia Wiles · Isabela Albuquerque · Matthias Bauer · Su Wang · Jordi Pont-Tuset · Aida Nematzadeh
Workshop | FEABench: Evaluating Language Models on Real World Physics Reasoning Ability | Nayantara Mudur · Hao Cui · Subhashini Venugopalan · Paul Raccuglia · Michael Brenner · Peter Norgaard