firstbacksecondback
4 Results
Poster
|
Wed 14:00 |
What are the best Systems? New Perspectives on NLP Benchmarking Pierre Colombo · Nathan Noiry · Ekhine Irurozki · Stephan Clémençon |
|
Poster
|
Tue 14:00 |
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish Lukasz Augustyniak · Kamil Tagowski · Albert Sawczyn · Denis Janiak · Roman Bartusiak · Adrian Szymczak · Arkadiusz Janz · Piotr Szymański · Marcin Wątroba · Mikołaj Morzy · Tomasz Kajdanowicz · Maciej Piasecki |
|
Poster
|
Thu 14:00 |
CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior Eldar D Abraham · Karel D'Oosterlinck · Amir Feder · Yair Gat · Atticus Geiger · Christopher Potts · Roi Reichart · Zhengxuan Wu |
|
Poster
|
Wed 9:00 |
Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models Maribeth Rauh · John Mellor · Jonathan Uesato · Po-Sen Huang · Johannes Welbl · Laura Weidinger · Sumanth Dathathri · Amelia Glaese · Geoffrey Irving · Iason Gabriel · William Isaac · Lisa Anne Hendricks |