Affinity Workshop
|
|
Towards automatic identification of self-reported COVID-19 tweets: introducing a multilingual manually annotated dataset, baseline systems, and exploratory evaluations
Ramya Tekumalla · Juan Banda · Luis Alberto Robles Hernandez
|
|
Workshop
|
|
WAMP: A Competition-Level Dataset for Assessing the Mathematical Reasoning Capabilities of LLMs
Yujun Mao · Yoon Kim · Yilun Zhou
|
|
Workshop
|
|
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Aditi Jha · Sam Havens · Jeremy Dohmann · Alexander Trott · Jacob Portes
|
|
Affinity Workshop
|
|
PERFORMANCE EVALUATION OF LARGE LANGUAGE MODELS IN MACHINE TRANSLATION AND TEXT CLASSIFICATION TASKS ON TWO GHANAIAN LANGUAGE DATASETS, TWI AND DAGBANI, AND THE ACADEMIC (MIS)USE CASES OF GENERATIVE AI IN GHANAIAN TERTIARY EDUCATION
Rose-Mary Owusuaa Mensah Gyening
|
|
Oral
|
Wed 14:15
|
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani · Anssi Kanervisto · Karolis Ramanauskas · Sander Schulhoff · Brandon Houghton · Rohin Shah
|
|
Affinity Workshop
|
|
Unraveling the Effects of Age-Based Distribution Shifts on Medical Image Classifiers
Kumail Alhamoud · Yasir Ghunaim · Motasem Alfarra · Philip Torr · Tom Hartvigsen · Bernard Ghanem · Adel Bibi · Marzyeh Ghassemi
|
|
Affinity Workshop
|
|
PERFORMANCE EVALUATION OF LARGE LANGUAGE MODELS IN MACHINE TRANSLATION AND TEXT CLASSIFICATION TASKS ON TWO GHANAIAN LANGUAGE DATASETS, TWI AND DAGBANI, AND THE ACADEMIC (MIS)USE CASES OF GENERATIVE AI IN GHANAIAN TERTIARY EDUCATION
Rose-Mary Owusuaa Mensah Gyening
|
|
Workshop
|
|
Preparation Of Labeled Cryo-ET Datasets For Training And Evaluation Of Machine Learning Models
Aygul Ishemgulova · Alex J. Noble · Tristan Bepler · Alex De Marco
|
|
Poster
|
Thu 15:00
|
Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis
Abhinav Nippani · Dongyue Li · Haotian Ju · Haris Koutsopoulos · Hongyang Zhang
|
|
Poster
|
Wed 15:00
|
BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani · Anssi Kanervisto · Karolis Ramanauskas · Sander Schulhoff · Brandon Houghton · Rohin Shah
|
|
Workshop
|
Sat 9:45
|
Evaluating Peripheral Vision as an Input Transformation to Understand Object Detection Model Behavior
Anne Harrington · Vasha DuTell · Mark Hamilton · Ayush Tewari · Simon Stent · Bill Freeman · Ruth Rosenholtz
|
|
Poster
|
Tue 8:45
|
OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects
Isabella Liu · Linghao Chen · Ziyang Fu · Liwen Wu · Haian Jin · Zhong Li · Chin Ming Ryan Wong · Yi Xu · Ravi Ramamoorthi · Zexiang Xu · Hao Su
|
|