Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

18 Results

<<   <   Page 1 of 2   >   >>
Affinity Workshop
Towards automatic identification of self-reported COVID-19 tweets: introducing a multilingual manually annotated dataset, baseline systems, and exploratory evaluations
Ramya Tekumalla · Juan Banda · Luis Alberto Robles Hernandez
Workshop
WAMP: A Competition-Level Dataset for Assessing the Mathematical Reasoning Capabilities of LLMs
Yujun Mao · Yoon Kim · Yilun Zhou
Workshop
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Aditi Jha · Sam Havens · Jeremy Dohmann · Alexander Trott · Jacob Portes
Affinity Workshop
PERFORMANCE EVALUATION OF LARGE LANGUAGE MODELS IN MACHINE TRANSLATION AND TEXT CLASSIFICATION TASKS ON TWO GHANAIAN LANGUAGE DATASETS, TWI AND DAGBANI, AND THE ACADEMIC (MIS)USE CASES OF GENERATIVE AI IN GHANAIAN TERTIARY EDUCATION
Rose-Mary Owusuaa Mensah Gyening
Oral
Wed 14:15 BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani · Anssi Kanervisto · Karolis Ramanauskas · Sander Schulhoff · Brandon Houghton · Rohin Shah
Affinity Workshop
Unraveling the Effects of Age-Based Distribution Shifts on Medical Image Classifiers
Kumail Alhamoud · Yasir Ghunaim · Motasem Alfarra · Philip Torr · Tom Hartvigsen · Bernard Ghanem · Adel Bibi · Marzyeh Ghassemi
Affinity Workshop
PERFORMANCE EVALUATION OF LARGE LANGUAGE MODELS IN MACHINE TRANSLATION AND TEXT CLASSIFICATION TASKS ON TWO GHANAIAN LANGUAGE DATASETS, TWI AND DAGBANI, AND THE ACADEMIC (MIS)USE CASES OF GENERATIVE AI IN GHANAIAN TERTIARY EDUCATION
Rose-Mary Owusuaa Mensah Gyening
Workshop
Preparation Of Labeled Cryo-ET Datasets For Training And Evaluation Of Machine Learning Models
Aygul Ishemgulova · Alex J. Noble · Tristan Bepler · Alex De Marco
Poster
Thu 15:00 Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis
Abhinav Nippani · Dongyue Li · Haotian Ju · Haris Koutsopoulos · Hongyang Zhang
Poster
Wed 15:00 BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Stephanie Milani · Anssi Kanervisto · Karolis Ramanauskas · Sander Schulhoff · Brandon Houghton · Rohin Shah
Workshop
Sat 9:45 Evaluating Peripheral Vision as an Input Transformation to Understand Object Detection Model Behavior
Anne Harrington · Vasha DuTell · Mark Hamilton · Ayush Tewari · Simon Stent · Bill Freeman · Ruth Rosenholtz
Poster
Tue 8:45 OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects
Isabella Liu · Linghao Chen · Ziyang Fu · Liwen Wu · Haian Jin · Zhong Li · Chin Ming Ryan Wong · Yi Xu · Ravi Ramamoorthi · Zexiang Xu · Hao Su