NeurIPS Datasets and Benchmarks Dataset and Benchmark Track 3

Datasets and Benchmarks

Dataset and Benchmark Track 3

Joaquin Vanschoren · Serena Yeung

Moderator : Alice Oh

[ Abstract ]

Abstract:

The Datasets and Benchmarks track serves as a novel venue for high-quality publications, talks, and posters on highly valuable machine learning datasets and benchmarks, as well as a forum for discussions on how to improve dataset development. Datasets and benchmarks are crucial for the development of machine learning methods, but also require their own publishing and reviewing guidelines. For instance, datasets can often not be reviewed in a double-blind fashion, and hence full anonymization will not be required. On the other hand, they do require additional specific checks, such as a proper description of how the data was collected, whether they show intrinsic bias, and whether they will remain accessible.

Chat is not available.

Schedule

Fri 12:00 a.m. - 12:10 a.m.	Programming Puzzles ( Oral ) > SlidesLive Video	Tal Schuster · Ashwin Kalyan · Alex Polozov · Adam Kalai 🔗
Fri 12:10 a.m. - 12:20 a.m.	Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models ( Oral ) > SlidesLive Video	Boxin Wang · Chejian Xu · Shuohang Wang · Zhe Gan · Yu Cheng · Jianfeng Gao · Ahmed Awadallah · Bo Li 🔗
Fri 12:20 a.m. - 12:30 a.m.	NaturalProofs: Mathematical Theorem Proving in Natural Language ( Oral ) > SlidesLive Video	Sean Welleck · Jiacheng Liu · Ronan Le Bras · Hanna Hajishirzi · Yejin Choi · Kyunghyun Cho 🔗
Fri 12:30 a.m. - 12:40 a.m.	HumBugDB: A Large-scale Acoustic Mosquito Dataset ( Oral ) > SlidesLive Video	16 presenters Ivan Kiskin · Marianne Sinka · Adam Cobb · Waqas Rafique · Lawrence Wang · Davide Zilli · Benjamin Gutteridge · Rinita Dam · Theodoros Marinos · Yunpeng Li · Dickson Msaky · Emmanuel Kaindoa · Gerard Killeen · Eva Herreros-Moya · Kathy Willis · Stephen J Roberts 🔗
Fri 12:40 a.m. - 1:00 a.m.	Joint Q&A ( Q&A ) >	🔗