NeurIPS 2024

Workshop

Sat 17:27

Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Khaoula Chehbouni · Jonathan Colaço Carr · Yash More · Jackie CK Cheung · Golnoosh Farnadi

Workshop

The Intersectionality Problem for Algorithmic Fairness
Johannes Himmelreich · Arbie Hsu · Kristian Lum · Ellen Veomett

Workshop

Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Changgeon Ko · Jisu Shin · Hoyun Song · Jeongyeon Seo · Jong Park

Workshop

Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
Kefan Song · Jin Yao · Shangtong Zhang

Workshop

Sat 9:00

Algorithmic Fairness through the lens of Metrics and Evaluation
Awa Dieng · Miriam Rateike · Jamelle Watson-Daniels · Golnoosh Farnadi · Nando Fioretto

Workshop

Multilingual Hallucination Gaps in Large Language Models
Cléa Chataigner · Afaf Taik · Golnoosh Farnadi

Workshop

MED-OMIT: Extrinsically-Focused Evaluation Metric for Omissions in Medical Summarization
Elliot Schumacher · Daniel Rosenthal · Dhruv Naik · Varun Nair · Luladay Price · Geoffrey Tso · Anitha Kannan

Workshop

THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models
Mengfei Liang · Archish Arun · Zekun Wu · CRISTIAN VILLALOBOS · Jonathan Lutch · Emre Kazim · Adriano Koshiyama · Philip Treleaven

Workshop

RelWire: Metric Based Rewiring
Rishi Sonthalia · Anna Gilbert · Matthew Durham

Workshop

Does Maximizing Neural Regression Scores Teach Us About The Brain?
Rylan Schaeffer · Mikail Khona · Sarthak Chandra · Mitchell Ostrow · Brando Miranda · Sanmi Koyejo

Workshop

Decision-margin consistency: a principled metric for human and machine performance alignment
George Alvarez · Talia Konkle

Workshop

Demographic (Mis)Alignment of LLMs' Perception of Offensiveness
Shayan Alipour · Indira Sen · Preetam Prabhu Srikar Dammu · Chris Choi · Mattia Samory · Tanu Mitra

Main Navigation