85 Results
Type | Time | Title | Authors
Workshop | Sat 17:27 | Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset | Khaoula Chehbouni · Jonathan Colaço Carr · Yash More · Jackie CK Cheung · Golnoosh Farnadi
Workshop | | The Intersectionality Problem for Algorithmic Fairness | Johannes Himmelreich · Arbie Hsu · Kristian Lum · Ellen Veomett
Workshop | | Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach | Changgeon Ko · Jisu Shin · Hoyun Song · Jeongyeon Seo · Jong Park
Workshop | | Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models | Kefan Song · Jin Yao · Shangtong Zhang
Workshop | Sat 9:00 | Algorithmic Fairness through the lens of Metrics and Evaluation | Awa Dieng · Miriam Rateike · Jamelle Watson-Daniels · Golnoosh Farnadi · Nando Fioretto
Workshop | | Multilingual Hallucination Gaps in Large Language Models | Cléa Chataigner · Afaf Taik · Golnoosh Farnadi
Workshop | | MED-OMIT: Extrinsically-Focused Evaluation Metric for Omissions in Medical Summarization | Elliot Schumacher · Daniel Rosenthal · Dhruv Naik · Varun Nair · Luladay Price · Geoffrey Tso · Anitha Kannan
Workshop | | THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Mengfei Liang · Archish Arun · Zekun Wu · CRISTIAN VILLALOBOS · Jonathan Lutch · Emre Kazim · Adriano Koshiyama · Philip Treleaven
Workshop | | RelWire: Metric Based Rewiring | Rishi Sonthalia · Anna Gilbert · Matthew Durham
Workshop | | Does Maximizing Neural Regression Scores Teach Us About The Brain? | Rylan Schaeffer · Mikail Khona · Sarthak Chandra · Mitchell Ostrow · Brando Miranda · Sanmi Koyejo
Workshop | | Decision-margin consistency: a principled metric for human and machine performance alignment | George Alvarez · Talia Konkle
Workshop | | Demographic (Mis)Alignment of LLMs' Perception of Offensiveness | Shayan Alipour · Indira Sen · Preetam Prabhu Srikar Dammu · Chris Choi · Mattia Samory · Tanu Mitra