Skip to yearly menu bar Skip to main content


Poster
in
Workshop: The Second Workshop on GenAI for Health: Potential, Trust, and Policy Compliance
Sat, Dec 6, 2025 • 5:00 PM – 5:00 PM PST

Demo: Statistically Significant Results on Biases and Errors of LLMs Do Not Guarantee Generalizable Results

Jonathan Liu ⋅ Damianos Karakos ⋅ Mark Dredze ⋅ Jonathan Lasko ⋅ Haoling Qiu ⋅ Mahsa Yarmohammadi

Abstract

Chat is not available.