31 Results
| Type | Time | Title | Authors |
|---|---|---|---|
| Workshop | | THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models | Mengfei Liang · Archish Arun · Zekun Wu · CRISTIAN VILLALOBOS · Jonathan Lutch · Emre Kazim · Adriano Koshiyama · Philip Treleaven |
| Poster | Thu 16:30 | EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries | Sunjun Kweon · Jiyoun Kim · Heeyoung Kwak · Dongchul Cha · Hangyul Yoon · Kwang Kim · Jeewon Yang · Seunghyun Won · Edward Choi |
| Workshop | | Using Relational and Causality Context for Tasks with Specialized Vocabularies that are Challenging for LLMs | Ryosuke Nakanishi · Yan-Ying Chen · Francine Chen · Matt Klenk · Charlene C. Wu |
| Workshop | | Spectro: A multi-modal approach for molecule elucidation using IR and NMR data | Edwin Chacko · Rudra Sondhi · Arnav Praveen · Kylie Luska · Rodrigo Vargas-Hernandez |
| Workshop | | Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks | Yun-Shiuan Chuang · Krirk Nirunwiroj · Zach Studdiford · Agam Goyal · Vincent Frigo · Sijia Yang · Dhavan Shah · Junjie Hu · Timothy T Rogers |
| Workshop | Sat 10:45 | TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning | Xinyuan Lu · Liangming Pan · Yubo Ma · Preslav Nakov · Min-Yen Kan |
| Workshop | | VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search | David Brandfonbrener · Simon Henniger · Sibi Raja · Tarun Prasad · Chloe Loughridge · Federico Cassano · Sabrina Hu · Jianang Yang · William Byrd · Robert Zinkov · Nada Amin |