Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

9 Results

<<   <   Page 1 of 1   >>   >
Workshop
Towards a Situational Awareness Benchmark for LLMs
Rudolf Laine · Alexander Meinke · Owain Evans
Workshop
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics
Haoqin Tu · Bingchen Zhao · Chen Wei · Cihang Xie
Workshop
Leveraging expert feedback to align proxy and ground truth rewards in goal-oriented molecular generation
Julien Martinelli · Yasmine Nahal · Duong Lê · Ola Engkvist · Samuel Kaski
Workshop
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou · Zhihong Shao · Yeyun Gong · yelong shen · Yujiu Yang · Nan Duan · Weizhu Chen
Workshop
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
Samuel Marks · Max Tegmark
Workshop
Seeking Truth and Beauty in Flavor Physics with Machine Learning
Konstantin Matchev · Katia Matcheva · Pierre Ramond · Sarunas Verner
Workshop
ObEy Anything: Quantifiable Object-based Explainability without Ground Truth Annotations
William Ho · Lennart Schulze · Richard Zemel
Poster
Tue 8:45 REASONER: An Explainable Recommendation Dataset with Comprehensive Labeling Ground Truths
Xu Chen · Jingsen Zhang · Lei Wang · Quanyu Dai · Zhenhua Dong · Ruiming Tang · Rui Zhang · Li Chen · Xin Zhao · Ji-Rong Wen
Poster
Wed 15:00 Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
Kenneth Li · Oam Patel · Fernanda Viégas · Hanspeter Pfister · Martin Wattenberg