Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

24 Results

<<   <   Page 1 of 2   >   >>
Poster
Wed 8:45 Human-Aligned Calibration for AI-Assisted Decision Making
Nina Corvelo Benz · Manuel Rodriguez
Poster
Thu 15:00 Self-supervised video pretraining yields robust and more human-aligned visual representations
Nikhil Parthasarathy · S. M. Ali Eslami · Joao Carreira · Olivier Henaff
Poster
Wed 8:45 Alignment with human representations supports robust few-shot learning
Ilia Sucholutsky · Tom Griffiths
Workshop
Fri 7:50 #16: Machine Theory of Mind and the Structure of Human Values
Paul de Font-Reaulx
Workshop
Fri 7:50 #12: Concept Alignment
Sunayana Rane · Polyphony J. Bruna · Ilia Sucholutsky · Christopher T Kello · Tom Griffiths
Poster
Wed 8:45 Aligning Language Models with Human Preferences via a Bayesian Approach
Jiashuo WANG · Haozhao Wang · Shichao Sun · Wenjie Li
Poster
Tue 8:45 MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie · Yuhui Zhang · Atharva Shailesh Amdekar · Chris Piech · Tatsunori Hashimoto · Tobias Gerstenberg
Workshop
Fri 7:50 #01: MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks
Allen Nie · Yuhui Zhang · Atharva Shailesh Amdekar · Chris Piech · Tatsunori Hashimoto · Tobias Gerstenberg
Workshop
Ecological data and objectives align deep neural network representations with humans
Akash Nagaraj · Alekh Karkada Ashok · Drew Linsley · Francis Lewis · Peisen Zhou · Thomas Serre
Workshop
Ecological data and objectives align deep neural network representations with humans
Akash Nagaraj · Alekh Karkada Ashok · Drew Linsley · Francis Lewis · Peisen Zhou · Thomas Serre
Poster
Thu 15:00 VisAlign: Dataset for Measuring the Alignment between AI and Humans in Visual Perception
Jiyoung Lee · Seungho Kim · Seunghyun Won · Joonseok Lee · Marzyeh Ghassemi · James Thorne · Jaeseok Choi · O-Kil Kwon · Edward Choi
Poster
Thu 15:00 BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
Jiaming Ji · Mickel Liu · Josef Dai · Xuehai Pan · Chi Zhang · Ce Bian · Boyuan Chen · Ruiyang Sun · Yizhou Wang · Yaodong Yang