firstbacksecondback
13 Results
Workshop
|
Simulating Iterative Human-AI Interaction in Programming with LLMs Hussein Mozannar · Valerie Chen · Dennis Wei · Prasanna Sattigeri · Manish Nagireddy · Subhro Das · Ameet Talwalkar · David Sontag |
||
Workshop
|
Diversity from Human Feedback Ren-Jian Wang · Ke Xue · Yutong Wang · Peng Yang · Haobo Fu · Qiang Fu · Chao Qian |
||
Affinity Workshop
|
Mon 13:30 |
Enhancing STEM education using Multimodal AI and Human in the Loop Feedback Karen DSouza · Pratibha Varma-Nelson · Shiaofen Fang · Snehasis Mukhopadhyay |
|
Workshop
|
Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov · Pierluca D'Oro · Shagun Sodhani · Roberta Raileanu · Pierre-Luc Bacon · Pascal Vincent · Amy Zhang · Mikael Henaff |
||
Workshop
|
Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov · Pierluca D'Oro · Shagun Sodhani · Roberta Raileanu · Pierre-Luc Bacon · Pascal Vincent · Amy Zhang · Mikael Henaff |
||
Poster
|
Wed 8:45 |
Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback Minyoung Hwang · Gunmin Lee · Hogun Kee · Chan Woo Kim · Kyungjae Lee · Songhwai Oh |
|
Poster
|
Tue 15:15 |
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Zeqiu Wu · Yushi Hu · Weijia Shi · Nouha Dziri · Alane Suhr · Prithviraj Ammanabrolu · Noah Smith · Mari Ostendorf · Hannaneh Hajishirzi |
|
Workshop
|
Quality Diversity through Human Feedback Li Ding · Jenny Zhang · Jeff Clune · Lee Spector · Joel Lehman |
||
Workshop
|
Motif: Intrinsic Motivation from Artificial Intelligence Feedback Martin Klissarov · Pierluca D'Oro · Shagun Sodhani · Roberta Raileanu · Pierre-Luc Bacon · Pascal Vincent · Amy Zhang · Mikael Henaff |
||
Workshop
|
Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language Di Jin · Shikib Mehri · Devamanyu Hazarika · Aishwarya Padmakumar · SUNGJIN LEE · Yang Liu · Mahdi Namazifar |
||
Workshop
|
Reinforcement Learning in Control Theory: A New Approach to Mathematical Problem Solving Kala Bidi · Jean-Michel Coron · Amaury Hayat · Nathan Lichtlé |
||
Workshop
|
Quality-Diversity through AI Feedback Herbie Bradley · Andrew Dai · Hannah Teufel · Jenny Zhang · Koen Oostermeijer · Marco Bellagente · Jeff Clune · Kenneth Stanley · Grégory Schott · Joel Lehman |