firstbacksecondback
21 Results
Workshop
|
Simulating Iterative Human-AI Interaction in Programming with LLMs Hussein Mozannar · Valerie Chen · Dennis Wei · Prasanna Sattigeri · Manish Nagireddy · Subhro Das · Ameet Talwalkar · David Sontag |
||
Workshop
|
Fri 14:15 |
Quality Diversity through Human Feedback Li Ding |
|
Affinity Workshop
|
Mon 13:30 |
Enhancing STEM education using Multimodal AI and Human in the Loop Feedback Karen DSouza · Pratibha Varma-Nelson · Shiaofen Fang · Snehasis Mukhopadhyay |
|
Workshop
|
Fri 9:15 |
Universal jailbreak backdoors from poisoned human feedback Florian Tramer |
|
Workshop
|
Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste · Usman Anwar · Robert Kirk · David Krueger |
||
Workshop
|
Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language Di Jin · Shikib Mehri · Devamanyu Hazarika · Aishwarya Padmakumar · SUNGJIN LEE · Yang Liu · Mahdi Namazifar |
||
Workshop
|
Quality Diversity through Human Feedback Li Ding · Jenny Zhang · Jeff Clune · Lee Spector · Joel Lehman |
||
Poster
|
Wed 8:45 |
Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback Minyoung Hwang · Gunmin Lee · Hogun Kee · Chan Woo Kim · Kyungjae Lee · Songhwai Oh |
|
Poster
|
Tue 8:45 |
Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback TaeHo Yoon · Kibeom Myoung · Keon Lee · Jaewoong Cho · Albert No · Ernest Ryu |
|
Workshop
|
Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization Prakamya Mishra · Zonghai Yao · shuwei chen · Beining Wang · Rohan Mittal · Hong Yu |
||
Workshop
|
Diversity from Human Feedback Ren-Jian Wang · Ke Xue · Yutong Wang · Peng Yang · Haobo Fu · Qiang Fu · Chao Qian |
||
Poster
|
Wed 8:45 |
Off-Policy Evaluation for Human Feedback Qitong Gao · Ge Gao · Juncheng Dong · Vahid Tarokh · Min Chi · Miroslav Pajic |