Search All 2024 Events
 

14 Results

Page 1 of 2
Poster
Fri 11:00 Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen · Guande He · Lifan Yuan · Ganqu Cui · Hang Su · Jun Zhu
Poster
Fri 11:00 SimPO: Simple Preference Optimization with a Reference-Free Reward
Yu Meng · Mengzhou Xia · Danqi Chen
Poster
Fri 16:30 Mitigating Reward Overoptimization via Lightweight Uncertainty Estimation
Xiaoying Zhang · Jean-Francois Ton · Wei Shen · Hongning Wang · Yang Liu
Poster
Thu 16:30 ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Hyunseok Lee · Jihoon Tack · Jinwoo Shin
Poster
Thu 16:30 InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
Yuchun Miao · Sen Zhang · Liang Ding · Rong Bao · Lefei Zhang · Dacheng Tao
Workshop
Improving LLM Generation with Inverse and Forward Alignment: Reward Modeling, Prompting, Fine-Tuning, and Inference-Time Optimization
Hao Sun · Thomas Pouplin · Nicolás Astorga · Tennison Liu · Mihaela van der Schaar
Workshop
Diverse and Effective Red Teaming with Auto-generated Rewards and Multi-step Reinforcement Learning
Alex Beutel · Kai Xiao · Johannes Heidecke · Lilian Weng
Poster
Thu 11:00 ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
Luca Eyring · Shyamgopal Karthik · Karsten Roth · Alexey Dosovitskiy · Zeynep Akata
Workshop
Generative Verifiers: Reward Modeling as Next-Token Prediction
Lunjun Zhang · Arian Hosseini · Hritik Bansal · Mehran Kazemi · Aviral Kumar · Rishabh Agarwal
Poster
Thu 11:00 Learning Goal-Conditioned Representations for Language Reward Models
Vaskar Nath · Dylan Slack · Jeff Da · Yuntao Ma · Hugh Zhang · Spencer Whitehead · Sean Hendryx
Workshop
Prioritization Strategies for LLM-Designed Restless Bandit Rewards in Public Health
Shresth Verma · Niclas Boehmer · Lingkai Kong · Milind Tambe