Search All 2024 Events
 

35 Results

Workshop
Efficient Reinforcement Learning via Large Language Model-based Search
Siddhant Bhambri · Amrita Bhattacharjee · Huan Liu · Subbarao Kambhampati
Workshop
Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
Kefan Song · Jin Yao · Shangtong Zhang
Workshop
Sat 17:27 Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
Kefan Song · Jin Yao · Shangtong Zhang
Poster
Fri 11:00 DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data
Hanyang Chen · Yang Jiang · Shengnan Guo · Xiaowei Mao · Youfang Lin · Huaiyu Wan
Poster
Thu 11:00 T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li · Weixi Feng · Tsu-Jui Fu · Xinyi Wang · S Basu · Wenhu Chen · William Yang Wang
Poster
Fri 11:00 Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen · Guande He · Lifan Yuan · Ganqu Cui · Hang Su · Jun Zhu
Poster
Thu 16:30 InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
Yuchun Miao · Sen Zhang · Liang Ding · Rong Bao · Lefei Zhang · Dacheng Tao
Poster
Thu 16:30 ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Hyunseok Lee · Jihoon Tack · Jinwoo Shin
Workshop
GFlowNet Pretraining with Inexpensive Rewards
Mohit Pandey · Gopeshh Subbaraj · Emmanuel Bengio
Workshop
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev
Workshop
Fine-Tuning Discrete Diffusion Models via Reward Optimization: Applications to DNA and Protein Design
Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev
Workshop
Sun 12:20 GFlowNet Pretraining with Inexpensive Rewards
Mohit Pandey · Gopeshh Subbaraj · Emmanuel Bengio