35 Results
Workshop
|
Efficient Reinforcement Learning via Large Language Model-based Search Siddhant Bhambri · Amrita Bhattacharjee · Huan Liu · Subbarao Kambhampati |
||
Workshop
|
Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models Kefan Song · Jin Yao · Shangtong Zhang |
||
Workshop
|
Sat 17:27 |
Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models Kefan Song · Jin Yao · Shangtong Zhang |
|
Poster
|
Fri 11:00 |
DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data Hanyang Chen · Yang Jiang · Shengnan Guo · Xiaowei Mao · Youfang Lin · Huaiyu Wan |
|
Poster
|
Thu 11:00 |
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback Jiachen Li · Weixi Feng · Tsu-Jui Fu · Xinyi Wang · S Basu · Wenhu Chen · William Yang Wang |
|
Poster
|
Fri 11:00 |
Noise Contrastive Alignment of Language Models with Explicit Rewards Huayu Chen · Guande He · Lifan Yuan · Ganqu Cui · Hang Su · Jun Zhu |
|
Poster
|
Thu 16:30 |
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling Yuchun Miao · Sen Zhang · Liang Ding · Rong Bao · Lefei Zhang · Dacheng Tao |
|
Poster
|
Thu 16:30 |
ReMoDetect: Reward Models Recognize Aligned LLM's Generations Hyunseok Lee · Jihoon Tack · Jinwoo Shin |
|
Workshop
|
GFlowNet Pretraining with Inexpensive Rewards Mohit Pandey · Gopeshh Subbaraj · Emmanuel Bengio |
||
Workshop
|
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev |
||
Workshop
|
Fine-Tuning Discrete Diffusion Models via Reward Optimization: Applications to DNA and Protein Design Chenyu Wang · Masatoshi Uehara · Yichun He · Amy Wang · Tommaso Biancalani · Avantika Lal · Tommi Jaakkola · Sergey Levine · Hanchen Wang · Aviv Regev |
||
Workshop
|
Sun 12:20 |
GFlowNet Pretraining with Inexpensive Rewards Mohit Pandey · Gopeshh Subbaraj · Emmanuel Bengio |