firstbacksecondback
55 Results
Workshop
|
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning Chongyu Fan · Jiancheng Liu · Licong Lin · Jinghan Jia · Ruiqi Zhang · Song Mei · Sijia Liu |
||
Workshop
|
Sun 14:55 |
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring Jiazheng Li · Hainiu Xu · ZHAOYUE SUN · Yuxiang Zhou · David West · Cesare Aloisi · Yulan He |
|
Workshop
|
Accelerated Preference Optimization for Large Language Model Alignment Jiafan He · Huizhuo Yuan · Quanquan Gu |
||
Workshop
|
Dueling in the Dark: An Efficient and Optimal Mirror Descent Approach for Online Optimization with Adversarial Preferences Aadirupa Saha · Barry-John Theobald · Yonathan Efroni |
||
Poster
|
Thu 16:30 |
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization Xiangxin Zhou · Dongyu Xue · Ruizhe Chen · Zaixiang Zheng · Liang Wang · Quanquan Gu |
|
Poster
|
Thu 16:30 |
-DPO: Direct Preference Optimization with Dynamic Junkang Wu · Yuexiang Xie · Zhengyi Yang · Jiancan Wu · Jinyang Gao · Bolin Ding · Xiang Wang · Xiangnan He |
|
Poster
|
Wed 16:30 |
Optimal Design for Human Preference Elicitation Subhojyoti Mukherjee · Anusha Lalitha · Kousha Kalantari · Aniket Anand Deshmukh · Ge Liu · Yifei Ma · Branislav Kveton |
|
Poster
|
Thu 11:00 |
Geometric-Averaged Preference Optimization for Soft Preference Labels Hiroki Furuta · Kuang-Huei Lee · Shixiang (Shane) Gu · Yutaka Matsuo · Aleksandra Faust · Heiga Zen · Izzeddin Gur |
|
Workshop
|
Multi-Step Preference Optimization via Two-Player Markov Games Yongtao Wu · Luca Viano · Yihang Chen · Zhenyu Zhu · Quanquan Gu · Volkan Cevher |
||
Poster
|
Fri 11:00 |
SimPO: Simple Preference Optimization with a Reference-Free Reward Yu Meng · Mengzhou Xia · Danqi Chen |
|
Workshop
|
Sat 10:15 |
Multi-Step Preference Optimization via Two-Player Markov Games Yongtao Wu · Luca Viano · Yihang Chen · Zhenyu Zhu · Quanquan Gu · Volkan Cevher |
|
Workshop
|
Optimizing Multi-Round Enhanced Training in Diffusion Models for Improved Preference Understanding Yangfan He · Jianhui Wang · Haoyuan Li · Sida Li · Li Sun · TIANYU SHI |