firstbacksecondback
54 Results
Workshop
|
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning Yuxi Xie · Anirudh Goyal · Wenyue Zheng · Min-Yen Kan · Timothy Lillicrap · Kenji Kawaguchi · Michael Qizhe Shieh |
||
Workshop
|
Bayesian Optimization of High-dimensional Outputs with Human Feedback Qing Feng · Zhiyuan Jerry Lin · Yujia Zhang · Ben Letham · Jelena Markovic-Voronov · Ryan-Rhys Griffiths · Peter Frazier · Eytan Bakshy |
||
Poster
|
Wed 16:30 |
Multi-turn Reinforcement Learning with Preference Human Feedback Lior Shani · Aviv Rosenberg · Asaf Cassel · Oran Lang · Daniele Calandriello · Avital Zipori · Hila Noga · Orgad Keller · Bilal Piot · Idan Szpektor · Avinatan Hassidim · Yossi Matias · Remi Munos |
|
Poster
|
Thu 11:00 |
FERERO: A Flexible Framework for Preference-Guided Multi-Objective Learning Lisha Chen · A Saif · Yanning Shen · Tianyi Chen |
|
Poster
|
Wed 11:00 |
Mobility-LLM: Learning Visiting Intentions and Travel Preference from Human Mobility Data with Large Language Models Letian Gong · Yan Lin · zxy · Yiwen Lu · Xuedi Han · Yichen Liu · Shengnan Guo · Youfang Lin · Huaiyu Wan |
|
Poster
|
Wed 11:00 |
Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk · Youssef Mroueh · Brian Belgodere · Mattia Rigotti · Apoorva Nitsure · Mikhail Yurochkin · Kristjan Greenewald · Jiri Navratil · Jarret Ross |