firstbacksecondback
54 Results
Poster
|
Wed 11:00 |
Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning Yanting Miao · William Loh · Suraj Kothawade · Pascal Poupart · Abdullah Rashwan · Yeqing Li |
|
Workshop
|
Estimating Effects of Tokens in Preference Learning Hsiao-Ru Pan · Maximilian Mordig · Bernhard Schölkopf |
||
Oral
|
Wed 15:30 |
Enhancing Preference-based Linear Bandits via Human Response Time Shen Li · Yuyang Zhang · Zhaolin Ren · Claire Liang · Na Li · Julie A Shah |
|
Poster
|
Wed 16:30 |
Enhancing Preference-based Linear Bandits via Human Response Time Shen Li · Yuyang Zhang · Zhaolin Ren · Claire Liang · Na Li · Julie A Shah |
|
Workshop
|
Pareto-Optimal Learning from Preferences with Hidden Context Ryan Boldi · Li Ding · Lee Spector · Scott Niekum |
||
Workshop
|
Adaptive Alignment: Dynamic Preference Adjustments via Multi-Objective Reinforcement Learning for Pluralistic AI Hadassah Harland · Richard Dazeley · Peter Vamplew · Hashini Senaratne · Bahareh nakisa · Francisco Cruz |
||
Poster
|
Fri 16:30 |
Perplexity-aware Correction for Robust Alignment with Noisy Preferences Keyi Kong · Xilie Xu · Di Wang · Jingfeng ZHANG · Mohan Kankanhalli |
|
Poster
|
Thu 16:30 |
Queueing Matching Bandits with Preference Feedback Jung-hun Kim · Min-hwan Oh |
|
Poster
|
Wed 11:00 |
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment Xudong Yu · Chenjia Bai · Haoran He · Changhong Wang · Xuelong Li |
|
Workshop
|
Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment Chenliang Li · Siliang Zeng · Zeyi Liao · Jiaxiang Li · Dongyeop Kang · Alfredo Garcia · Mingyi Hong |
||
Poster
|
Fri 11:00 |
Deep Bayesian Active Learning for Preference Modeling in Large Language Models Luckeciano Carvalho Melo · Panagiotis Tigas · Alessandro Abate · Yarin Gal |
|
Workshop
|
Preference-based Multi-Objective Bayesian Optimization with Gradients Joshua Hang Sai Ip · Ankush Chakrabarty · Ali Mesbah · Diego Romeres |