Poster
|
Thu 9:00
|
Defining and Characterizing Reward Gaming
Joar Skalse · Nikolaus Howe · Dmitrii Krasheninnikov · David Krueger
|
|
Poster
|
Wed 14:00
|
Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget
Jasmin Brandt · Viktor Bengs · Björn Haddenhorst · Eyke Hüllermeier
|
|
Poster
|
Wed 14:00
|
On A Mallows-type Model For (Ranked) Choices
Yifan Feng · Yuxuan Tang
|
|
Poster
|
Tue 14:00
|
Expected Frequency Matrices of Elections: Computation, Geometry, and Preference Learning
Niclas Boehmer · Robert Bredereck · Edith Elkind · Piotr Faliszewski · Stanisław Szufa
|
|
Poster
|
|
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Zhizhou Ren · Anji Liu · Yitao Liang · Jian Peng · Jianzhu Ma
|
|
Workshop
|
|
PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm
Toygun Basaklar · Suat Gumussoy · Umit Ogras
|
|
Poster
|
Wed 9:00
|
One for All: Simultaneous Metric and Preference Learning over Multiple Users
Gregory Canal · Blake Mason · Ramya Korlakai Vinayak · Robert Nowak
|
|
Poster
|
|
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
Runze Liu · Fengshuo Bai · Yali Du · Yaodong Yang
|
|