firstbacksecondback
553 Results
Workshop
|
Aligning LLMs using Reinforcement Learning from Market Feedback (RLMF) for Regime Adaptation Raeid Saqur |
||
Workshop
|
Functional Alignment of Protein Language Models via Reinforcement Learning with Experimental Feedback Nathaniel Blalock · Srinath Seshadri · Philip Romero |
||
Poster
|
GUIDE: Real-Time Human-Shaped Agents Lingyu Zhang · Zhengran Ji · Nicholas Waytowich · Boyuan Chen |
||
Poster
|
Fri 11:00 |
SEL-BALD: Deep Bayesian Active Learning with Selective Labels Ruijiang Gao · Mingzhang Yin · Maytal Saar-Tsechansky |
|
Poster
|
Thu 16:30 |
Learning Human-like Representations to Enable Learning Human Values Andrea Wynn · Ilia Sucholutsky · Tom Griffiths |
|
Poster
|
Thu 16:30 |
Learning to Assist Humans without Inferring Rewards Vivek Myers · Evan Ellis · Sergey Levine · Benjamin Eysenbach · Anca Dragan |
|
Poster
|
Wed 16:30 |
Learning to Cooperate with Humans using Generative Agents Yancheng Liang · Daphne Chen · Abhishek Gupta · Simon Du · Natasha Jaques |
|
Oral
|
Wed 15:50 |
The Sample-Communication Complexity Trade-off in Federated Q-Learning Sudeep Salgia · Yuejie Chi |
|
Poster
|
Fri 11:00 |
Regularized Q-Learning Han-Dong Lim · Donghwan Lee |
|
Poster
|
Fri 11:00 |
Exclusively Penalized Q-learning for Offline Reinforcement Learning Junghyuk Yeom · Yonghyeon Jo · Jeongmo Kim · Sanghyeon Lee · Seungyul Han |
|
Poster
|
Thu 11:00 |
Periodic agent-state based Q-learning for POMDPs Amit Sinha · Matthieu Geist · Aditya Mahajan |
|
Poster
|
Fri 11:00 |
Inverse Factorized Soft Q-Learning for Cooperative Multi-agent Imitation Learning The Viet Bui · Tien Mai · Thanh Nguyen |