firstbacksecondback
Filter by Keyword:
590 Results
Poster
|
Thu 8:30 |
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation Nicklas Hansen · Hao Su · Xiaolong Wang |
|
Poster
|
Thu 8:30 |
TAAC: Temporally Abstract Actor-Critic for Continuous Control Haonan Yu · Wei Xu · Haichao Zhang |
|
Oral Session
|
Fri 16:00 |
Oral Session 5: Reinforcement Learning and Planning |
|
Poster
|
Fri 8:30 |
The Value of Information When Deciding What to Learn Dilip Arumugam · Benjamin Van Roy |
|
Poster
|
Thu 8:30 |
(Almost) Free Incentivized Exploration from Decentralized Learning Agents Chengshuai Shi · Haifeng Xu · Wei Xiong · Cong Shen |
|
Poster
|
Thu 0:30 |
Batched Thompson Sampling Cem Kalkanli · Ayfer Ozgur |
|
Poster
|
Fri 8:30 |
Hierarchical Skills for Efficient Exploration Jonas Gehring · Gabriel Synnaeve · Andreas Krause · Nicolas Usunier |
|
Poster
|
Thu 0:30 |
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning Junsu Kim · Younggyo Seo · Jinwoo Shin |
|
Poster
|
Wed 0:30 |
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization Jianhao Wang · Zhizhou Ren · Beining Han · Jianing Ye · Chongjie Zhang |
|
Poster
|
Tue 8:30 |
Bandit Quickest Changepoint Detection Aditya Gopalan · Braghadeesh Lakshminarayanan · Venkatesh Saligrama |
|
Poster
|
Tue 8:30 |
Fair Exploration via Axiomatic Bargaining Jackie Baek · Vivek Farias |
|
Poster
|
Tue 16:30 |
ELLA: Exploration through Learned Language Abstraction Suvir Mirchandani · Siddharth Karamcheti · Dorsa Sadigh |