firstbacksecondback
Filter by Keyword:
590 Results
Poster
|
Fri 8:30 |
Reward is enough for convex MDPs Tom Zahavy · Brendan O'Donoghue · Guillaume Desjardins · Satinder Singh |
|
Poster
|
Fri 8:30 |
Bridging the Imitation Gap by Adaptive Insubordination Luca Weihs · Unnat Jain · Iou-Jen Liu · Jordi Salvador · Svetlana Lazebnik · Aniruddha Kembhavi · Alex Schwing |
|
Poster
|
Thu 0:30 |
Decentralized Q-learning in Zero-sum Markov Games Muhammed Sayin · Kaiqing Zhang · David Leslie · Tamer Basar · Asuman Ozdaglar |
|
Poster
|
Tue 16:30 |
Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning Jinxin Liu · Hao Shen · Donglin Wang · Yachen Kang · Qiangxing Tian |
|
Poster
|
Wed 0:30 |
Policy Learning Using Weak Supervision Jingkang Wang · Hongyi Guo · Zhaowei Zhu · Yang Liu |
|
Poster
|
Thu 0:30 |
Faster Non-asymptotic Convergence for Double Q-learning Lin Zhao · Huaqing Xiong · Yingbin Liang |
|
Poster
|
Tue 16:30 |
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning Hiroki Furuta · Tadashi Kozuno · Tatsuya Matsushima · Yutaka Matsuo · Shixiang (Shane) Gu |
|
Poster
|
Fri 8:30 |
Towards Deeper Deep Reinforcement Learning with Spectral Normalization Nils Bjorck · Carla Gomes · Kilian Weinberger |
|
Poster
|
Thu 8:30 |
Offline Meta Reinforcement Learning -- Identifiability Challenges and Effective Data Collection Strategies Ron Dorfman · Idan Shenfeld · Aviv Tamar |
|
Poster
|
Thu 16:30 |
Heuristic-Guided Reinforcement Learning Ching-An Cheng · Andrey Kolobov · Adith Swaminathan |
|
Poster
|
Thu 0:30 |
Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks Rong Zhu · Mattia Rigotti |
|
Poster
|
Tue 16:30 |
Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality Songyuan Zhang · ZHANGJIE CAO · Dorsa Sadigh · Yanan Sui |