firstbacksecondback
Filter by Keyword:
62 Results
Poster
|
Mon 21:00 |
On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems Kaiqing Zhang · Bin Hu · Tamer Basar |
|
Spotlight
|
Tue 19:30 |
Large-Scale Adversarial Training for Vision-and-Language Representation Learning Zhe Gan · Yen-Chun Chen · Linjie Li · Chen Zhu · Yu Cheng · Jingjing Liu |
|
Poster
|
Tue 21:00 |
Large-Scale Adversarial Training for Vision-and-Language Representation Learning Zhe Gan · Yen-Chun Chen · Linjie Li · Chen Zhu · Yu Cheng · Jingjing Liu |
|
Poster
|
Mon 21:00 |
Adversarial Counterfactual Learning and Evaluation for Recommender System Da Xu · Chuanwei Ruan · Evren Korpeoglu · Sushant Kumar · Kannan Achan |
|
Poster
|
Tue 9:00 |
Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot Jingtong Su · Yihang Chen · Tianle Cai · Tianhao Wu · Ruiqi Gao · Liwei Wang · Jason Lee |
|
Poster
|
Thu 9:00 |
f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning Xin Zhang · Yanhua Li · Ziming Zhang · Zhi-Li Zhang |
|
Poster
|
Tue 9:00 |
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics Parameswaran Kamalaruban · Yu-Ting Huang · Ya-Ping Hsieh · Paul Rolland · Cheng Shi · Volkan Cevher |
|
Poster
|
Thu 9:00 |
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization Paul Barde · Julien Roy · Wonseok Jeon · Joelle Pineau · Chris Pal · Derek Nowrouzezahrai |
|
Spotlight
|
Thu 7:30 |
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization Paul Barde · Julien Roy · Wonseok Jeon · Joelle Pineau · Chris Pal · Derek Nowrouzezahrai |
|
Poster
|
Wed 21:00 |
Unsupervised Learning of Object Landmarks via Self-Training Correspondence Dimitrios Mallis · Enrique Sanchez · Matthew Bell · Georgios Tzimiropoulos |
|
Poster
|
Wed 9:00 |
Why are Adaptive Methods Good for Attention Models? Jingzhao Zhang · Sai Praneeth Karimireddy · Andreas Veit · Seungyeon Kim · Sashank Reddi · Sanjiv Kumar · Suvrit Sra |
|
Poster
|
Thu 9:00 |
Kernelized information bottleneck leads to biologically plausible 3-factor Hebbian learning in deep networks Roman Pogodin · Peter E Latham |