74 Results
Poster | Mon 9:00 | Generating Long-term Trajectories Using Deep Hierarchical Networks | Stephan Zheng · Yisong Yue · Patrick Lucey
Poster | Wed 9:00 | Linear Contextual Bandits with Knapsacks | Shipra Agrawal · Nikhil Devanur
Poster | Wed 9:00 | Refined Lower Bounds for Adversarial Bandits | Sébastien Gerchinovitz · Tor Lattimore
Poster | Tue 9:00 | Learning Multiagent Communication with Backpropagation | Sainbayar Sukhbaatar · Arthur Szlam · Rob Fergus
Poster | Wed 9:00 | Guided Policy Search via Approximate Mirror Descent | William H Montgomery · Sergey Levine
Poster | Wed 9:00 | Unifying Count-Based Exploration and Intrinsic Motivation | Marc Bellemare · Sriram Srinivasan · Georg Ostrovski · Tom Schaul · David Saxton · Remi Munos
Poster | Tue 9:00 | An algorithm for L1 nearest neighbor search via monotonic embedding | Xinan Wang · Sanjoy Dasgupta
Poster | Mon 9:00 | Safe Exploration in Finite Markov Decision Processes with Gaussian Processes | Matteo Turchetta · Felix Berkenkamp · Andreas Krause
Poster | Mon 9:00 | Multi-armed Bandits: Competing with Optimal Sequences | Zohar Karnin · Oren Anava
Oral | Tue 0:50 | Value Iteration Networks | Aviv Tamar · Sergey Levine · Pieter Abbeel · Yi Wu · Garrett Thomas
Poster | Tue 9:00 | Value Iteration Networks | Aviv Tamar · Sergey Levine · Pieter Abbeel · Yi Wu · Garrett Thomas
Poster | Mon 9:00 | Catching heuristics are optimal control policies | Boris Belousov · Gerhard Neumann · Constantin Rothkopf · Jan Peters