firstbacksecondback
74 Results
Poster
|
Wed 9:00 |
Double Thompson Sampling for Dueling Bandits Huasen Wu · Xin Liu |
|
Poster
|
Mon 9:00 |
Cooperative Inverse Reinforcement Learning Dylan Hadfield-Menell · Stuart J Russell · Pieter Abbeel · Anca Dragan |
|
Poster
|
Wed 9:00 |
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation Tejas Kulkarni · Karthik Narasimhan · Ardavan Saeedi · Josh Tenenbaum |
|
Poster
|
Mon 9:00 |
Learning to Communicate with Deep Multi-Agent Reinforcement Learning Jakob Foerster · Yannis Assael · Nando de Freitas · Shimon Whiteson |
|
Poster
|
Mon 9:00 |
Threshold Bandits, With and Without Censored Feedback Jacob D Abernethy · Kareem Amin · Ruihao Zhu |
|
Poster
|
Mon 9:00 |
Fairness in Learning: Classic and Contextual Bandits Matthew Joseph · Michael Kearns · Jamie Morgenstern · Aaron Roth |
|
Poster
|
Tue 9:00 |
Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits Vasilis Syrgkanis · Haipeng Luo · Akshay Krishnamurthy · Robert Schapire |
|
Poster
|
Tue 9:00 |
Efficient state-space modularization for planning: theory, behavioral and neural signatures Daniel McNamee · Daniel M Wolpert · Mate Lengyel |
|
Poster
|
Mon 9:00 |
Phased Exploration with Greedy Exploitation in Stochastic Combinatorial Partial Monitoring Games Sougata Chaudhuri · Ambuj Tewari |
|
Poster
|
Wed 9:00 |
Learning values across many orders of magnitude Hado van Hasselt · Arthur Guez · Arthur Guez · Matteo Hessel · Volodymyr Mnih · David Silver |
|
Poster
|
Wed 9:00 |
Learning under uncertainty: a comparison between R-W and Bayesian approach He Huang · Martin Paulus |
|
Poster
|
Mon 9:00 |
Strategic Attentive Writer for Learning Macro-Actions Alexander (Sasha) Vezhnevets · Volodymyr Mnih · Simon Osindero · Alex Graves · Oriol Vinyals · John Agapiou · koray kavukcuoglu |