firstbacksecondback
Filter by Keyword:
590 Results
Poster
|
Tue 8:30 |
Parallelizing Thompson Sampling Amin Karbasi · Vahab Mirrokni · Mohammad Shadravan |
|
Poster
|
Tue 8:30 |
Design of Experiments for Stochastic Contextual Linear Bandits Andrea Zanette · Kefan Dong · Jonathan N Lee · Emma Brunskill |
|
Workshop
|
Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning Stefan Radic Webster |
||
Poster
|
Thu 8:30 |
Collaborating with Humans without Human Data DJ Strouse · Kevin McKee · Matt Botvinick · Edward Hughes · Richard Everett |
|
Poster
|
Fri 8:30 |
Contrastive Active Inference Pietro Mazzaglia · Tim Verbelen · Bart Dhoedt |
|
Poster
|
Thu 8:30 |
Storchastic: A Framework for General Stochastic Automatic Differentiation Emile van Krieken · Jakub Tomczak · Annette Ten Teije |
|
Poster
|
Thu 0:30 |
Graph Differentiable Architecture Search with Structure Learning Yijian Qin · Xin Wang · Zeyang Zhang · Wenwu Zhu |
|
Poster
|
Tue 8:30 |
A unified framework for bandit multiple testing Ziyu Xu · Ruodu Wang · Aaditya Ramdas |
|
Poster
|
Tue 16:30 |
Understanding the Effect of Stochasticity in Policy Optimization Jincheng Mei · Bo Dai · Chenjun Xiao · Csaba Szepesvari · Dale Schuurmans |
|
Poster
|
Tue 16:30 |
Regime Switching Bandits Xiang Zhou · Yi Xiong · Ningyuan Chen · Xuefeng GAO |
|
Poster
|
Fri 8:30 |
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning Siyuan Zhang · Nan Jiang |
|
Poster
|
Tue 8:30 |
Monte Carlo Tree Search With Iteratively Refining State Abstractions Samuel Sokota · Caleb Y Ho · Zaheen Ahmad · J. Zico Kolter |