firstbacksecondback
661 Results
Poster
|
Wed 15:00 |
Bayesian Risk-Averse Q-Learning with Streaming Observations Yuhao Wang · Enlu Zhou |
|
Poster
|
Thu 15:00 |
Semantic HELM: A Human-Readable Memory for Reinforcement Learning Fabian Paischer · Thomas Adler · Markus Hofmarcher · Sepp Hochreiter |
|
Workshop
|
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization Bodhisattwa Prasad Majumder · Bhavana Dalvi Mishra · Peter A Jansen · Oyvind Tafjord · Niket Tandon · Li Zhang · Chris Callison-Burch · Peter Clark |
||
Poster
|
Wed 15:00 |
Double Gumbel Q-Learning David Yu-Tung Hui · Aaron Courville · Pierre-Luc Bacon |
|
Poster
|
Tue 8:45 |
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation GUOJUN XIONG · Jian Li |
|
Workshop
|
DGFN: Double Generative Flow Networks Elaine Lau · Nikhil Murali Vemgal · Doina Precup · Emmanuel Bengio |
||
Workshop
|
DGFN: Double Generative Flow Networks Elaine Lau · Nikhil Murali Vemgal · Doina Precup · Emmanuel Bengio |
||
Poster
|
Wed 8:45 |
Residual Q-Learning: Offline and Online Policy Customization without Value Chenran Li · Chen Tang · Haruki Nishimura · Jean Mercat · Masayoshi TOMIZUKA · Wei Zhan |
|
Workshop
|
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks Murtaza Dalal · Tarun Chiruvolu · Devendra Singh Chaplot · Russ Salakhutdinov |
||
Poster
|
Tue 8:45 |
Online Convex Optimization with Unbounded Memory Raunak Kumar · Sarah Dean · Robert Kleinberg |
|
Workshop
|
Cross-Entropy Estimators for Sequential Experiment Design with Reinforcement Learning Tom Blau · Iadine Chades · Amir Dezfouli · Daniel Steinberg · Edwin Bonilla |
||
Poster
|
Wed 8:45 |
Neural Modulation for Flash Memory: An Unsupervised Learning Framework for Improved Reliability Jonathan Zedaka · Elisha Halperin · Evgeny Blaichman · Amit Berman |