firstbacksecondback
69 Results
Workshop
|
Discovering Temporally-Aware Reinforcement Learning Algorithms Matthew T Jackson · Chris Lu · Louis Kirsch · Robert Lange · Shimon Whiteson · Jakob Foerster |
||
Workshop
|
Ever Evolving Evaluator (EV3): Towards Flexible and Reliable Meta-Optimization for Knowledge Distillation Li Ding · Masrour Zoghi · Guy Tennenholtz · Maryam Karimzadehgan |
||
Poster
|
Wed 15:00 |
Online Control for Meta-optimization Xinyi Chen · Elad Hazan |
|
Poster
|
Tue 8:45 |
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch Shangtong Zhang · Remi Tachet des Combes · Romain Laroche |
|
Poster
|
Thu 8:45 |
Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Method Constantine Caramanis · Dimitris Fotakis · Alkis Kalavasis · Vasilis Kontonis · Christos Tzamos |
|
Workshop
|
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs Zihan Zhou · Honghao Wei · Lei Ying |
||
Workshop
|
Associative Memories with Heavy-Tailed Data Vivien Cabannes · Elvis Dohmatob · Alberto Bietti |
||
Oral
|
Thu 8:30 |
Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Method Constantine Caramanis · Dimitris Fotakis · Alkis Kalavasis · Vasilis Kontonis · Christos Tzamos |
|
Poster
|
Wed 15:00 |
Optimistic Meta-Gradients Sebastian Flennerhag · Tom Zahavy · Brendan O'Donoghue · Hado van Hasselt · András György · Satinder Singh |
|
Workshop
|
AutoFT: Robust Fine-Tuning by Optimizing Hyperparameters on OOD Data Caroline Choi · Yoonho Lee · Annie Chen · Allan Zhou · Aditi Raghunathan · Chelsea Finn |
||
Poster
|
Tue 8:45 |
Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents wonje choi · Woo Kyung Kim · SeungHyun Kim · Honguk Woo |
|
Workshop
|
Continually Adapting Optimizers Improve Meta-Generalization Wenyi Wang · Louis Kirsch · Francesco Faccio · Mingchen Zhuge · Jürgen Schmidhuber |