Workshop
|
|
Chain-of-Thought Reasoning is a Policy Improvement Operator
Hugh Zhang · David Parkes
|
|
Poster
|
Thu 8:45
|
State-Action Similarity-Based Representations for Off-Policy Evaluation
Brahma Pavse · Josiah Hanna
|
|
Workshop
|
|
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey · Ankit Shah · George Konidaris
|
|
Poster
|
Wed 8:45
|
f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences
Siddhant Agarwal · Ishan Durugkar · Peter Stone · Amy Zhang
|
|
Workshop
|
Sat 8:30
|
Learning General Policies and Sketches
Hector Geffner
|
|
Poster
|
Tue 8:45
|
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates
Guangchen Lan · Han Wang · James Anderson · Christopher Brinton · Vaneet Aggarwal
|
|
Poster
|
Wed 8:45
|
Fractal Landscapes in Policy Optimization
Tao Wang · Sylvia Herbert · Sicun Gao
|
|
Poster
|
Thu 8:45
|
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
Ruiqi Zhang · Andrea Zanette
|
|
Poster
|
Tue 15:15
|
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li · Chen GONG · Qiang He · Xinwen Hou
|
|
Workshop
|
|
Leveraging Behavioral Cloning for Representation Alignment in Cross-Domain Policy Transfer
Hayato Watahiki · Ryo Iwase · Ryosuke Unno · Yoshimasa Tsuruoka
|
|
Poster
|
Wed 15:00
|
Optimal and Fair Encouragement Policy Evaluation and Learning
Angela Zhou
|
|
Poster
|
Tue 8:45
|
Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents
wonje choi · Woo Kyung Kim · SeungHyun Kim · Honguk Woo
|
|