firstbacksecondback
2 Results
Workshop
|
Memory-Efficient Reinforcement Learning with Priority based on Surprise and On-policyness Ryosuke Unno · Yoshimasa Tsuruoka |
||
Poster
|
Learning to Constrain Policy Optimization with Virtual Trust Region Thai Hung Le · Thommen Karimpanal George · Majid Abdolshah · Dung Nguyen · Kien Do · Sunil Gupta · Svetha Venkatesh |