Timezone: »
The growing prospect of deep reinforcement learning (DRL) being used in cyber-physical systems has raised concerns around safety and robustness of autonomous agents. Recent work on generating adversarial attacks have shown that it is computationally feasible for a bad actor to fool a DRL policy into behaving sub optimally. Although certain adversarial attacks with specific attack models have been addressed, most studies are only interested in off-line optimization in the data space (e.g., example fitting, distillation). This paper introduces a Meta-Learned Advantage Hierarchy (MLAH) framework that is attack model-agnostic and more suited to reinforcement learning, via handling the attacks in the decision space (as opposed to data space) and directly mitigating learned bias introduced by the adversary. In MLAH, we learn separate sub-policies (nominal and adversarial) in an online manner, as guided by a supervisory master agent that detects the presence of the adversary by leveraging the advantage function for the sub-policies. We demonstrate that the proposed algorithm enables policy learning with significantly lower bias as compared to the state-of-the-art policy learning approaches even in the presence of heavy state information attacks. We present algorithm analysis and simulation results using popular OpenAI Gym environments.
Author Information
Aaron Havens (University of Illinois Urbana-Champaign)
I am a first-year graduate student in Aerospace Engineering working with Prof. Girish Chowdhary on robust decision making and control. I'm interested in making intelligent systems more adaptive and guaranteeing safety.
Zhanhong Jiang (Iowa State University)
Soumik Sarkar (Iowa State University)
More from the Same Authors
-
2021 : Cross-Modal Virtual Sensing for Combustion Instability Monitoring »
Tryambak Gangopadhyay · Vikram Ramanan · Chakravarthy S.R. · Soumik Sarkar -
2022 : 3D Reconstruction of Protein Complex Structures Using Synthesized Multi-View AFM Images »
Jaydeep Rade · Soumik Sarkar · Anwesha Sarkar · Adarsh Krishnamurthy -
2022 : Enhancing System-level Safety in Autonomous Driving via Feedback Learning »
Sin Yong Tan · Weisi Fan · Qisai Liu · Tichakorn Wongpiromsarn · Soumik Sarkar -
2022 : DriveCLIP: Zero-shot transfer for distracted driving activity understanding using CLIP »
Md Zahid Hasan · Ameya Joshi · Mohammed Shaiqur Rahman · Venkatachalapathy Archana · Anuj Sharma · Chinmay Hegde · Soumik Sarkar -
2022 : Generative Design of Material Microstructures for Organic Solar Cells using Diffusion Models »
Ethan Herron · Xian Yeow Lee · Aditya Balu · Baskar Ganapathysubramanian · Soumik Sarkar · Adarsh Krishnamurthy -
2022 : Communication-efficient Decentralized Deep Learning »
Fateme Fotouhi · Aditya Balu · Zhanhong Jiang · Yasaman Esfandiari · Salman Jahani · Soumik Sarkar -
2022 : A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations »
Qisai Liu · Xian Yeow Lee · Soumik Sarkar -
2023 Poster: Exploiting Connections between Lipschitz Structures for Certifiably Robust Deep Equilibrium Models »
Aaron Havens · Alexandre Araujo · Siddharth Garg · Farshad Khorrami · Bin Hu -
2019 : Poster Session »
Matthia Sabatelli · Adam Stooke · Amir Abdi · Paulo Rauber · Leonard Adolphs · Ian Osband · Hardik Meisheri · Karol Kurach · Johannes Ackermann · Matt Benatan · GUO ZHANG · Chen Tessler · Dinghan Shen · Mikayel Samvelyan · Riashat Islam · Murtaza Dalal · Luke Harries · Andrey Kurenkov · Konrad Żołna · Sudeep Dasari · Kristian Hartikainen · Ofir Nachum · Kimin Lee · Markus Holzleitner · Vu Nguyen · Francis Song · Christopher Grimm · Felipe Leno da Silva · Yuping Luo · Yifan Wu · Alex Lee · Thomas Paine · Wei-Yang Qu · Daniel Graves · Yannis Flet-Berliac · Yunhao Tang · Suraj Nair · Matthew Hausknecht · Akhil Bagaria · Simon Schmitt · Bowen Baker · Paavo Parmas · Benjamin Eysenbach · Lisa Lee · Siyu Lin · Daniel Seita · Abhishek Gupta · Riley Simmons-Edler · Yijie Guo · Kevin Corder · Vikash Kumar · Scott Fujimoto · Adam Lerer · Ignasi Clavera Gilaberte · Nicholas Rhinehart · Ashvin Nair · Ge Yang · Lingxiao Wang · Sungryull Sohn · J. Fernando Hernandez-Garcia · Xian Yeow Lee · Rupesh Srivastava · Khimya Khetarpal · Chenjun Xiao · Luckeciano Carvalho Melo · Rishabh Agarwal · Tianhe Yu · Glen Berseth · Devendra Singh Chaplot · Jie Tang · Anirudh Srinivasan · Tharun Kumar Reddy Medini · Aaron Havens · Misha Laskin · Asier Mujika · Rohan Saphal · Joseph Marino · Alex Ray · Joshua Achiam · Ajay Mandlekar · Zhuang Liu · Danijar Hafner · Zhiwen Tang · Ted Xiao · Michael Walton · Jeff Druce · Ferran Alet · Zhang-Wei Hong · Stephanie Chan · Anusha Nagabandi · Hao Liu · Hao Sun · Ge Liu · Dinesh Jayaraman · John Co-Reyes · Sophia Sanborn -
2017 Poster: Collaborative Deep Learning in Fixed Topology Networks »
Zhanhong Jiang · Aditya Balu · Chinmay Hegde · Soumik Sarkar