Timezone: »

Towards Safe Global Optimality in Robot Learning with GoSafe
Bhavya Sukhija · Matteo Turchetta · Andreas Krause · Sebastian Trimpe · Dominik Baumann

When learning control policies from trial and error directly on hardware systems, ensuring safety is crucial to avoid costly damage to the system. Existing model-free reinforcement learning methods that guarantee safety during exploration are limited to optima within the safe region connected to a safe initialization, which may be worse than the safe globally optimal solution. In this work, we present GoSafe, an algorithm that can search for globally optimal policies while guaranteeing safety and demonstrate its applicability in experiments on a real robot arm.

Author Information

Bhavya Sukhija (ETH Zürich)
Matteo Turchetta (ETH Zurich)
Andreas Krause (ETH Zurich)
Sebastian Trimpe (RWTH Aachen University)
Dominik Baumann (RWTH Aachen University)

More from the Same Authors