Programming manipulation behaviors can become increasingly difficult with a growing number and complexity of manipulation tasks, particularly in a dynamic and unstructured environment. Recent progress in unsupervised skill discovery algorithms has shown great promise in learning an extensive collection of behaviors without extrinsic supervision. On the other hand, safety is one of the most critical factors for real-world robot applications. As skill discovery methods typically encourage exploratory and dynamic behaviors, it can often be the case that a large portion of learned skills remain too dangerous and unsafe. In this paper, we introduce the novel problem of Safety-Aware Skill Discovery, which aims to learn, in a task-agnostic fashion, a repertoire of reusable skills that are inherently safe to be composed for solving downstream tasks. We present a computationally tractable algorithm that learns a latent-conditioned skill policy that maximizes intrinsic rewards regularized with a safety-critic that can model any user-defined safety constraints. Using the pretrained safe skill repertoire, hierarchical reinforcement learning can solve multiple downstream tasks without the need for explicit consideration of safety during training and testing. We evaluate our algorithm on a collection of force-controlled robotic manipulation tasks in simulation and show promising downstream task performance while satisfying safety constraints.
Since the discovered manipulation skills inherently satisfies the safety constraints,
one can perform safe-teleoperation in a zero-shot manner,
just by selecting and composing appropriate skills online!
@inproceedings{kim2023safeskill,
title={Safety-Aware Unsupervised Skill Discovery},
author={Kim, Sunin and Kwon, Jaewoon and Lee, Taeyoon and Park, Younghyo and Perez, Julien},
booktitle={2023 International Conference on Robotics and Automation (ICRA)},
year={2022},
organization={IEEE}
}