Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation

Existing neural network-based autonomous systems are shown to be vulnerable against adversarial attacks, therefore sophisticated evaluation of their robustness is of great importance. However, evaluating the robustness under the worst-case scenarios based on known attacks is not comprehensive, not t...

Full description

Saved in:

Bibliographic Details
Published in	IEEE robotics and automation letters Vol. 6; no. 2; pp. 1551 - 1558
Main Authors	Ding, Wenhao, Chen, Baiming, Li, Bo, Eun, Kim Ji, Zhao, Ding
Format	Journal Article
Language	English
Published	Piscataway IEEE 01.04.2021 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Adaptation models Algorithms Data models Decision making Generators Machine learning Neural networks reinforcement learning Robot safety Robustness Safety critical Sampling semantic scene understanding Task analysis Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Existing neural network-based autonomous systems are shown to be vulnerable against adversarial attacks, therefore sophisticated evaluation of their robustness is of great importance. However, evaluating the robustness under the worst-case scenarios based on known attacks is not comprehensive, not to mention that some of them even rarely occur in the real world. Also, the distribution of safety-critical data is usually multimodal, while most traditional attacks and evaluation methods focus on a single modality. To solve the above challenges, we propose a flow-based multimodal safety-critical scenario generator for evaluating decision-making algorithms. The proposed generative model is optimized with weighted likelihood maximization and a gradient-based sampling procedure is integrated to improve the sampling efficiency. The safety-critical scenarios are generated by efficiently querying the task algorithms and a simulator. Experiments on a self-driving task demonstrate our advantages in terms of testing efficiency and multimodal modeling capability. We evaluate six Reinforcement Learning algorithms with our generated traffic scenarios and provide empirical conclusions about their robustness.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2377-3766 2377-3766
DOI:	10.1109/LRA.2021.3058873