A Safe Preference Learning Approach for Personalization With Applications to Autonomous Vehicles

This letter introduces a preference learning method that ensures adherence to given specifications, with an application to autonomous vehicles. Our approach incorporates the priority ordering of Signal Temporal Logic (STL) formulas describing traffic rules into a learning framework. By leveraging Pa...

Full description

Saved in:

Bibliographic Details
Published in	IEEE robotics and automation letters Vol. 9; no. 5; pp. 4226 - 4233
Main Authors	Karagulle, Ruya, Arechiga, Nikos, Best, Andrew, DeCastro, Jonathan, Ozay, Necmiye
Format	Journal Article
Language	English
Published	Piscataway IEEE 01.05.2024 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Autonomous vehicles Formal specifications Learning Learning systems machine learning algorithms Pedestrian crossings Preferences Robustness Safety Semantics Task analysis Teaching methods Temporal logic vehicle safety Vehicles
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This letter introduces a preference learning method that ensures adherence to given specifications, with an application to autonomous vehicles. Our approach incorporates the priority ordering of Signal Temporal Logic (STL) formulas describing traffic rules into a learning framework. By leveraging Parametric Weighted Signal Temporal Logic (PWSTL), we formulate the problem of safety-guaranteed preference learning based on pairwise comparisons and propose an approach to solve this learning problem. Our approach finds a feasible valuation for the weights of the given PWSTL formula such that, with these weights, preferred signals have weighted quantitative satisfaction measures greater than their non-preferred counterparts. The feasible valuation of weights given by our approach leads to a weighted STL formula that can be used in correct-and-custom-by-construction controller synthesis. We demonstrate the performance of our method with a pilot human subject study in two different simulated driving scenarios involving a stop sign and a pedestrian crossing. Our approach yields competitive results compared to existing preference learning methods in terms of capturing preferences and notably outperforms them when safety is considered.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2377-3766 2377-3766
DOI:	10.1109/LRA.2024.3375626