Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Proceedings of the 2020 Conference on Robot Learning, PMLR 155:1930-1942, 2021 Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem. The key challenges lie in learning good dynamics models, developing algorithms tha...

Full description

Saved in:

Bibliographic Details
Main Authors	Das, Neha, Bechtle, Sarah, Davchev, Todor, Jayaraman, Dinesh, Rai, Akshara, Meier, Franziska
Format	Journal Article
Language	English
Published	18.10.2020
Subjects	Computer Science - Learning Computer Science - Robotics
Online Access	Get full text

Cover

Loading…

Abstract	Proceedings of the 2020 Conference on Robot Learning, PMLR 155:1930-1942, 2021 Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem. The key challenges lie in learning good dynamics models, developing algorithms that scale to high-dimensional state-spaces and being able to learn from both visual and proprioceptive demonstrations. In this work, we present a gradient-based inverse reinforcement learning framework that utilizes a pre-trained visual dynamics model to learn cost functions when given only visual human demonstrations. The learned cost functions are then used to reproduce the demonstrated behavior via visual model predictive control. We evaluate our framework on hardware on two basic object manipulation tasks.
AbstractList	Proceedings of the 2020 Conference on Robot Learning, PMLR 155:1930-1942, 2021 Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem. The key challenges lie in learning good dynamics models, developing algorithms that scale to high-dimensional state-spaces and being able to learn from both visual and proprioceptive demonstrations. In this work, we present a gradient-based inverse reinforcement learning framework that utilizes a pre-trained visual dynamics model to learn cost functions when given only visual human demonstrations. The learned cost functions are then used to reproduce the demonstrated behavior via visual model predictive control. We evaluate our framework on hardware on two basic object manipulation tasks.
Author	Jayaraman, Dinesh Davchev, Todor Das, Neha Rai, Akshara Bechtle, Sarah Meier, Franziska
Author_xml	– sequence: 1 givenname: Neha surname: Das fullname: Das, Neha – sequence: 2 givenname: Sarah surname: Bechtle fullname: Bechtle, Sarah – sequence: 3 givenname: Todor surname: Davchev fullname: Davchev, Todor – sequence: 4 givenname: Dinesh surname: Jayaraman fullname: Jayaraman, Dinesh – sequence: 5 givenname: Akshara surname: Rai fullname: Rai, Akshara – sequence: 6 givenname: Franziska surname: Meier fullname: Meier, Franziska
BackLink	https://doi.org/10.48550/arXiv.2010.09034$$DView paper in arXiv
BookMark	eNotj7FOwzAUAD3AAIUPYMI_kPIcP6fxCKVApSAkVLFGdv2MLCU2skMFf08JTCfdcNKds5OYIjF2JWCJrVJwY_JXOCxrOArQIPGMbZ6To6G6M4Uc38YD5UL8lUL0Ke9ppDjxjkyOIb5zn9PI30L5NAO_pzHFMmUzhSMv2Kk3Q6HLfy7Y7mGzWz9V3cvjdn3bVaZZYaWFdmqPjbcWrWi9U-DAC9WKBi3gCiWSrWcnvNBKWdHUAICARkvp5IJd_2Xnj_4jh9Hk7_73p59_5A8zPUZ8
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	AKY GOX
DOI	10.48550/arxiv.2010.09034
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2010_09034
GroupedDBID	AKY GOX
ID	FETCH-LOGICAL-a674-919d5c46fbb4b18fd50d0f158164b047434eb20d0f11f1955b162000404a933d3
IEDL.DBID	GOX
IngestDate	Mon Jan 08 05:49:11 EST 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-a674-919d5c46fbb4b18fd50d0f158164b047434eb20d0f11f1955b162000404a933d3
Notes	PMLR 155:1930-1942
OpenAccessLink	https://arxiv.org/abs/2010.09034
ParticipantIDs	arxiv_primary_2010_09034
PublicationCentury	2000
PublicationDate	2020-10-18
PublicationDateYYYYMMDD	2020-10-18
PublicationDate_xml	– month: 10 year: 2020 text: 2020-10-18 day: 18
PublicationDecade	2020
PublicationYear	2020
Score	1.7894969
SecondaryResourceType	preprint
Snippet	Proceedings of the 2020 Conference on Robot Learning, PMLR 155:1930-1942, 2021 Scaling model-based inverse reinforcement learning (IRL) to real robotic...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Learning Computer Science - Robotics
Title	Model-Based Inverse Reinforcement Learning from Visual Demonstrations
URI	https://arxiv.org/abs/2010.09034
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB7anryIolKf5OA1mGySfRx9tBYPClJlb0ueUtBVdlvx55vHil68JiGQCcw3XzLzDcB5IQ0LDZEwcdJiTgnBMrMOa8e0ctqQXMcs3_t88cTvalGPAP3Uwsjua_WZ9IFVf5EyryrC-BjGWRZStm4f6vQ5GaW4hvW_63yMGYf-gMR8B7aH6A5dpuvYhZFt92AW-o294iuPFwYFXYuut-jRRslSHV_n0KBy-oJCtQd6XvUbv8uNfQvBW7qifh-W89nyeoGH7gVY5gX3TqQyQvPcKcUVLZ0RxBBHRen5iSLcAzf3pDaOUUcrIRTNQ9kMJ1xWjBl2AJP2vbVTQFQX1DLJC-9XuXEBYZnnSTm1yhSeBR7CNJ65-UgCFU0wRxPNcfT_1DFsZYE7huyM8gQm625jTz3ArtVZtPI33ml6kQ
link.rule.ids	228,230,786,891
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Model-Based+Inverse+Reinforcement+Learning+from+Visual+Demonstrations&rft.au=Das%2C+Neha&rft.au=Bechtle%2C+Sarah&rft.au=Davchev%2C+Todor&rft.au=Jayaraman%2C+Dinesh&rft.date=2020-10-18&rft_id=info:doi/10.48550%2Farxiv.2010.09034&rft.externalDocID=2010_09034