Unmanned aerial vehicle safety path planning method based on maximum entropy multi-agent reinforcement learning

The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a...

Full description

Saved in:

Bibliographic Details
Main Authors	YANG FEIYU, FANG CHENGLIANG, LI YANG
Format	Patent
Language	Chinese English
Published	19.04.2024
Subjects	CONTROLLING PHYSICS REGULATING SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
Online Access	Get full text

Cover

Loading…

Abstract	The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a pre-assumed condition for the collaborative path planning problem of an unmanned aerial vehicle group, and completing the initialization setting of parameters; introducing a two-dimensional unmanned aerial vehicle kinetic equation; constructing a six-tuple of a partial observable Markov decision process of an unmanned aerial vehicle group collaborative path planning problem to obtain a POMDP model; based on a multi-agent soft actor commentator algorithm, through interaction of unmanned aerial vehicles and an air combat simulation environment, training agents to solve an unmanned aerial vehicle group collaborative path planning POMDP model strategy in the air combat simulation environment, and obtaining trained agen
AbstractList	The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a pre-assumed condition for the collaborative path planning problem of an unmanned aerial vehicle group, and completing the initialization setting of parameters; introducing a two-dimensional unmanned aerial vehicle kinetic equation; constructing a six-tuple of a partial observable Markov decision process of an unmanned aerial vehicle group collaborative path planning problem to obtain a POMDP model; based on a multi-agent soft actor commentator algorithm, through interaction of unmanned aerial vehicles and an air combat simulation environment, training agents to solve an unmanned aerial vehicle group collaborative path planning POMDP model strategy in the air combat simulation environment, and obtaining trained agen
Author	YANG FEIYU LI YANG FANG CHENGLIANG
Author_xml	– fullname: YANG FEIYU – fullname: FANG CHENGLIANG – fullname: LI YANG
BookMark	eNqNjEsKwjAURTvQgb89PBdQsIi_oRTFkSMdl2d72waSl5CkYndvCy7A0eXAOXeeTMQKZol9imERVMTwijW90apSgwLXiD05ji05PShKGjKIra3oxWEIrJDhjzKdIUj01vVkOh1Vys3A5KGktr6EGUmD_XixTKY164DVbxfJ-np55LcUzhYIjksIYpHfs-xw2hx3-915-4_zBbkaRXA
ContentType	Patent
DBID	EVB
DatabaseName	esp@cenet
DatabaseTitleList
Database_xml	– sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Chemistry Sciences Physics
DocumentTitleAlternate	基于最大熵多智能体强化学习的无人机安全路径规划方法
ExternalDocumentID	CN117908565A
GroupedDBID	EVB
ID	FETCH-epo_espacenet_CN117908565A3
IEDL.DBID	EVB
IngestDate	Fri Jul 19 12:46:40 EDT 2024
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	Chinese English
LinkModel	DirectLink
MergedId	FETCHMERGED-epo_espacenet_CN117908565A3
Notes	Application Number: CN202410070432
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240419&DB=EPODOC&CC=CN&NR=117908565A
ParticipantIDs	epo_espacenet_CN117908565A
PublicationCentury	2000
PublicationDate	20240419
PublicationDateYYYYMMDD	2024-04-19
PublicationDate_xml	– month: 04 year: 2024 text: 20240419 day: 19
PublicationDecade	2020
PublicationYear	2024
RelatedCompanies	CHONGQING KECHUANG CENTER OF NORTHWESTERN POLYTECHNICAL UNIVERSITY
RelatedCompanies_xml	– name: CHONGQING KECHUANG CENTER OF NORTHWESTERN POLYTECHNICAL UNIVERSITY
Score	3.643832
Snippet	The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method...
SourceID	epo
SourceType	Open Access Repository
SubjectTerms	CONTROLLING PHYSICS REGULATING SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
Title	Unmanned aerial vehicle safety path planning method based on maximum entropy multi-agent reinforcement learning
URI	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240419&DB=EPODOC&locale=&CC=CN&NR=117908565A
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPm-KGsVHxsRwa4RSXofGSAshJhZiwHAj23YVTPoILSr-emfWIl70uk022enOfrOzM98HcC08QcDqSo3At6wZDSk0l7aFJhuGrxNgMYUWV1s49d7IuB_Xxjl4XfXCKJ7Qd0WOSB7lkb-n6ryO10ksW9VWJjfujIai2-7QtEvZ7ZjgySAPtNtmZ9C3-1bJskzLKTmPJjOfUXRRr91twCaH0cyz33lqc1dK_BtSuvuwNaDZwvQAcp_TAuxaK-W1Auw8ZA_eBdhWFZpeQoOZFyaHEI3CQPD5iELtH3yTU_79mIhnmS6RVYYxztSI8FsjGhmufIxCDMTHLFgEyGndKF6iKinUBLdY4VwqIlVP5QwxU5R4OYKrbmdo9TRaw-THYBPLWS-3egz5MArlCSCFOnSfa7YM0SwbDdF0paj5FV33delVXd0_heLf8xT_-3gGe2x8fmWptM4hn84X8oLAOnUvlZW_ACYXmvc
link.rule.ids	230,309,786,891,25594,76906
linkProvider	European Patent Office
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPvCmqFF8rYnh1iilUDg0RloIKhRiwHAj23YVTPoILSr-emfWIl70uptsso_Zb3Z25vsALrnLEVgdoSD4XiuaLrji4LFQhK55KgIWUWhRtoVdbQ-1-1FllIHXZS2M5Al9l-SIaFEu2nsi7-toFcSyZG5lfOVMsSm8aQ0Mq5i-jhGeNLRAq2E0-z2rZxZN0zDtov1oEPMZehfVyu0arOvEzkuu01ODqlKi35DS2oGNPo4WJLuQ-ZzkIWculdfysNVNP7zzsCkzNN0YG1MrjPcgHAY-p_uRcXl-2JuY0PazmD-LZMFIZZhFqRoR-9aIZgRXHgsD5vOPqT_3GYV1w2jBZEqhwqnEis2EJFJ1ZcyQpYoSL_tw0WoOzLaCcxj_LNjYtFfTLR9ANggDcQgMXR18z9XqGq9dazqvOYJXvJKqeqpwy47qHUHh73EK_3WeQ6496HbGnTv74Ri2aSPox6VUP4FsMpuLUwTuxDmTK_4FjxOd5A
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Unmanned+aerial+vehicle+safety+path+planning+method+based+on+maximum+entropy+multi-agent+reinforcement+learning&rft.inventor=YANG+FEIYU&rft.inventor=FANG+CHENGLIANG&rft.inventor=LI+YANG&rft.date=2024-04-19&rft.externalDBID=A&rft.externalDocID=CN117908565A