Unmanned aerial vehicle safety path planning method based on maximum entropy multi-agent reinforcement learning

The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a...

Full description

Saved in:
Bibliographic Details
Main Authors YANG FEIYU, FANG CHENGLIANG, LI YANG
Format Patent
LanguageChinese
English
Published 19.04.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a pre-assumed condition for the collaborative path planning problem of an unmanned aerial vehicle group, and completing the initialization setting of parameters; introducing a two-dimensional unmanned aerial vehicle kinetic equation; constructing a six-tuple of a partial observable Markov decision process of an unmanned aerial vehicle group collaborative path planning problem to obtain a POMDP model; based on a multi-agent soft actor commentator algorithm, through interaction of unmanned aerial vehicles and an air combat simulation environment, training agents to solve an unmanned aerial vehicle group collaborative path planning POMDP model strategy in the air combat simulation environment, and obtaining trained agen
AbstractList The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method comprises the steps: building a reinforcement learning air combat simulation environment without human-computer interaction on the basis of a pre-assumed condition for the collaborative path planning problem of an unmanned aerial vehicle group, and completing the initialization setting of parameters; introducing a two-dimensional unmanned aerial vehicle kinetic equation; constructing a six-tuple of a partial observable Markov decision process of an unmanned aerial vehicle group collaborative path planning problem to obtain a POMDP model; based on a multi-agent soft actor commentator algorithm, through interaction of unmanned aerial vehicles and an air combat simulation environment, training agents to solve an unmanned aerial vehicle group collaborative path planning POMDP model strategy in the air combat simulation environment, and obtaining trained agen
Author YANG FEIYU
LI YANG
FANG CHENGLIANG
Author_xml – fullname: YANG FEIYU
– fullname: FANG CHENGLIANG
– fullname: LI YANG
BookMark eNqNjEsKwjAURTvQgb89PBdQsIi_oRTFkSMdl2d72waSl5CkYndvCy7A0eXAOXeeTMQKZol9imERVMTwijW90apSgwLXiD05ji05PShKGjKIra3oxWEIrJDhjzKdIUj01vVkOh1Vys3A5KGktr6EGUmD_XixTKY164DVbxfJ-np55LcUzhYIjksIYpHfs-xw2hx3-915-4_zBbkaRXA
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
DocumentTitleAlternate 基于最大熵多智能体强化学习的无人机安全路径规划方法
ExternalDocumentID CN117908565A
GroupedDBID EVB
ID FETCH-epo_espacenet_CN117908565A3
IEDL.DBID EVB
IngestDate Fri Jul 19 12:46:40 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language Chinese
English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_CN117908565A3
Notes Application Number: CN202410070432
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240419&DB=EPODOC&CC=CN&NR=117908565A
ParticipantIDs epo_espacenet_CN117908565A
PublicationCentury 2000
PublicationDate 20240419
PublicationDateYYYYMMDD 2024-04-19
PublicationDate_xml – month: 04
  year: 2024
  text: 20240419
  day: 19
PublicationDecade 2020
PublicationYear 2024
RelatedCompanies CHONGQING KECHUANG CENTER OF NORTHWESTERN POLYTECHNICAL UNIVERSITY
RelatedCompanies_xml – name: CHONGQING KECHUANG CENTER OF NORTHWESTERN POLYTECHNICAL UNIVERSITY
Score 3.643832
Snippet The invention discloses an unmanned aerial vehicle safe path planning method based on maximum entropy multi-agent deep reinforcement learning, and the method...
SourceID epo
SourceType Open Access Repository
SubjectTerms CONTROLLING
PHYSICS
REGULATING
SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
Title Unmanned aerial vehicle safety path planning method based on maximum entropy multi-agent reinforcement learning
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240419&DB=EPODOC&locale=&CC=CN&NR=117908565A
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPm-KGsVHxsRwa4RSXofGSAshJhZiwHAj23YVTPoILSr-emfWIl70uk022enOfrOzM98HcC08QcDqSo3At6wZDSk0l7aFJhuGrxNgMYUWV1s49d7IuB_Xxjl4XfXCKJ7Qd0WOSB7lkb-n6ryO10ksW9VWJjfujIai2-7QtEvZ7ZjgySAPtNtmZ9C3-1bJskzLKTmPJjOfUXRRr91twCaH0cyz33lqc1dK_BtSuvuwNaDZwvQAcp_TAuxaK-W1Auw8ZA_eBdhWFZpeQoOZFyaHEI3CQPD5iELtH3yTU_79mIhnmS6RVYYxztSI8FsjGhmufIxCDMTHLFgEyGndKF6iKinUBLdY4VwqIlVP5QwxU5R4OYKrbmdo9TRaw-THYBPLWS-3egz5MArlCSCFOnSfa7YM0SwbDdF0paj5FV33delVXd0_heLf8xT_-3gGe2x8fmWptM4hn84X8oLAOnUvlZW_ACYXmvc
link.rule.ids 230,309,786,891,25594,76906
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPvCmqFF8rYnh1iilUDg0RloIKhRiwHAj23YVTPoILSr-emfWIl70uptsso_Zb3Z25vsALrnLEVgdoSD4XiuaLrji4LFQhK55KgIWUWhRtoVdbQ-1-1FllIHXZS2M5Al9l-SIaFEu2nsi7-toFcSyZG5lfOVMsSm8aQ0Mq5i-jhGeNLRAq2E0-z2rZxZN0zDtov1oEPMZehfVyu0arOvEzkuu01ODqlKi35DS2oGNPo4WJLuQ-ZzkIWculdfysNVNP7zzsCkzNN0YG1MrjPcgHAY-p_uRcXl-2JuY0PazmD-LZMFIZZhFqRoR-9aIZgRXHgsD5vOPqT_3GYV1w2jBZEqhwqnEis2EJFJ1ZcyQpYoSL_tw0WoOzLaCcxj_LNjYtFfTLR9ANggDcQgMXR18z9XqGq9dazqvOYJXvJKqeqpwy47qHUHh73EK_3WeQ6496HbGnTv74Ri2aSPox6VUP4FsMpuLUwTuxDmTK_4FjxOd5A
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Unmanned+aerial+vehicle+safety+path+planning+method+based+on+maximum+entropy+multi-agent+reinforcement+learning&rft.inventor=YANG+FEIYU&rft.inventor=FANG+CHENGLIANG&rft.inventor=LI+YANG&rft.date=2024-04-19&rft.externalDBID=A&rft.externalDocID=CN117908565A