Unmanned ship path tracking method based on reinforcement learning

The invention provides an unmanned ship path tracking method based on reinforcement learning. The method comprises the following steps: setting a tracking path of an unmanned ship; establishing an MDP model for the tracking path of the unmanned ship and an obstacle avoidance process, and setting a r...

Bibliographic Details
Main Authors DONG BOTAO, WU DI, SUN ZHIJIAN, XU XINLI, ZHANG WEIDONG, ZHANG YONGHUI, HUANG MENGXING
Format Patent
Language Chinese, English
Published 11.06.2021
Abstract The invention provides a path tracking method for an unmanned ship based on reinforcement learning. The method comprises the following steps: setting a tracking path for the unmanned ship; establishing a Markov decision process (MDP) model of the path tracking and obstacle avoidance process, and setting a reward function; performing reinforcement learning on the state information model with the proximal policy optimization (PPO) algorithm to obtain control strategy parameters for the unmanned ship; and having the unmanned ship execute the path tracking task according to those control strategy parameters. Using PPO during strategy training improves the convergence of the action strategy parameters, and the algorithm iterates without human intervention. Because the agent actively interacts with the environment, it can learn the various interference factors present in that environment, the design process of the
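The abstract names a reward function and the PPO algorithm but this record does not give their form. The following is a minimal illustrative sketch, not the patented method: the reward shape (cross-track error penalty plus an obstacle safety-zone penalty), the function names `cross_track_error`, `reward`, and `ppo_clip_objective`, and all weights and radii are assumptions chosen for illustration. Only the clipped surrogate objective follows the standard published form of PPO.

```python
import math


def cross_track_error(pos, wp_a, wp_b):
    """Signed perpendicular distance from pos to the line through wp_a -> wp_b."""
    (x, y), (ax, ay), (bx, by) = pos, wp_a, wp_b
    dx, dy = bx - ax, by - ay
    length = math.hypot(dx, dy)
    # 2-D cross product of the path direction with the vector to the ship,
    # normalised by the segment length.
    return (dx * (y - ay) - dy * (x - ax)) / length


def reward(pos, wp_a, wp_b, obstacles, safe_radius=5.0, k_track=1.0, k_obs=10.0):
    """Hypothetical reward: penalise path deviation and obstacle proximity.

    safe_radius, k_track and k_obs are illustrative values, not from the patent.
    """
    r = -k_track * abs(cross_track_error(pos, wp_a, wp_b))
    for ox, oy in obstacles:
        dist = math.hypot(pos[0] - ox, pos[1] - oy)
        if dist < safe_radius:
            # Penalty grows linearly as the ship moves deeper into the safety zone.
            r -= k_obs * (safe_radius - dist) / safe_radius
    return r


def ppo_clip_objective(ratios, advantages, eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to be maximised).

    ratios are new-policy / old-policy action probabilities per sample.
    """
    total = 0.0
    for rho, adv in zip(ratios, advantages):
        clipped = max(1.0 - eps, min(1.0 + eps, rho))
        total += min(rho * adv, clipped * adv)
    return total / len(ratios)
```

In this sketch the clipping keeps each policy update close to the previous policy, which is the mechanism usually credited for PPO's stable convergence; how the patent parameterises the state, action, and reward is not recoverable from this record.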
DocumentTitleAlternate 一种基于强化学习的无人船路径跟踪方法
ExternalDocumentID CN112947431A
Notes Application Number: CN202110149076
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210611&DB=EPODOC&CC=CN&NR=112947431A
RelatedCompanies HAIZHIYUN (SUZHOU) TECHNOLOGY CO., LTD
SubjectTerms CONTROLLING
PHYSICS
REGULATING
SYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES