Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning

The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets withi...

Full description

Saved in:

Bibliographic Details
Published in	Bioinspiration & biomimetics Vol. 19; no. 5; pp. 56014 - 56028
Main Authors	Hou, Ji, He, Changling, Li, Tao, Zhang, Chunze, Zhou, Qin
Format	Journal Article
Language	English
Published	England IOP Publishing 01.09.2024
Subjects	Algorithms Animals Avoidance Learning - physiology Biomimetics - methods Computer Simulation Coral Reefs Deep Learning deep reinforcement learning Ecosystem Fishes - physiology fluid-structure interaction immersed boundary lattice Boltzmann method intelligent fish Models, Biological Predatory Behavior - physiology Reinforcement, Psychology sparse reward Swimming - physiology intelligent fish fluid-structure interaction deep reinforcement learning immersed boundary lattice Boltzmann method sparse reward
Online Access	Get full text
ISSN	1748-3182 1748-3190 1748-3190
DOI	10.1088/1748-3190/ad6544

Cover

Abstract	The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish’s capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.
AbstractList	The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments. The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.
Author	Zhou, Qin Li, Tao Zhang, Chunze He, Changling Hou, Ji
Author_xml	– sequence: 1 givenname: Ji surname: Hou fullname: Hou, Ji organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China – sequence: 2 givenname: Changling orcidid: 0009-0009-3186-6847 surname: He fullname: He, Changling organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China – sequence: 3 givenname: Tao orcidid: 0009-0008-7066-9558 surname: Li fullname: Li, Tao organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China – sequence: 4 givenname: Chunze surname: Zhang fullname: Zhang, Chunze organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China – sequence: 5 givenname: Qin surname: Zhou fullname: Zhou, Qin organization: LTD for Water Transport Engineering Chongqing Xike Consulting Co., Chongqing 402247, People’s Republic of China
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/39025108$$D View this record in MEDLINE/PubMed
BookMark	eNp9kbtPwzAQxi1URB-wMyGPDBT8SpqMqOIlVWKB2XLsCxgldrDTAv89Di0MSHDLnXzf90n-3RSNnHeA0DEl55QUxQVdiGLOaUkulMkzIfbQ5Odp9DMXbIymMb4QkomyYAdozEvCspQwQU8rUMFZ94R9FXulG8Bq461RTqfJGdwFMKq33mHrsPZt18A7DgA1BrexwbsWXB_xm-2fsQHo0s662gcNwwI3u_hDtF-rJsLRrs_Q4_XVw_J2vrq_uVteruaa87yfs9xwJkStNTPMcGPKUrBMEJX-V1W0LHNdUAEV1AtaVnmR6ayqVaosp4oz4DN0us3tgn9dQ-xla6OGplEO_DpKTgqWswXJFkl6spOuqxaM7IJtVfiQ33CSIN8KdPAxBqiltv0Xiz4o20hK5HAFOWCWA3K5vUIykl_G7-x_LGdbi_WdfPHr4BKlv-WfTRWY2A
CODEN	BBIICI
CitedBy_id	crossref_primary_10_1088_1748_3190_ada59c crossref_primary_10_1063_5_0259251 crossref_primary_10_1063_5_0244010
Cites_doi	10.1088/1748-3190/ab6b6c 10.1177/0954406220915216 10.4208/cicp.2014.m385 10.1007/s42241-020-0028-y 10.1016/j.compfluid.2021.104973 10.3390/app13189995 10.4208/aamm.2014.m512 10.1145/3459991 10.1080/19942060.2017.1406872 10.1002/fld.5025 10.1088/1748-3190/ac165e 10.1007/s11432-019-2760-5 10.1017/S0263574799271172 10.1016/j.compfluid.2015.03.024 10.1177/027836498600500106 10.1016/j.oceaneng.2022.112829 10.1103/PhysRevLett.118.158004 10.1002/zamm.19310110205 10.1016/S1672-6529(09)60184-0 10.3389/fphy.2022.870273 10.1063/5.0184690 10.1177/09544062221079693 10.1038/s41598-021-81124-8
ContentType	Journal Article
Copyright	2024 IOP Publishing Ltd 2024 IOP Publishing Ltd.
Copyright_xml	– notice: 2024 IOP Publishing Ltd – notice: 2024 IOP Publishing Ltd.
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8
DOI	10.1088/1748-3190/ad6544
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic
DatabaseTitleList	MEDLINE MEDLINE - Academic CrossRef
Database_xml	– sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Anatomy & Physiology
EISSN	1748-3190
ExternalDocumentID	39025108 10_1088_1748_3190_ad6544 bbad6544
Genre	Journal Article
GrantInformation_xml	– fundername: the Natural Science Foundation of Chongqing grantid: No. cstc2020jcyjmsxmX0965 – fundername: Joint Training Base Construction Project for Graduate Students in Chongqing grantid: JDLHPYJD2020024 – fundername: the National Natural Science Foundation of China grantid: No. 52109150
GroupedDBID	--- 1JI 4.4 53G 5B3 5VS 5ZH 7.M 7.Q AAGCD AAJIO AAJKP AATNI ABHWH ABJNI ABQJV ABVAM ACAFW ACGFS ACHIP AEFHF AENEX AFYNE AKPSB ALMA_UNASSIGNED_HOLDINGS AOAED ASPBG ATQHT AVWKF AZFZN CEBXE CJUJL CRLBU CS3 DU5 EBS EDWGO EMSAF EPQRW EQZZN F5P HAK IJHAN IOP IZVLO KOT LAP N5L N9A P2P PJBAE RIN RO9 ROL RPA SY9 TN5 UCJ W28 AAYXX ADEQX CITATION CGR CUY CVF ECM EIF NPM 7X8 AEINN
ID	FETCH-LOGICAL-c336t-26d3244fcc2d2d3dd9942540aad6bb1996c814ebef719b685c5bfaaaa561a32e3
IEDL.DBID	IOP
ISSN	1748-3182 1748-3190
IngestDate	Wed Jul 30 11:24:27 EDT 2025 Thu Apr 03 07:06:10 EDT 2025 Tue Jul 01 04:32:49 EDT 2025 Thu Apr 24 22:52:55 EDT 2025 Sun Aug 18 18:10:26 EDT 2024
IsPeerReviewed	true
IsScholarly	true
Issue	5
Keywords	intelligent fish fluid-structure interaction deep reinforcement learning immersed boundary lattice Boltzmann method sparse reward
Language	English
License	This article is available under the terms of the IOP-Standard License. 2024 IOP Publishing Ltd.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c336t-26d3244fcc2d2d3dd9942540aad6bb1996c814ebef719b685c5bfaaaa561a32e3
Notes	BB-103805.R1 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ORCID	0009-0008-7066-9558 0009-0009-3186-6847
PMID	39025108
PQID	3082627057
PQPubID	23479
PageCount	15
ParticipantIDs	iop_journals_10_1088_1748_3190_ad6544 crossref_primary_10_1088_1748_3190_ad6544 pubmed_primary_39025108 crossref_citationtrail_10_1088_1748_3190_ad6544 proquest_miscellaneous_3082627057
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2024-09-01
PublicationDateYYYYMMDD	2024-09-01
PublicationDate_xml	– month: 09 year: 2024 text: 2024-09-01 day: 01
PublicationDecade	2020
PublicationPlace	England
PublicationPlace_xml	– name: England
PublicationTitle	Bioinspiration & biomimetics
PublicationTitleAbbrev	BB
PublicationTitleAlternate	Bioinspir. Biomim
PublicationYear	2024
Publisher	IOP Publishing
Publisher_xml	– name: IOP Publishing
References	Padakandla (bbad6544bib24) 2021; 54 Wu (bbad6544bib15) 2015; 17 Zhu (bbad6544bib26) 2021; 11 Buzzicotti (bbad6544bib8) 2020 Sutton (bbad6544bib23) 1999; 17 Wang (bbad6544bib6) 2020; 63 Yan (bbad6544bib10) 2020; 234 Rabault (bbad6544bib20) 2020; 32 Zhang (bbad6544bib13) 2024; 36 Diao (bbad6544bib18) 2018; 12 Liu (bbad6544bib1) 2010; 7 Garnier (bbad6544bib21) 2021; 225 Colabrese (bbad6544bib9) 2017; 118 Haarnoja (bbad6544bib22) 2018 Zhai (bbad6544bib5) 2013 Tian (bbad6544bib2) 2022; 266 Zhu (bbad6544bib25) 2023; 237 Tian (bbad6544bib7) 2020; 15 Khatib (bbad6544bib4) 1986; 5 Zhu (bbad6544bib12) 2022; 10 Yan (bbad6544bib11) 2021; 93 Zermelo (bbad6544bib3) 1931; 11 Zhang (bbad6544bib17) 2016; 8 Zhang (bbad6544bib19) 2016; 124 Zhang (bbad6544bib16) 2023; 13 Li (bbad6544bib14) 2021; 16
References_xml	– volume: 15 year: 2020 ident: bbad6544bib7 article-title: CFD based parameter tuning for motion control of robotic fish publication-title: Bioinspir. Biomim. doi: 10.1088/1748-3190/ab6b6c – volume: 234 start-page: 3397 year: 2020 ident: bbad6544bib10 article-title: A numerical simulation method for bionic fish self-propelled swimming under control based on deep reinforcement learning publication-title: Proc. Inst. Mech. Eng. C doi: 10.1177/0954406220915216 – volume: 17 start-page: 1271 year: 2015 ident: bbad6544bib15 article-title: Three-dimensional simulation of balloon dynamics by the immersed boundary method coupled to the multiple-relaxation-time lattice Boltzmann method publication-title: Commun. Comput. Phys. doi: 10.4208/cicp.2014.m385 – volume: 32 start-page: 234 year: 2020 ident: bbad6544bib20 article-title: Deep reinforcement learning in fluid mechanics: a promising method for both active flow control and shape optimization publication-title: J. Hydrodynamics doi: 10.1007/s42241-020-0028-y – start-page: pp 1861 year: 2018 ident: bbad6544bib22 article-title: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor – volume: 225 year: 2021 ident: bbad6544bib21 article-title: A review on deep reinforcement learning for fluid mechanics publication-title: Comput. Fluids doi: 10.1016/j.compfluid.2021.104973 – volume: 13 start-page: 9995 year: 2023 ident: bbad6544bib16 article-title: Stability improvement of the immersed boundary–lattice Boltzmann coupling scheme by semi-implicit weighting of external force publication-title: Appl. Sci. doi: 10.3390/app13189995 – volume: 8 start-page: 37 year: 2016 ident: bbad6544bib17 article-title: Improving the stability of the multiple-relaxation-time lattice Boltzmann method by a viscosity counteracting approach publication-title: Adv. Appl. Math. Mech. doi: 10.4208/aamm.2014.m512 – volume: 54 start-page: 1 year: 2021 ident: bbad6544bib24 article-title: A survey of reinforcement learning algorithms for dynamically varying environments publication-title: ACM Comput. Surv. doi: 10.1145/3459991 – volume: 12 start-page: 250 year: 2018 ident: bbad6544bib18 article-title: Simulation of hydraulic characteristics of an inclined overflow gate by the free-surface lattice Boltzmann-immersed boundary coupling scheme publication-title: Eng. Appl. Comput. Fluid Mech. doi: 10.1080/19942060.2017.1406872 – start-page: pp 223 year: 2020 ident: bbad6544bib8 article-title: Optimal control of point-to-point navigation in turbulent time dependent flows using reinforcement learning – start-page: pp 2616 year: 2013 ident: bbad6544bib5 article-title: Formation control of multiple robot fishes based on artificial potential field and leader-follower framework – volume: 93 start-page: 3073 year: 2021 ident: bbad6544bib11 article-title: Learning how to avoid obstacles: a numerical investigation for maneuvering of self-propelled fish based on deep reinforcement learning publication-title: Int. J. Numer. Methods Fluids doi: 10.1002/fld.5025 – volume: 16 year: 2021 ident: bbad6544bib14 article-title: Fish can save energy via proprioceptive sensing publication-title: Bioinspir. Biomim. doi: 10.1088/1748-3190/ac165e – volume: 63 start-page: 1 year: 2020 ident: bbad6544bib6 article-title: Trajectory tracking control of a bionic robotic fish based on iterative learning publication-title: Sci. China Inf. Sci. doi: 10.1007/s11432-019-2760-5 – volume: 17 start-page: 229 year: 1999 ident: bbad6544bib23 article-title: Reinforcement learning: an introduction publication-title: Robotica doi: 10.1017/S0263574799271172 – volume: 124 start-page: 246 year: 2016 ident: bbad6544bib19 article-title: Accuracy improvement of the immersed boundary–lattice Boltzmann coupling scheme by iterative force correction publication-title: Comput. Fluids doi: 10.1016/j.compfluid.2015.03.024 – volume: 5 start-page: 90 year: 1986 ident: bbad6544bib4 article-title: Real-time obstacle avoidance for manipulators and mobile robots publication-title: Int. J. Robot. Res. doi: 10.1177/027836498600500106 – volume: 266 year: 2022 ident: bbad6544bib2 article-title: A two-level optimization algorithm for path planning of bionic robotic fish in the three-dimensional environment with ocean currents and moving obstacles publication-title: Ocean Eng. doi: 10.1016/j.oceaneng.2022.112829 – volume: 118 year: 2017 ident: bbad6544bib9 article-title: Flow navigation by smart microswimmers via reinforcement learning publication-title: Phys. Rev. Lett. doi: 10.1103/PhysRevLett.118.158004 – volume: 11 start-page: 114 year: 1931 ident: bbad6544bib3 article-title: Über das navigationsproblem bei ruhender oder veränderlicher windverteilung publication-title: Z. Angew. Math. Mech. doi: 10.1002/zamm.19310110205 – volume: 7 start-page: 35 year: 2010 ident: bbad6544bib1 article-title: Biological inspiration: from carangiform fish to multi-joint robotic fish publication-title: J. Bionic Eng. doi: 10.1016/S1672-6529(09)60184-0 – volume: 10 year: 2022 ident: bbad6544bib12 article-title: Point-to-point navigation of a fish-like swimmer in a vortical flow with deep reinforcement learning publication-title: Front. Phys. doi: 10.3389/fphy.2022.870273 – volume: 36 year: 2024 ident: bbad6544bib13 article-title: A numerical simulation research on fish adaption behavior based on deep reinforcement learning and fluid–structure coupling: implementation of the ‘perceive-feedback-memory’ control system publication-title: Phys. Fluids doi: 10.1063/5.0184690 – volume: 237 start-page: 2450 year: 2023 ident: bbad6544bib25 article-title: A numerical simulation of target-directed swimming for a three-link bionic fish with deep reinforcement learning publication-title: Proc. Inst. Mech. Eng. C doi: 10.1177/09544062221079693 – volume: 11 start-page: 1691 year: 2021 ident: bbad6544bib26 article-title: A numerical study of fish adaption behaviors in complex environments with a deep reinforcement learning and immersed boundary–lattice Boltzmann method publication-title: Sci. Rep. doi: 10.1038/s41598-021-81124-8
SSID	ssj0054982
Score	2.3843873
Snippet	The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but...
SourceID	proquest pubmed crossref iop
SourceType	Aggregation Database Index Database Enrichment Source Publisher
StartPage	56014
SubjectTerms	Algorithms Animals Avoidance Learning - physiology Biomimetics - methods Computer Simulation Coral Reefs Deep Learning deep reinforcement learning Ecosystem Fishes - physiology fluid-structure interaction immersed boundary lattice Boltzmann method intelligent fish Models, Biological Predatory Behavior - physiology Reinforcement, Psychology sparse reward Swimming - physiology
Title	Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning
URI	https://iopscience.iop.org/article/10.1088/1748-3190/ad6544 https://www.ncbi.nlm.nih.gov/pubmed/39025108 https://www.proquest.com/docview/3082627057
Volume	19
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR3JSuRA9NHqZS7jNmq7UQMqeEgvlUp1gicRRYXROSh4EEJtaRo1adpuUb_eV0s3jmgjk1MgL7W8elX19gewk7VUJiSKqYxzHTGB4k4maScq8EAuqGJt1rbxzn8u-Ok1O79JbmpwMImFqfrh6G_gq08U7FEYHOLSJvLQVqWXtZpC84SxGZizhSsteZ9d_h0fwyj3uEpRATqlwUb5WQv_3Ekz2O_X7Ka7dk7m4XY8YO9tctcYDWVDvX7I5fifM1qAn4EdJYcedBFqplyC5cMSRfGHF7JHnIOo07wvQzfkYu2SSiJPifBEPFU9bemGiFKT_sD4Ck2kVxLnq26eyQBXg7wPpyNW9Uu0MX385vK2KqeiJKGARfcXXJ8cXx2dRqFOQ6TimA8jyjWyZaxQimqqY62zDE8C1hI4Gymtm7NK2wyppei0M8nTRCWyEPgg7yZiauIVmC2r0qwBiaVsKdWxQXKKKRTPacEMjyXXAolGmDo0xyuVq5DE3NbSuM-dMT1Nc4vL3OIy97isw_7kj75P4DEFdheXKA-7-HEK3O8xeeS4G62JRZSmGj3mNvkPx9EnnTqserqZ9Bpbiy62t_7NXjbgB0UOyju0bcLscDAyW8gBDeW2o_Q3bcX98A
linkProvider	IOP Publishing
linkToPdf	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpR3ZbtNAcNQUCfFSjkJJgXYrARIPzrFeb-zHChq1pZQ8UClvy16OIlrbShPU8vXMHqkA0ahS_WTJs9fs7HiunQF4W_R0IRWqqYxzkzCJ6k6h6CApkSGXVLM-67v7zl9O-eEZOx5n41jn1N-FqZvI-jv4GhIFBxTGgLi8izK0M-kVva40PGOs25iyBQ8yZMUupuvo62jJilH38dWiYoucRj_l_3r567_UwrFvFzn9r2f4GL4vJx0iTn50FnPV0b_-yed4j1U9gY0olpL9AP4U1mz1DDb3K1TJL67Je-IDRb0FfhMmMSfrhNQKZUuEJ_JnPTWOfoisDGlmNlRqItOK-Jh1e0VmuCvkz2t1xJmAibG2wW8-f6v2pkoSC1lMnsPZ8ODbx8Mk1mtIdJryeUK5QfGMlVpTQ01qTFEgR2A9iStSyoU767zPkGrKQb9QPM90pkqJD8pwMqU2fQHrVV3Zl0BSpXpaD9xlOc00qum0ZJanihuJxCNtG7rL3RI6JjN3NTXOhXeq57lw-BQOnyLgsw0fblo0IZHHCth3uE0inubLFXB7SxIReCqdq0VWtl5cCpcEiOPss0EbtgLt3IyaOs8u9rd9x1F24eHo01CcHJ1-fgWPKApVIcbtNazPZwv7BoWiudrxhP8bsR4DYw
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Learning+obstacle+avoidance+and+predation+in+complex+reef+environments+with+deep+reinforcement+learning&rft.jtitle=Bioinspiration+%26+biomimetics&rft.au=Hou%2C+Ji&rft.au=He%2C+Changling&rft.au=Li%2C+Tao&rft.au=Zhang%2C+Chunze&rft.date=2024-09-01&rft.issn=1748-3190&rft.eissn=1748-3190&rft.volume=19&rft.issue=5&rft_id=info:doi/10.1088%2F1748-3190%2Fad6544&rft.externalDBID=NO_FULL_TEXT
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1748-3182&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1748-3182&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1748-3182&client=summon