Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning

The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets withi...

Full description

Saved in:
Bibliographic Details
Published inBioinspiration & biomimetics Vol. 19; no. 5; pp. 56014 - 56028
Main Authors Hou, Ji, He, Changling, Li, Tao, Zhang, Chunze, Zhou, Qin
Format Journal Article
LanguageEnglish
Published England IOP Publishing 01.09.2024
Subjects
Online AccessGet full text
ISSN1748-3182
1748-3190
1748-3190
DOI10.1088/1748-3190/ad6544

Cover

Abstract The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish’s capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.
AbstractList The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.
The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but also influencing their behavioral tendencies. Understanding the intricate mechanism through which fish adeptly navigate the moving targets within reef environments within complex water flow, all while evading obstacles and maintaining stable postures, has remained a challenging and prominent subject in the realms of fish behavior, ecology, and biomimetics alike. An integrated simulation framework is used to investigate fish predation problems within intricate environments, combining deep reinforcement learning algorithms (DRL) with high-precision fluid-structure interaction numerical methods-immersed boundary lattice Boltzmann method (lB-LBM). The Soft Actor-Critic (SAC) algorithm is used to improve the intelligent fish's capacity for random exploration, tackling the multi-objective sparse reward challenge inherent in real-world scenarios. Additionally, a reward shaping method tailored to its action purposes has been developed, capable of capturing outcomes and trend characteristics effectively. The convergence and robustness advantages of the method elucidated in this paper are showcased through two case studies: one addressing fish capturing randomly moving targets in hydrostatic flow field, and the other focusing on fish counter-current foraging in reef environments to capture drifting food. A comprehensive analysis was conducted of the influence and significance of various reward types on the decision-making processes of intelligent fish within intricate environments.
Author Zhou, Qin
Li, Tao
Zhang, Chunze
He, Changling
Hou, Ji
Author_xml – sequence: 1
  givenname: Ji
  surname: Hou
  fullname: Hou, Ji
  organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China
– sequence: 2
  givenname: Changling
  orcidid: 0009-0009-3186-6847
  surname: He
  fullname: He, Changling
  organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China
– sequence: 3
  givenname: Tao
  orcidid: 0009-0008-7066-9558
  surname: Li
  fullname: Li, Tao
  organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China
– sequence: 4
  givenname: Chunze
  surname: Zhang
  fullname: Zhang, Chunze
  organization: Chongqing Jiaotong University The College of River and Ocean Engineering, Chongqing 400074, People’s Republic of China
– sequence: 5
  givenname: Qin
  surname: Zhou
  fullname: Zhou, Qin
  organization: LTD for Water Transport Engineering Chongqing Xike Consulting Co., Chongqing 402247, People’s Republic of China
BackLink https://www.ncbi.nlm.nih.gov/pubmed/39025108$$D View this record in MEDLINE/PubMed
BookMark eNp9kbtPwzAQxi1URB-wMyGPDBT8SpqMqOIlVWKB2XLsCxgldrDTAv89Di0MSHDLnXzf90n-3RSNnHeA0DEl55QUxQVdiGLOaUkulMkzIfbQ5Odp9DMXbIymMb4QkomyYAdozEvCspQwQU8rUMFZ94R9FXulG8Bq461RTqfJGdwFMKq33mHrsPZt18A7DgA1BrexwbsWXB_xm-2fsQHo0s662gcNwwI3u_hDtF-rJsLRrs_Q4_XVw_J2vrq_uVteruaa87yfs9xwJkStNTPMcGPKUrBMEJX-V1W0LHNdUAEV1AtaVnmR6ayqVaosp4oz4DN0us3tgn9dQ-xla6OGplEO_DpKTgqWswXJFkl6spOuqxaM7IJtVfiQ33CSIN8KdPAxBqiltv0Xiz4o20hK5HAFOWCWA3K5vUIykl_G7-x_LGdbi_WdfPHr4BKlv-WfTRWY2A
CODEN BBIICI
CitedBy_id crossref_primary_10_1088_1748_3190_ada59c
crossref_primary_10_1063_5_0259251
crossref_primary_10_1063_5_0244010
Cites_doi 10.1088/1748-3190/ab6b6c
10.1177/0954406220915216
10.4208/cicp.2014.m385
10.1007/s42241-020-0028-y
10.1016/j.compfluid.2021.104973
10.3390/app13189995
10.4208/aamm.2014.m512
10.1145/3459991
10.1080/19942060.2017.1406872
10.1002/fld.5025
10.1088/1748-3190/ac165e
10.1007/s11432-019-2760-5
10.1017/S0263574799271172
10.1016/j.compfluid.2015.03.024
10.1177/027836498600500106
10.1016/j.oceaneng.2022.112829
10.1103/PhysRevLett.118.158004
10.1002/zamm.19310110205
10.1016/S1672-6529(09)60184-0
10.3389/fphy.2022.870273
10.1063/5.0184690
10.1177/09544062221079693
10.1038/s41598-021-81124-8
ContentType Journal Article
Copyright 2024 IOP Publishing Ltd
2024 IOP Publishing Ltd.
Copyright_xml – notice: 2024 IOP Publishing Ltd
– notice: 2024 IOP Publishing Ltd.
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1088/1748-3190/ad6544
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE
MEDLINE - Academic
CrossRef
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Anatomy & Physiology
EISSN 1748-3190
ExternalDocumentID 39025108
10_1088_1748_3190_ad6544
bbad6544
Genre Journal Article
GrantInformation_xml – fundername: the Natural Science Foundation of Chongqing
  grantid: No. cstc2020jcyjmsxmX0965
– fundername: Joint Training Base Construction Project for Graduate Students in Chongqing
  grantid: JDLHPYJD2020024
– fundername: the National Natural Science Foundation of China
  grantid: No. 52109150
GroupedDBID ---
1JI
4.4
53G
5B3
5VS
5ZH
7.M
7.Q
AAGCD
AAJIO
AAJKP
AATNI
ABHWH
ABJNI
ABQJV
ABVAM
ACAFW
ACGFS
ACHIP
AEFHF
AENEX
AFYNE
AKPSB
ALMA_UNASSIGNED_HOLDINGS
AOAED
ASPBG
ATQHT
AVWKF
AZFZN
CEBXE
CJUJL
CRLBU
CS3
DU5
EBS
EDWGO
EMSAF
EPQRW
EQZZN
F5P
HAK
IJHAN
IOP
IZVLO
KOT
LAP
N5L
N9A
P2P
PJBAE
RIN
RO9
ROL
RPA
SY9
TN5
UCJ
W28
AAYXX
ADEQX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
AEINN
ID FETCH-LOGICAL-c336t-26d3244fcc2d2d3dd9942540aad6bb1996c814ebef719b685c5bfaaaa561a32e3
IEDL.DBID IOP
ISSN 1748-3182
1748-3190
IngestDate Wed Jul 30 11:24:27 EDT 2025
Thu Apr 03 07:06:10 EDT 2025
Tue Jul 01 04:32:49 EDT 2025
Thu Apr 24 22:52:55 EDT 2025
Sun Aug 18 18:10:26 EDT 2024
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords intelligent fish
fluid-structure interaction
deep reinforcement learning
immersed boundary lattice Boltzmann method
sparse reward
Language English
License This article is available under the terms of the IOP-Standard License.
2024 IOP Publishing Ltd.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c336t-26d3244fcc2d2d3dd9942540aad6bb1996c814ebef719b685c5bfaaaa561a32e3
Notes BB-103805.R1
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0009-0008-7066-9558
0009-0009-3186-6847
PMID 39025108
PQID 3082627057
PQPubID 23479
PageCount 15
ParticipantIDs iop_journals_10_1088_1748_3190_ad6544
crossref_primary_10_1088_1748_3190_ad6544
pubmed_primary_39025108
crossref_citationtrail_10_1088_1748_3190_ad6544
proquest_miscellaneous_3082627057
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2024-09-01
PublicationDateYYYYMMDD 2024-09-01
PublicationDate_xml – month: 09
  year: 2024
  text: 2024-09-01
  day: 01
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinspiration & biomimetics
PublicationTitleAbbrev BB
PublicationTitleAlternate Bioinspir. Biomim
PublicationYear 2024
Publisher IOP Publishing
Publisher_xml – name: IOP Publishing
References Padakandla (bbad6544bib24) 2021; 54
Wu (bbad6544bib15) 2015; 17
Zhu (bbad6544bib26) 2021; 11
Buzzicotti (bbad6544bib8) 2020
Sutton (bbad6544bib23) 1999; 17
Wang (bbad6544bib6) 2020; 63
Yan (bbad6544bib10) 2020; 234
Rabault (bbad6544bib20) 2020; 32
Zhang (bbad6544bib13) 2024; 36
Diao (bbad6544bib18) 2018; 12
Liu (bbad6544bib1) 2010; 7
Garnier (bbad6544bib21) 2021; 225
Colabrese (bbad6544bib9) 2017; 118
Haarnoja (bbad6544bib22) 2018
Zhai (bbad6544bib5) 2013
Tian (bbad6544bib2) 2022; 266
Zhu (bbad6544bib25) 2023; 237
Tian (bbad6544bib7) 2020; 15
Khatib (bbad6544bib4) 1986; 5
Zhu (bbad6544bib12) 2022; 10
Yan (bbad6544bib11) 2021; 93
Zermelo (bbad6544bib3) 1931; 11
Zhang (bbad6544bib17) 2016; 8
Zhang (bbad6544bib19) 2016; 124
Zhang (bbad6544bib16) 2023; 13
Li (bbad6544bib14) 2021; 16
References_xml – volume: 15
  year: 2020
  ident: bbad6544bib7
  article-title: CFD based parameter tuning for motion control of robotic fish
  publication-title: Bioinspir. Biomim.
  doi: 10.1088/1748-3190/ab6b6c
– volume: 234
  start-page: 3397
  year: 2020
  ident: bbad6544bib10
  article-title: A numerical simulation method for bionic fish self-propelled swimming under control based on deep reinforcement learning
  publication-title: Proc. Inst. Mech. Eng. C
  doi: 10.1177/0954406220915216
– volume: 17
  start-page: 1271
  year: 2015
  ident: bbad6544bib15
  article-title: Three-dimensional simulation of balloon dynamics by the immersed boundary method coupled to the multiple-relaxation-time lattice Boltzmann method
  publication-title: Commun. Comput. Phys.
  doi: 10.4208/cicp.2014.m385
– volume: 32
  start-page: 234
  year: 2020
  ident: bbad6544bib20
  article-title: Deep reinforcement learning in fluid mechanics: a promising method for both active flow control and shape optimization
  publication-title: J. Hydrodynamics
  doi: 10.1007/s42241-020-0028-y
– start-page: pp 1861
  year: 2018
  ident: bbad6544bib22
  article-title: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor
– volume: 225
  year: 2021
  ident: bbad6544bib21
  article-title: A review on deep reinforcement learning for fluid mechanics
  publication-title: Comput. Fluids
  doi: 10.1016/j.compfluid.2021.104973
– volume: 13
  start-page: 9995
  year: 2023
  ident: bbad6544bib16
  article-title: Stability improvement of the immersed boundary–lattice Boltzmann coupling scheme by semi-implicit weighting of external force
  publication-title: Appl. Sci.
  doi: 10.3390/app13189995
– volume: 8
  start-page: 37
  year: 2016
  ident: bbad6544bib17
  article-title: Improving the stability of the multiple-relaxation-time lattice Boltzmann method by a viscosity counteracting approach
  publication-title: Adv. Appl. Math. Mech.
  doi: 10.4208/aamm.2014.m512
– volume: 54
  start-page: 1
  year: 2021
  ident: bbad6544bib24
  article-title: A survey of reinforcement learning algorithms for dynamically varying environments
  publication-title: ACM Comput. Surv.
  doi: 10.1145/3459991
– volume: 12
  start-page: 250
  year: 2018
  ident: bbad6544bib18
  article-title: Simulation of hydraulic characteristics of an inclined overflow gate by the free-surface lattice Boltzmann-immersed boundary coupling scheme
  publication-title: Eng. Appl. Comput. Fluid Mech.
  doi: 10.1080/19942060.2017.1406872
– start-page: pp 223
  year: 2020
  ident: bbad6544bib8
  article-title: Optimal control of point-to-point navigation in turbulent time dependent flows using reinforcement learning
– start-page: pp 2616
  year: 2013
  ident: bbad6544bib5
  article-title: Formation control of multiple robot fishes based on artificial potential field and leader-follower framework
– volume: 93
  start-page: 3073
  year: 2021
  ident: bbad6544bib11
  article-title: Learning how to avoid obstacles: a numerical investigation for maneuvering of self-propelled fish based on deep reinforcement learning
  publication-title: Int. J. Numer. Methods Fluids
  doi: 10.1002/fld.5025
– volume: 16
  year: 2021
  ident: bbad6544bib14
  article-title: Fish can save energy via proprioceptive sensing
  publication-title: Bioinspir. Biomim.
  doi: 10.1088/1748-3190/ac165e
– volume: 63
  start-page: 1
  year: 2020
  ident: bbad6544bib6
  article-title: Trajectory tracking control of a bionic robotic fish based on iterative learning
  publication-title: Sci. China Inf. Sci.
  doi: 10.1007/s11432-019-2760-5
– volume: 17
  start-page: 229
  year: 1999
  ident: bbad6544bib23
  article-title: Reinforcement learning: an introduction
  publication-title: Robotica
  doi: 10.1017/S0263574799271172
– volume: 124
  start-page: 246
  year: 2016
  ident: bbad6544bib19
  article-title: Accuracy improvement of the immersed boundary–lattice Boltzmann coupling scheme by iterative force correction
  publication-title: Comput. Fluids
  doi: 10.1016/j.compfluid.2015.03.024
– volume: 5
  start-page: 90
  year: 1986
  ident: bbad6544bib4
  article-title: Real-time obstacle avoidance for manipulators and mobile robots
  publication-title: Int. J. Robot. Res.
  doi: 10.1177/027836498600500106
– volume: 266
  year: 2022
  ident: bbad6544bib2
  article-title: A two-level optimization algorithm for path planning of bionic robotic fish in the three-dimensional environment with ocean currents and moving obstacles
  publication-title: Ocean Eng.
  doi: 10.1016/j.oceaneng.2022.112829
– volume: 118
  year: 2017
  ident: bbad6544bib9
  article-title: Flow navigation by smart microswimmers via reinforcement learning
  publication-title: Phys. Rev. Lett.
  doi: 10.1103/PhysRevLett.118.158004
– volume: 11
  start-page: 114
  year: 1931
  ident: bbad6544bib3
  article-title: Über das navigationsproblem bei ruhender oder veränderlicher windverteilung
  publication-title: Z. Angew. Math. Mech.
  doi: 10.1002/zamm.19310110205
– volume: 7
  start-page: 35
  year: 2010
  ident: bbad6544bib1
  article-title: Biological inspiration: from carangiform fish to multi-joint robotic fish
  publication-title: J. Bionic Eng.
  doi: 10.1016/S1672-6529(09)60184-0
– volume: 10
  year: 2022
  ident: bbad6544bib12
  article-title: Point-to-point navigation of a fish-like swimmer in a vortical flow with deep reinforcement learning
  publication-title: Front. Phys.
  doi: 10.3389/fphy.2022.870273
– volume: 36
  year: 2024
  ident: bbad6544bib13
  article-title: A numerical simulation research on fish adaption behavior based on deep reinforcement learning and fluid–structure coupling: implementation of the ‘perceive-feedback-memory’ control system
  publication-title: Phys. Fluids
  doi: 10.1063/5.0184690
– volume: 237
  start-page: 2450
  year: 2023
  ident: bbad6544bib25
  article-title: A numerical simulation of target-directed swimming for a three-link bionic fish with deep reinforcement learning
  publication-title: Proc. Inst. Mech. Eng. C
  doi: 10.1177/09544062221079693
– volume: 11
  start-page: 1691
  year: 2021
  ident: bbad6544bib26
  article-title: A numerical study of fish adaption behaviors in complex environments with a deep reinforcement learning and immersed boundary–lattice Boltzmann method
  publication-title: Sci. Rep.
  doi: 10.1038/s41598-021-81124-8
SSID ssj0054982
Score 2.3843873
Snippet The reef ecosystem plays a vital role as a habitat for fish species with limited swimming capabilities, serving not only as a sanctuary and food source but...
SourceID proquest
pubmed
crossref
iop
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 56014
SubjectTerms Algorithms
Animals
Avoidance Learning - physiology
Biomimetics - methods
Computer Simulation
Coral Reefs
Deep Learning
deep reinforcement learning
Ecosystem
Fishes - physiology
fluid-structure interaction
immersed boundary lattice Boltzmann method
intelligent fish
Models, Biological
Predatory Behavior - physiology
Reinforcement, Psychology
sparse reward
Swimming - physiology
Title Learning obstacle avoidance and predation in complex reef environments with deep reinforcement learning
URI https://iopscience.iop.org/article/10.1088/1748-3190/ad6544
https://www.ncbi.nlm.nih.gov/pubmed/39025108
https://www.proquest.com/docview/3082627057
Volume 19
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR3JSuRA9NHqZS7jNmq7UQMqeEgvlUp1gicRRYXROSh4EEJtaRo1adpuUb_eV0s3jmgjk1MgL7W8elX19gewk7VUJiSKqYxzHTGB4k4maScq8EAuqGJt1rbxzn8u-Ok1O79JbmpwMImFqfrh6G_gq08U7FEYHOLSJvLQVqWXtZpC84SxGZizhSsteZ9d_h0fwyj3uEpRATqlwUb5WQv_3Ekz2O_X7Ka7dk7m4XY8YO9tctcYDWVDvX7I5fifM1qAn4EdJYcedBFqplyC5cMSRfGHF7JHnIOo07wvQzfkYu2SSiJPifBEPFU9bemGiFKT_sD4Ck2kVxLnq26eyQBXg7wPpyNW9Uu0MX385vK2KqeiJKGARfcXXJ8cXx2dRqFOQ6TimA8jyjWyZaxQimqqY62zDE8C1hI4Gymtm7NK2wyppei0M8nTRCWyEPgg7yZiauIVmC2r0qwBiaVsKdWxQXKKKRTPacEMjyXXAolGmDo0xyuVq5DE3NbSuM-dMT1Nc4vL3OIy97isw_7kj75P4DEFdheXKA-7-HEK3O8xeeS4G62JRZSmGj3mNvkPx9EnnTqserqZ9Bpbiy62t_7NXjbgB0UOyju0bcLscDAyW8gBDeW2o_Q3bcX98A
linkProvider IOP Publishing
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpR3ZbtNAcNQUCfFSjkJJgXYrARIPzrFeb-zHChq1pZQ8UClvy16OIlrbShPU8vXMHqkA0ahS_WTJs9fs7HiunQF4W_R0IRWqqYxzkzCJ6k6h6CApkSGXVLM-67v7zl9O-eEZOx5n41jn1N-FqZvI-jv4GhIFBxTGgLi8izK0M-kVva40PGOs25iyBQ8yZMUupuvo62jJilH38dWiYoucRj_l_3r567_UwrFvFzn9r2f4GL4vJx0iTn50FnPV0b_-yed4j1U9gY0olpL9AP4U1mz1DDb3K1TJL67Je-IDRb0FfhMmMSfrhNQKZUuEJ_JnPTWOfoisDGlmNlRqItOK-Jh1e0VmuCvkz2t1xJmAibG2wW8-f6v2pkoSC1lMnsPZ8ODbx8Mk1mtIdJryeUK5QfGMlVpTQ01qTFEgR2A9iStSyoU767zPkGrKQb9QPM90pkqJD8pwMqU2fQHrVV3Zl0BSpXpaD9xlOc00qum0ZJanihuJxCNtG7rL3RI6JjN3NTXOhXeq57lw-BQOnyLgsw0fblo0IZHHCth3uE0inubLFXB7SxIReCqdq0VWtl5cCpcEiOPss0EbtgLt3IyaOs8u9rd9x1F24eHo01CcHJ1-fgWPKApVIcbtNazPZwv7BoWiudrxhP8bsR4DYw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Learning+obstacle+avoidance+and+predation+in+complex+reef+environments+with+deep+reinforcement+learning&rft.jtitle=Bioinspiration+%26+biomimetics&rft.au=Hou%2C+Ji&rft.au=He%2C+Changling&rft.au=Li%2C+Tao&rft.au=Zhang%2C+Chunze&rft.date=2024-09-01&rft.issn=1748-3190&rft.eissn=1748-3190&rft.volume=19&rft.issue=5&rft_id=info:doi/10.1088%2F1748-3190%2Fad6544&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1748-3182&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1748-3182&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1748-3182&client=summon