An audiovisual cognitive optimization strategy guided by salient object ranking for intelligent visual prothesis systems

Bibliographic Details
Published in Journal of Neural Engineering Vol. 21; no. 6; pp. 66021 - 66041
Main Authors Liang, Junling, Li, Heng, Chai, Xinyu, Gao, Qi, Zhou, Meixuan, Guo, Tianruo, Chen, Yao, Di, Liqing
Format Journal Article
Language English
Published England IOP Publishing 01.12.2024
Abstract Objective. Visual prostheses are effective tools for restoring vision, yet real-world complexities pose ongoing challenges. Progress in AI has led to the concept of intelligent visual prostheses with auditory support, which leverage deep learning to create practical artificial vision perception that goes beyond merely restoring natural sight for the blind. Approach. This study introduces an object-based attention mechanism that simulates human gaze points when observing the external world and maps them to descriptions of physical regions. By transforming this mechanism into a ranking problem over salient entity regions, we introduce prior visual attention cues to build a new salient object ranking (SaOR) dataset, and propose a SaOR network aimed at providing depth perception for prosthetic vision. Furthermore, we propose a SaOR-guided image description method that aligns with human observation patterns, in order to provide additional visual information through auditory feedback. Finally, the integration of these two algorithms constitutes an audiovisual cognitive optimization strategy for prosthetic vision. Main results. Through psychophysical experiments based on scene description tasks under simulated prosthetic vision, we verify that the SaOR method improves the subjects’ performance in object identification and in understanding the correlation among objects. Additionally, the cognitive optimization strategy incorporating image description further enhances their prosthetic visual cognition. Significance. This work offers valuable technical insights for designing next-generation intelligent visual prostheses and establishes a theoretical groundwork for developing their visual information processing strategies. Code will be made publicly available.
Author_xml – sequence: 1
  givenname: Junling
  surname: Liang
  fullname: Liang, Junling
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 2
  givenname: Heng
  orcidid: 0000-0002-1303-8898
  surname: Li
  fullname: Li, Heng
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 3
  givenname: Xinyu
  orcidid: 0000-0003-2702-665X
  surname: Chai
  fullname: Chai, Xinyu
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 4
  givenname: Qi
  surname: Gao
  fullname: Gao, Qi
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 5
  givenname: Meixuan
  surname: Zhou
  fullname: Zhou, Meixuan
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 6
  givenname: Tianruo
  orcidid: 0000-0001-6348-6771
  surname: Guo
  fullname: Guo, Tianruo
  organization: Graduate School of Biomedical Engineering, UNSW, Sydney, NSW 2052, Australia
– sequence: 7
  givenname: Yao
  surname: Chen
  fullname: Chen, Yao
  organization: Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
– sequence: 8
  givenname: Liqing
  surname: Di
  fullname: Di, Liqing
  organization: Shanghai Pudong Hospital, Shanghai 200240, People’s Republic of China
CODEN JNEOBH
ContentType Journal Article
Copyright 2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
DOI 10.1088/1741-2552/ad94a4
Discipline Anatomy & Physiology
EISSN 1741-2552
ExternalDocumentID 39569905
10_1088_1741_2552_ad94a4
jnead94a4
Genre Journal Article
GrantInformation_xml – fundername: Shanghai Jiao Tong University
  sequence: 0
  grantid: YG2022QN077
  funderid: http://dx.doi.org/10.13039/501100004921
– fundername: National Natural Science Foundation of China
  sequence: 0
  grantid: 62073221; 62103269; 62176151
  funderid: http://dx.doi.org/10.13039/501100001809
ISSN 1741-2560
1741-2552
IsPeerReviewed true
IsScholarly true
Issue 6
Keywords image semantic description
intelligent visual prosthesis
audiovisual cognition for prosthetic vision
salient object ranking
prior knowledge
Language English
License This article is available under the terms of the IOP-Standard License.
2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
ORCID 0000-0003-2702-665X
0000-0002-1303-8898
0000-0001-6348-6771
PMID 39569905
PageCount 21
PublicationDate 2024-12-01
PublicationDateYYYYMMDD 2024-12-01
PublicationDate_xml – month: 12
  year: 2024
  text: 2024-12-01
  day: 01
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Journal of neural engineering
PublicationTitleAbbrev JNE
PublicationTitleAlternate J. Neural Eng
PublicationYear 2024
Publisher IOP Publishing
Publisher_xml – sequence: 0
  name: IOP Publishing
– volume: 182
  start-page: 58
  year: 2021
  ident: jnead94a4bib13
  article-title: Multisensory perception in Argus II retinal prosthesis patients: leveraging auditory-visual mappings to enhance prosthesis outcomes
  publication-title: Vis. Res.
  doi: 10.1016/j.visres.2021.01.008
– start-page: 7263
  year: 2017
  ident: jnead94a4bib70
  article-title: YOLO9000: better, faster, stronger
  doi: 10.1109/CVPR.2017.690
– start-page: 3608
  year: 2020
  ident: jnead94a4bib73
  article-title: Knowledge enhanced event causality identification with mention masking generalizations
  doi: 10.24963/ijcai.2020/495
– volume: 9
  start-page: 95
  year: 2023
  ident: jnead94a4bib77
  article-title: Neural mechanisms of top–down divided and selective spatial attention in visual and auditory perception
  publication-title: Brain Sci. Adv.
  doi: 10.26599/BSA.2023.9050008
– start-page: 2106
  year: 2009
  ident: jnead94a4bib62
  article-title: Learning to predict where humans look
  doi: 10.1109/ICCV.2009.5459462
– volume: 2
  start-page: 264
  year: 2011
  ident: jnead94a4bib54
  article-title: Influences of multisensory experience on subsequent unisensory processing
  publication-title: Front. Psychol.
  doi: 10.3389/fpsyg.2011.00264
– start-page: 133
  year: 2020
  ident: jnead94a4bib11
  article-title: Newer techniques in vision restoration and rehabilitation
  doi: 10.1007/978-981-13-9795-0_9
– start-page: 15
  year: 2010
  ident: jnead94a4bib25
  article-title: Every picture tells a story: generating sentences from images
  doi: 10.1007/978-3-642-15561-1_2
– volume: 20
  year: 2023
  ident: jnead94a4bib9
  article-title: Artificial intelligence techniques for retinal prostheses: a comprehensive review and future direction
  publication-title: J. Neural Eng.
  doi: 10.1088/1741-2552/acb295
– start-page: 8307
  year: 2018
  ident: jnead94a4bib39
  article-title: Show, control and tell: a framework for generating controllable and grounded captions
  doi: 10.48550/arXiv.1811.10652
– volume: 39
  start-page: 112
  year: 1992
  ident: jnead94a4bib41
  article-title: An experimental system for auditory image representations
  publication-title: IEEE Trans. Biomed. Eng.
  doi: 10.1109/10.121642
– start-page: 7142
  year: 2018
  ident: jnead94a4bib71
  article-title: Revisiting salient object detection: simultaneous detection, ranking, and subitizing of multiple salient objects
  doi: 10.48550/arXiv.1803.05082
– volume: 45
  start-page: 10555
  year: 2023
  ident: jnead94a4bib82
  article-title: When object detection meets knowledge distillation: a survey
  publication-title: IEEE Trans. Pattern Anal. Mach. Intell.
  doi: 10.1109/TPAMI.2023.3257546
– volume: 9
  start-page: 73
  year: 2023
  ident: jnead94a4bib51
  article-title: An update on visual prosthesis
  publication-title: Int. J. Retina Vitreous
  doi: 10.1186/s40942-023-00498-1
– volume: 40
  start-page: 94
  year: 2016
  ident: jnead94a4bib17
  article-title: Image processing strategies based on a visual saliency model for object recognition under simulated prosthetic vision
  publication-title: Artif. Organs
  doi: 10.1111/aor.12498
Snippet Objective. Visual prostheses are effective tools for restoring vision, yet real-world complexities pose ongoing challenges. The progress in AI has led to the...
SourceID proquest
pubmed
crossref
iop
SourceType Aggregation Database
Index Database
Publisher
StartPage 66021
SubjectTerms Adult
Algorithms
Artificial Intelligence
Attention - physiology
audiovisual cognition for prosthetic vision
Auditory Perception - physiology
Cognition - physiology
Deep Learning
Depth Perception - physiology
Female
Humans
image semantic description
intelligent visual prosthesis
Male
Photic Stimulation - methods
prior knowledge
Prosthesis Design - methods
salient object ranking
Visual Perception - physiology
Visual Prosthesis
Title An audiovisual cognitive optimization strategy guided by salient object ranking for intelligent visual prothesis systems
URI https://iopscience.iop.org/article/10.1088/1741-2552/ad94a4
https://www.ncbi.nlm.nih.gov/pubmed/39569905
https://www.proquest.com/docview/3131501396
Volume 21
linkProvider IOP Publishing