An audiovisual cognitive optimization strategy guided by salient object ranking for intelligent visual prothesis systems

Bibliographic Details
Published in	Journal of neural engineering Vol. 21; no. 6; pp. 66021 - 66041
Main Authors	Liang, Junling; Li, Heng; Chai, Xinyu; Gao, Qi; Zhou, Meixuan; Guo, Tianruo; Chen, Yao; Di, Liqing
Format	Journal Article
Language	English
Published	England IOP Publishing 01.12.2024
Subjects	Adult; Algorithms; Artificial Intelligence; Attention - physiology; audiovisual cognition for prosthetic vision; Auditory Perception - physiology; Cognition - physiology; Deep Learning; Depth Perception - physiology; Female; Humans; image semantic description; intelligent visual prosthesis; Male; Photic Stimulation - methods; prior knowledge; Prosthesis Design - methods; salient object ranking; Visual Perception - physiology; Visual Prosthesis
Abstract	Objective. Visual prostheses are effective tools for restoring vision, yet real-world complexities pose ongoing challenges. Progress in AI has given rise to the concept of intelligent visual prostheses with auditory support, which leverage deep learning to create practical artificial vision perception beyond merely restoring natural sight for the blind. Approach. This study introduces an object-based attention mechanism that simulates how human gaze points, when observing the external world, correspond to descriptions of physical regions. By transforming this mechanism into a ranking problem over salient entity regions, we introduce prior visual attention cues to build a new salient object ranking (SaOR) dataset and propose a SaOR network aimed at providing depth perception for prosthetic vision. Furthermore, we propose a SaOR-guided image description method that aligns with human observation patterns, so that additional visual information can be provided through auditory feedback. Finally, the integration of these two algorithms constitutes an audiovisual cognitive optimization strategy for prosthetic vision. Main results. Psychophysical experiments based on scene description tasks under simulated prosthetic vision verify that the SaOR method improves the subjects’ performance in object identification and in understanding the correlation among objects. Additionally, the cognitive optimization strategy incorporating image description further enhances their prosthetic visual cognition. Significance. This work offers valuable technical insights for designing next-generation intelligent visual prostheses and establishes a theoretical groundwork for developing their visual information processing strategies. Code will be made publicly available.
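The abstract frames attention as a ranking problem over salient object regions and then uses that ranking to guide both the prosthetic display and the spoken description. The following is a minimal sketch of that idea only, not the authors' SaOR network: the `Region` type, the saliency scores, the three-level gray palette, and the sentence template are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Region:
    label: str       # object category, e.g. "person" (hypothetical input)
    saliency: float  # assumed prior attention score, e.g. share of gaze points


def rank_regions(regions):
    """Return regions ordered from most to least salient (rank 1 first)."""
    return sorted(regions, key=lambda r: r.saliency, reverse=True)


def rank_to_gray(rank, levels=(255, 170, 85)):
    """Map a 1-based rank to a gray level; ranks past the palette share the last level.

    The palette is an illustrative assumption, not a value from the paper.
    """
    index = min(rank - 1, len(levels) - 1)
    return levels[index]


def describe(ranked):
    """Build a short description that mentions objects in rank order."""
    labels = [r.label for r in ranked]
    if not labels:
        return "No salient objects found."
    if len(labels) == 1:
        return f"The scene shows a {labels[0]}."
    return "The scene shows a " + ", then a ".join(labels) + "."


# Toy scene with made-up saliency scores.
scene = [
    Region("cup", 0.20),
    Region("person", 0.60),
    Region("table", 0.15),
    Region("chair", 0.05),
]
ranked = rank_regions(scene)
grays = [rank_to_gray(i + 1) for i in range(len(ranked))]
sentence = describe(ranked)
```

In this sketch a brighter gray level stands in for a higher rank, and the description text, which would be passed to a text-to-speech step in an audiovisual setup, names objects in the same order a viewer would be expected to attend to them.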
Author_xml
1. Liang, Junling. Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
2. Li, Heng (ORCID 0000-0002-1303-8898). Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
3. Chai, Xinyu (ORCID 0000-0003-2702-665X). Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
4. Gao, Qi. Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
5. Zhou, Meixuan. Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
6. Guo, Tianruo (ORCID 0000-0001-6348-6771). Graduate School of Biomedical Engineering, UNSW, Sydney, NSW 2052, Australia
7. Chen, Yao. Shanghai Jiao Tong University School of Biomedical Engineering, Shanghai 200240, People’s Republic of China
8. Di, Liqing. Shanghai Pudong Hospital, Shanghai 200240, People’s Republic of China
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/39569905 (View this record in MEDLINE/PubMed)
CODEN	JNEOBH
ContentType	Journal Article
Copyright	2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
DOI	10.1088/1741-2552/ad94a4
Discipline	Anatomy & Physiology
EISSN	1741-2552
ExternalDocumentID	39569905 10_1088_1741_2552_ad94a4 jnead94a4
Genre	Journal Article
GrantInformation_xml	Shanghai Jiao Tong University, grant YG2022QN077 (funder ID http://dx.doi.org/10.13039/501100004921); National Natural Science Foundation of China, grants 62073221, 62103269, 62176151 (funder ID http://dx.doi.org/10.13039/501100001809)
ISSN	1741-2560 1741-2552
IsPeerReviewed	true
IsScholarly	true
Issue	6
Keywords	image semantic description; intelligent visual prosthesis; audiovisual cognition for prosthetic vision; salient object ranking; prior knowledge
Language	English
License	This article is available under the terms of the IOP-Standard License. 2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.
ORCID	0000-0003-2702-665X 0000-0002-1303-8898 0000-0001-6348-6771
PMID	39569905
PageCount	21
PublicationCentury	2000
PublicationDate	2024-12-01
PublicationDecade	2020
PublicationPlace	England
PublicationTitle	Journal of neural engineering
PublicationTitleAbbrev	JNE
PublicationTitleAlternate	J. Neural Eng
PublicationYear	2024
Publisher	IOP Publishing
doi: 10.1109/10.121642 – start-page: 7142 year: 2018 ident: jnead94a4bib71 article-title: Revisiting salient object detection: simultaneous detection, ranking, and subitizing of multiple salient objects doi: 10.48550/arXiv.1803.05082 – volume: 45 start-page: 10555 year: 2023 ident: jnead94a4bib82 article-title: When object detection meets knowledge distillation: a survey publication-title: IEEE Trans. Pattern Anal. Mach. Intell. doi: 10.1109/TPAMI.2023.3257546 – volume: 9 start-page: 73 year: 2023 ident: jnead94a4bib51 article-title: An update on visual prosthesis publication-title: Int. J. Retina Vitreous doi: 10.1186/s40942-023-00498-1 – volume: 40 start-page: 94 year: 2016 ident: jnead94a4bib17 article-title: Image processing strategies based on a visual saliency model for object recognition under simulated prosthetic vision publication-title: Artif. Organs doi: 10.1111/aor.12498
StartPage	66021
Title	An audiovisual cognitive optimization strategy guided by salient object ranking for intelligent visual prothesis systems
URI	https://iopscience.iop.org/article/10.1088/1741-2552/ad94a4 https://www.ncbi.nlm.nih.gov/pubmed/39569905 https://www.proquest.com/docview/3131501396
Volume	21