Dual Modality Reverse Reranking (DM-RR) Based Image Retrieval Framework
Retrieval of a product with desired modifications from a vast inventory of online industrial platforms is frequently encountered in our daily life. This study presents a specialized framework to retrieve user's queried product with its desired changes incorporated. To facilitate interaction bet...
Saved in:
Published in | IEEE open journal of the Industrial Electronics Society Vol. 5; pp. 886 - 897 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 2644-1284 2644-1284 |
DOI | 10.1109/OJIES.2024.3435956 |
Cover
Loading…
Abstract | Retrieval of a product with desired modifications from a vast inventory of online industrial platforms is frequently encountered in our daily life. This study presents a specialized framework to retrieve user's queried product with its desired changes incorporated. To facilitate interaction between the end-user and agent in such scenarios, a multimodal content-based image retrieval system is essential. The system extracts textual and visual attributes, combining them through inductive learning to a unified representation. It is based on an in-depth understanding of visual characteristics that are modified by textual semantics. Lastly, a novel reverse reranking (RR) algorithm arranges the joint representation of dual modality queries and their corresponding target images for efficient retrieval. The proposed framework is novel compared to earlier methodologies. First, it achieves successful fusion of two different modalities. Second, it introduces a RR algorithm in the inference stage for efficient retrieval. The proposed framework's enhanced performance has been assessed using the Fashion-200 K and MIT-States real-world benchmark datasets. The proposed system can be used in real-world applications subject to its practical implications, such as generalization to diverse domains, availability of domain specific data, nature of the data and queries, and availability of computational resources. |
---|---|
AbstractList | Retrieval of a product with desired modifications from a vast inventory of online industrial platforms is frequently encountered in our daily life. This study presents a specialized framework to retrieve user's queried product with its desired changes incorporated. To facilitate interaction between the end-user and agent in such scenarios, a multimodal content-based image retrieval system is essential. The system extracts textual and visual attributes, combining them through inductive learning to a unified representation. It is based on an in-depth understanding of visual characteristics that are modified by textual semantics. Lastly, a novel reverse reranking (RR) algorithm arranges the joint representation of dual modality queries and their corresponding target images for efficient retrieval. The proposed framework is novel compared to earlier methodologies. First, it achieves successful fusion of two different modalities. Second, it introduces a RR algorithm in the inference stage for efficient retrieval. The proposed framework's enhanced performance has been assessed using the Fashion-200 K and MIT-States real-world benchmark datasets. The proposed system can be used in real-world applications subject to its practical implications, such as generalization to diverse domains, availability of domain specific data, nature of the data and queries, and availability of computational resources. |
Author | Latif, Rabia Ahmed, Ikhlaq Khan, Zafran Jamail, Nor Shahida Mohd Iltaf, Naima |
Author_xml | – sequence: 1 givenname: Ikhlaq surname: Ahmed fullname: Ahmed, Ikhlaq email: iahmed.phdsemcs@student.nust.edu.pk organization: Department of Computer Software Engineering, National University of Sciences and Technology Islamabad, Rawalpindi, Pakistan – sequence: 2 givenname: Naima orcidid: 0000-0001-5392-5187 surname: Iltaf fullname: Iltaf, Naima email: naima@mcs.edu.pk organization: Department of Computer Software Engineering, National University of Sciences and Technology Islamabad, Rawalpindi, Pakistan – sequence: 3 givenname: Rabia orcidid: 0000-0001-5304-5948 surname: Latif fullname: Latif, Rabia email: rlatif@psu.edu.sa organization: College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia – sequence: 4 givenname: Nor Shahida Mohd surname: Jamail fullname: Jamail, Nor Shahida Mohd email: njamail@psu.edu.sa organization: College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia – sequence: 5 givenname: Zafran orcidid: 0000-0001-5543-7750 surname: Khan fullname: Khan, Zafran email: zafrankhan1830@gm.gist.ac.kr organization: School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, South Korea |
BookMark | eNpNUE1PwkAU3BhMROQPGA9NvOihuN9tjwqoGAgJct-8bl9J-ejitmD89xYwhtO8vDczbzLXpFW6Egm5ZbTHGE2eph-j4WePUy57QgqVKH1B2lxLGTIey9bZfEW6VbWklHLFGBO6Td4GO1gHE5fBuqh_ghnu0VfYoIdyVZSL4GEwCWezx-AFKsyC0QYWh2vtC9w3wlcPG_x2fnVDLnNYV9j9ww6Zvw7n_fdwPH0b9Z_HoRUqqsMcIU1RxxTSSCqlAFSqI241gFW51YpjKqi0FIHKOGaUJ4pFCtI4YSkmokNGJ9vMwdJsfbEB_2McFOa4cH5hwNeFXaOxUZJxGmmpeSTjLAGW5kzaXKIQFjhtvO5PXlvvvnZY1Wbpdr5s0hvBKNW8yccbFj-xrHdV5TH__8qoOdRvjvWbQ_3mr_5GdHcSFYh4JtBMRkksfgFkmIBa |
CODEN | IOJIAJ |
Cites_doi | 10.1109/WACV48630.2021.00118 10.1109/CVPR.2011.5995373 10.1109/ICCV.2015.154 10.1109/CCNC.2013.6488510 10.1109/ACCESS.2023.3313977 10.1609/aaai.v32i1.11671 10.1016/j.asoc.2021.107552 10.1109/CVPR.2016.90 10.1145/1873951.1873977 10.1109/CVPR.2015.7298682 10.1109/TMM.2016.2605058 10.3390/s22062188 10.1007/978-3-319-14445-0_10 10.1109/ICME55011.2023.00027 10.3390/s23208362 10.3390/life13102091 10.1109/CVPR.2016.11 10.1109/ICCV.2015.11 10.1007/s11042-014-1949-7 10.1109/TPAMI.2016.2577031 10.1109/ICECET52533.2021.9698617 10.1109/TGRS.2022.3163706 10.1109/TCE.2023.3319565 10.1109/CVPR.2012.6248031 10.1109/CVPR.2015.7298935 10.1016/j.ins.2022.08.119 10.1007/978-3-030-01246-5_11 10.1145/1180639.1180654 10.1109/CVPR.2019.00660 10.1109/CVPR42600.2020.00307 10.48550/arXiv.1810.04805 10.1109/WACV56688.2023.00107 10.1016/j.ins.2023.119641 10.1016/j.cosrev.2023.100596 10.1109/DASC-PICom-DataCom-CyberSciTec.2017.214 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024 |
DBID | 97E ESBDL RIA RIE AAYXX CITATION 7SP 8FD L7M DOA |
DOI | 10.1109/OJIES.2024.3435956 |
DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE Xplore Open Access (Activated by CARLI) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE/IET Electronic Library (IEL) (UW System Shared) CrossRef Electronics & Communications Abstracts Technology Research Database Advanced Technologies Database with Aerospace DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef Technology Research Database Advanced Technologies Database with Aerospace Electronics & Communications Abstracts |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: RIE name: IEEE/IET Electronic Library (IEL) (UW System Shared) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISSN | 2644-1284 |
EndPage | 897 |
ExternalDocumentID | oai_doaj_org_article_c79d2076462748d9a1bf14cf4e33ca20 10_1109_OJIES_2024_3435956 10614798 |
Genre | orig-research |
GrantInformation_xml | – fundername: Prince Sultan University for paying the Article Processing Charges |
GroupedDBID | 0R~ 97E AAJGR ABAZT ABVLG ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ EBS ESBDL GROUPED_DOAJ JAVBF M~E OCL OK1 RIA RIE AAYXX CITATION 7SP 8FD L7M |
ID | FETCH-LOGICAL-c357t-feabbe680ab74555aa5b672c6aac5fc652eb304c0ea048810295175ab891be93 |
IEDL.DBID | RIE |
ISSN | 2644-1284 |
IngestDate | Wed Aug 27 00:53:24 EDT 2025 Mon Jun 30 16:42:11 EDT 2025 Tue Jul 01 02:06:17 EDT 2025 Wed Aug 27 01:56:59 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
License | https://creativecommons.org/licenses/by-nc-nd/4.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c357t-feabbe680ab74555aa5b672c6aac5fc652eb304c0ea048810295175ab891be93 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ORCID | 0000-0001-5543-7750 0000-0001-5304-5948 0000-0001-5392-5187 |
OpenAccessLink | https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/document/10614798 |
PQID | 3100625552 |
PQPubID | 5075781 |
PageCount | 12 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_c79d2076462748d9a1bf14cf4e33ca20 crossref_primary_10_1109_OJIES_2024_3435956 proquest_journals_3100625552 ieee_primary_10614798 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 20240000 2024-00-00 20240101 2024-01-01 |
PublicationDateYYYYMMDD | 2024-01-01 |
PublicationDate_xml | – year: 2024 text: 20240000 |
PublicationDecade | 2020 |
PublicationPlace | New York |
PublicationPlace_xml | – name: New York |
PublicationTitle | IEEE open journal of the Industrial Electronics Society |
PublicationTitleAbbrev | OJIES |
PublicationYear | 2024 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref35 ref34 Shoib (ref6) 2023 ref15 ref37 ref14 ref36 ref31 ref30 ref11 ref10 ref32 ref2 ref1 ref17 ref39 ref16 ref19 ref18 ref24 ref23 ref26 ref25 ref20 Santoro (ref38) 2017; 30 ref22 ref21 Krizhevsky (ref12) 2012 ref28 ref27 ref29 ref8 ref7 Simonyan (ref13) 2014 Annamoradnejad (ref33) 2020 ref9 ref4 ref3 ref5 ref40 |
References_xml | – ident: ref3 doi: 10.1109/WACV48630.2021.00118 – ident: ref22 doi: 10.1109/CVPR.2011.5995373 – ident: ref17 doi: 10.1109/ICCV.2015.154 – ident: ref28 doi: 10.1109/CCNC.2013.6488510 – ident: ref29 doi: 10.1109/ACCESS.2023.3313977 – ident: ref39 doi: 10.1609/aaai.v32i1.11671 – ident: ref21 doi: 10.1016/j.asoc.2021.107552 – ident: ref31 doi: 10.1109/CVPR.2016.90 – year: 2023 ident: ref6 article-title: Methods and advancement of content-based fashion image retrieval: A review – ident: ref27 doi: 10.1145/1873951.1873977 – ident: ref35 doi: 10.1109/CVPR.2015.7298682 – ident: ref20 doi: 10.1109/TMM.2016.2605058 – ident: ref7 doi: 10.3390/s22062188 – ident: ref19 doi: 10.1007/978-3-319-14445-0_10 – ident: ref5 doi: 10.1109/ICME55011.2023.00027 – ident: ref14 doi: 10.3390/s23208362 – ident: ref8 doi: 10.3390/life13102091 – ident: ref37 doi: 10.1109/CVPR.2016.11 – ident: ref34 doi: 10.1109/ICCV.2015.11 – ident: ref18 doi: 10.1007/s11042-014-1949-7 – ident: ref23 doi: 10.1109/TPAMI.2016.2577031 – ident: ref4 doi: 10.1109/ICECET52533.2021.9698617 – ident: ref30 doi: 10.1109/TGRS.2022.3163706 – year: 2020 ident: ref33 article-title: Colbert: Using bert sentence embedding for humor detection – ident: ref16 doi: 10.1109/TCE.2023.3319565 – ident: ref25 doi: 10.1109/CVPR.2012.6248031 – start-page: 1106 volume-title: Proc. Adv. Neural Inf. Process. Syst. year: 2012 ident: ref12 article-title: ImageNet classification with deep convolutional neural networks – ident: ref36 doi: 10.1109/CVPR.2015.7298935 – ident: ref1 doi: 10.1016/j.ins.2022.08.119 – ident: ref9 doi: 10.1007/978-3-030-01246-5_11 – ident: ref26 doi: 10.1145/1180639.1180654 – ident: ref10 doi: 10.1109/CVPR.2019.00660 – year: 2014 ident: ref13 article-title: Very deep convolutional networks for large-scale image recognition – ident: ref11 doi: 10.1109/CVPR42600.2020.00307 – ident: ref32 doi: 10.48550/arXiv.1810.04805 – ident: ref40 doi: 10.1109/WACV56688.2023.00107 – ident: ref2 doi: 10.1016/j.ins.2023.119641 – ident: ref15 doi: 10.1016/j.cosrev.2023.100596 – ident: ref24 doi: 10.1109/DASC-PICom-DataCom-CyberSciTec.2017.214 – volume: 30 start-page: 4974 year: 2017 ident: ref38 article-title: A simple neural network module for relational reasoning publication-title: Adv. Neural Inf. Process. Syst. |
SSID | ssj0002511136 |
Score | 2.2430544 |
Snippet | Retrieval of a product with desired modifications from a vast inventory of online industrial platforms is frequently encountered in our daily life. This study... |
SourceID | doaj proquest crossref ieee |
SourceType | Open Website Aggregation Database Index Database Publisher |
StartPage | 886 |
SubjectTerms | Algorithms Availability Bidirectional control bidirectional encoder representation from transformer (BERT) collaborative embeddings composition Computer architecture Encoding Feature extraction Image enhancement Image retrieval inference-based learning Neural networks Queries Representations residual neural network-50 (RESNET-50) Residual neural networks Retrieval reverse reranking (RR) Semantics Text analysis Textual and visual embedding generations Transformers Visualization |
SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV09T8MwELVQJxgQH0W0FJSBAYRCk_grGSmllEoFqSpSN-vsOBstgvb_c5ekKIiBhSlScpKTd7bvPdu5Y-wSBGjrUx5aiEQoUudDy4s45FJ5ixHJx0BCcfqsxq9ispCLRqkvOhNWpQeugOs7neUJim1BRWLSPIPYFrFwhfCcO0hKtY4xryGmaA4m4hzTvmS3TqrZf5kgvUI9mIhbLuhvVPUjEpUJ--sKK7-m5TLWjA7Yfk0Sg7vq5Q7Zjl8esb1G6sBj9jjcoMV0lZc8Oph5Ol7h8UpF2NEiuBpOw9nsOhhgmMqDpzecN_Aplc_CvhWMtmey2mw-epjfj8O6KELouNTrsPBgrVdpBFYLKSWAtEonTgE4WTglE5THkXCRBxqcyB-QQ2kJNs1i6zN-wlrL1dKfskBk6CgRR5DaQkjEVVnJFaKaC-BcQ4fdbPEx71XqC1NKhigzJZqG0DQ1mh02IAi_LSltdXkDnWlqZ5q_nNlhbXJAozlkDzpLO6y39Yipx9inoa0JVG9SJt3_aPuM7dL3VMsrPdZaf2z8ORKOtb0o-9YXvOPOaQ priority: 102 providerName: Directory of Open Access Journals |
Title | Dual Modality Reverse Reranking (DM-RR) Based Image Retrieval Framework |
URI | https://ieeexplore.ieee.org/document/10614798 https://www.proquest.com/docview/3100625552 https://doaj.org/article/c79d2076462748d9a1bf14cf4e33ca20 |
Volume | 5 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB61PZUDhVLEllLl0AOoyjaJX8mR0m4f0hZp1Uq9WWNnckHsIrp74cBvZ8bJVgWExCWJEkdjzdie-ex5AByhRheoVnnAQue6jpQH1ZW5MpYCayQqUYDi9MZe3unre3M_BKunWBgiSs5nNJbHdJbfLuJKtspOEnxxTb0Jm4zc-mCtxw0VsZVLOYrcH_Jonny-ZouKIWClx0pLAKr9TfmkHP1DUZW_VuKkXiY7cLPuWO9V8mW8WoZx_PFHzsb_7vkLeD4YmtnHfmS8hA2a78KzJ-kHX8HF2YpbTBdtssWzGYmLBvFdCrlzi-z92TSfzT5kp6zq2uzqK689_FVKcPH4zCZrv649uJ2c3366zIfCCnlUxi3zjjAEsnWBwWljDKIJ1lXRIkbTRWsqhtiFjgWhTHC2QdgOcwZD3ZSBGvUatuaLOb2BTDcsbF0WWIdOGyyDDUZZpWKrUSmHIzheM9x_69Nn-AQ7isYn8XgRjx_EM4JTkcljS0l9nV4wL_0wk3x0TVsVzmqpGlS3DRPtSh07TUwWq2IEe8L_J-R61o_gYC1iP8zTBy_HG4wAjan2__HbW9iWLva7Lgewtfy-ondshyzDYcLvfJ3-PD9MY_EXbVLa-w |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9QwEB5BOQAHnq3YUiAHDiCUbRK_kiOlLNvSXaTVIvVmjZ3JBbGLYPfCr2fGyVYFhMQpUezIoxnb8409D4CXqNEFqlUesNC5riPlQXVlroylwBqJShRDcTa308_6_NJcDsHqKRaGiJLzGY3lNd3lt-u4laOy42S-uKa-CbdY8eumD9e6OlIRtFzKZeThkEnz-NM5Yyo2Ais9VlpCUO1v6idl6R_Kqvy1FycFM7kP8x1pvV_Jl_F2E8bx5x9ZG_-b9gdwb4Ca2dt-bjyEG7R6BHevJSB8DB9Ot9xjtm4TGs8WJE4axE8p5c49slens3yxeJ2dsLJrs7OvvPtwqxTh4hmaTXaeXfuwnLxfvpvmQ2mFPCrjNnlHGALZusDgtDEG0QTrqmgRo-miNRUb2YWOBaEscUYhjMScwVA3ZaBGHcDear2iJ5DphsWtywLr0GmDZbDBKKtUbDUq5XAEb3YM99_6BBo-GR5F45N4vIjHD-IZwYnI5KqnJL9OH5iXflhLPrqmrQpntdQNqtuGB-1KHTtNPCxWxQj2hf_XhutZP4KjnYj9sFJ_eLngYBvQmOrwH7-9gNvT5ezCX5zNPz6FO0JufwZzBHub71t6xqhkE56nufgLw5bcJQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Dual+Modality+Reverse+Reranking+%28DM-RR%29+Based+Image+Retrieval+Framework&rft.jtitle=IEEE+open+journal+of+the+Industrial+Electronics+Society&rft.au=Ahmed%2C+Ikhlaq&rft.au=Iltaf%2C+Naima&rft.au=Latif%2C+Rabia&rft.au=Jamail%2C+Nor+Shahida+Mohd&rft.date=2024&rft.issn=2644-1284&rft.eissn=2644-1284&rft.volume=5&rft.spage=886&rft.epage=897&rft_id=info:doi/10.1109%2FOJIES.2024.3435956&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_OJIES_2024_3435956 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2644-1284&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2644-1284&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2644-1284&client=summon |