Multilevel Deep Learning-Based Processing for Lifelog Image Retrieval Enhancement
Remembering an event or a meeting, recalling the face or the name of a person, keeping in mind what we ate or the place of a lost object is sometimes a difficult task. The human memory has its limits. In order to go beyond these limits, researchers developed sensors and wearable cameras to capture i...
Saved in:
Published in | Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics pp. 1348 - 1354 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.10.2018
|
Subjects | |
Online Access | Get full text |
ISSN | 2577-1655 |
DOI | 10.1109/SMC.2018.00236 |
Cover
Loading…
Abstract | Remembering an event or a meeting, recalling the face or the name of a person, keeping in mind what we ate or the place of a lost object is sometimes a difficult task. The human memory has its limits. In order to go beyond these limits, researchers developed sensors and wearable cameras to capture individual's experiences. This trend called lifelog has recently been the subject of several panels, workshops and benchmarks. By analyzing the lifelog tasks of these events more closely, we notice that there are still challenges in managing, analyzing, indexing, retrieving, summarizing and visualizing the captured data. In this work, we present a multilevel deep learning-based processing for lifelog image retrieval enhancement. Our proposed approach is based on five phases in which we use deep learning at several levels. The first phase consists of data pre-processing based on low-level image features to filter out irrelevant, noisy and blurred images. In the second phase, we detect and cross high-level image features using pre-trained CNN to enhance the metadata image description. Then, we manage a semantic segmentation based on the WU-Palmer measure similarity. This segmentation is performed to limit the search area and to control better the runtime and the complexity. The fourth phase consist in analyzing the query using LSTM to match concepts with queries. The final phase which based on doc2sequence aims at retrieving the images that is answering the query. |
---|---|
AbstractList | Remembering an event or a meeting, recalling the face or the name of a person, keeping in mind what we ate or the place of a lost object is sometimes a difficult task. The human memory has its limits. In order to go beyond these limits, researchers developed sensors and wearable cameras to capture individual's experiences. This trend called lifelog has recently been the subject of several panels, workshops and benchmarks. By analyzing the lifelog tasks of these events more closely, we notice that there are still challenges in managing, analyzing, indexing, retrieving, summarizing and visualizing the captured data. In this work, we present a multilevel deep learning-based processing for lifelog image retrieval enhancement. Our proposed approach is based on five phases in which we use deep learning at several levels. The first phase consists of data pre-processing based on low-level image features to filter out irrelevant, noisy and blurred images. In the second phase, we detect and cross high-level image features using pre-trained CNN to enhance the metadata image description. Then, we manage a semantic segmentation based on the WU-Palmer measure similarity. This segmentation is performed to limit the search area and to control better the runtime and the complexity. The fourth phase consist in analyzing the query using LSTM to match concepts with queries. The final phase which based on doc2sequence aims at retrieving the images that is answering the query. |
Author | Ben Amar, Chokri Feki, Ghada Ben Abdallah, Fatma Ben Ammar, Anis |
Author_xml | – sequence: 1 givenname: Fatma surname: Ben Abdallah fullname: Ben Abdallah, Fatma – sequence: 2 givenname: Ghada surname: Feki fullname: Feki, Ghada – sequence: 3 givenname: Anis surname: Ben Ammar fullname: Ben Ammar, Anis – sequence: 4 givenname: Chokri surname: Ben Amar fullname: Ben Amar, Chokri |
BookMark | eNotj81KxDAURqMoOB3dunGTF2jNTZqkXWoddaCD_-shbW9qJJMOaR3w7S3o6sDH4YOTkJMwBCTkElgGwMrrt02VcQZFxhgX6ogkIEWhlJKMH5MFl1qnoKQ8I8k4fs0Oy6FYkJfNt5-cxwN6eoe4pzWaGFzo01szYkef49DiOM4DtUOktbPoh56ud6ZH-opTdHgwnq7Cpwkt7jBM5-TUGj_ixT-X5ON-9V49pvXTw7q6qVMHWk5pk-dcsbJRUkhoG9DcGKs5Ameaa5u3ZWkbEGXH27lCFFp0smUzbIcFsFwsydXfr0PE7T66nYk_20KB4oKLX9LQTuw |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/SMC.2018.00236 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library (IEL) (UW System Shared) IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Sciences (General) |
EISBN | 1538666502 9781538666500 |
EISSN | 2577-1655 |
EndPage | 1354 |
ExternalDocumentID | 8616232 |
Genre | orig-research |
GroupedDBID | 29F 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI M43 OCL RIE RIL RIO RNS |
ID | FETCH-LOGICAL-i175t-b442609b65351cb172aaf72e120727f4c99fb139d2c8663873d5c0873fde81043 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 03:03:09 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i175t-b442609b65351cb172aaf72e120727f4c99fb139d2c8663873d5c0873fde81043 |
PageCount | 7 |
ParticipantIDs | ieee_primary_8616232 |
PublicationCentury | 2000 |
PublicationDate | 2018-Oct |
PublicationDateYYYYMMDD | 2018-10-01 |
PublicationDate_xml | – month: 10 year: 2018 text: 2018-Oct |
PublicationDecade | 2010 |
PublicationTitle | Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics |
PublicationTitleAbbrev | SMC |
PublicationYear | 2018 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0020418 |
Score | 2.080902 |
Snippet | Remembering an event or a meeting, recalling the face or the name of a person, keeping in mind what we ate or the place of a lost object is sometimes a... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1348 |
SubjectTerms | Convolutional Neural Network Deep learning Feature extraction Image retrieval Image segmentation Lifelog Long Short-Term Memory Noise measurement Retrieval Semantic Similarity Semantics Task analysis Word Embedding |
Title | Multilevel Deep Learning-Based Processing for Lifelog Image Retrieval Enhancement |
URI | https://ieeexplore.ieee.org/document/8616232 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED6VTrBAH4i3PDCAhNs4DzdZKa0KoohXpW5VbF-gAtIK0oVfzzkJpUIMTIm85OTH3XfOd98BHCstEyljzYVxQu6jMjyOKVmhVILiuavIAdp7yOGNHIz8q3EwrsDZshYGEXPyGbbsa_4v38z0wl6VtUMpKFqTw12jxK2o1VomV44vwlKUUThR-2HYtbytnChp5ZdXWqfkkaO_CcPvbxaEkZfWIlMt_flLjvG_Rm1B86dGj90uo08NKpjWYWNFXrAOtfLgfrCTUl36tAF3ecntq-UKsQvEOSsVVp_4OQU0w8rKARpghGfZ9TRBMpVdvpHjYfd5_y3anKyXPtv9Ym1rwqjfe-wOeNlXgU8JLGRc-VaWPlIy8AKhFUGYOE46LgrXITST-DqKEkXI0Lg6JEASdjwTaIceicGQ0jdvG6rpLMUdYL4KjJLKiRG1H2InErFEYaTNO1HJZBcadsYm80I6Y1JO1t7fw_uwbtes4ModQDV7X-AhxfxMHeWL_QVOoKxC |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8JAEJ4QPKgXFTC-3YMHTSx0S7u0VxECCsQHJNxId3eqRC1Ey8Vf72xbkRgPntrspZN9zPfNduYbgDOpRCREqCyubd9yUWorDClYoVCC8NyR5ADNPWR_IDoj92bsjQtwuayFQcQ0-Qyr5jX9l69namGuymq-4ITW5HDXCPfdIKvWWoZXtsv9XJaR20Htsd80mVtpqqQRYF5pnpJiR3sL-t9fzVJGXqqLRFbV5y9Bxv-atQ2Vnyo9drfEnx0oYFyCzRWBwRLs5Ef3g53n-tIXZbhPi25fTbYQu0acs1xj9cm6IkjTLK8doAFGjJb1phGSqaz7Rq6HPaQduGh7slb8bHaMsa0Co3Zr2OxYeWcFa0p0IbGka4TpAym8useVJBIThlHDQe7YxGciVwVBJIkbakf5REn8Rl17yqZHpNGnAK6-C8V4FuMeMFd6Wgpph4jK9bER8FAg18JEnihFtA9lM2OTeSaeMckn6-Dv4VNY7wz7vUmvO7g9hA2zflnm3BEUk_cFHhMDSORJuvBfiJWvkg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Conference+proceedings+-+IEEE+International+Conference+on+Systems%2C+Man%2C+and+Cybernetics&rft.atitle=Multilevel+Deep+Learning-Based+Processing+for+Lifelog+Image+Retrieval+Enhancement&rft.au=Ben+Abdallah%2C+Fatma&rft.au=Feki%2C+Ghada&rft.au=Ben+Ammar%2C+Anis&rft.au=Ben+Amar%2C+Chokri&rft.date=2018-10-01&rft.pub=IEEE&rft.eissn=2577-1655&rft.spage=1348&rft.epage=1354&rft_id=info:doi/10.1109%2FSMC.2018.00236&rft.externalDocID=8616232 |