Multilevel Deep Learning-Based Processing for Lifelog Image Retrieval Enhancement

Bibliographic Details
Published in Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics pp. 1348 - 1354
Main Authors Ben Abdallah, Fatma; Feki, Ghada; Ben Ammar, Anis; Ben Amar, Chokri
Format Conference Proceeding
Language English
Published IEEE 01.10.2018
ISSN 2577-1655
DOI 10.1109/SMC.2018.00236

Abstract Remembering an event or a meeting, recalling the face or the name of a person, or keeping in mind what we ate or where we left a lost object is sometimes a difficult task. Human memory has its limits. To go beyond these limits, researchers have developed sensors and wearable cameras that capture individuals' experiences. This trend, called lifelog, has recently been the subject of several panels, workshops and benchmarks. Analyzing the lifelog tasks of these events more closely, we notice that challenges remain in managing, analyzing, indexing, retrieving, summarizing and visualizing the captured data. In this work, we present a multilevel deep learning-based processing approach for lifelog image retrieval enhancement. The proposed approach consists of five phases and uses deep learning at several levels. The first phase is data pre-processing based on low-level image features, which filters out irrelevant, noisy and blurred images. In the second phase, we detect and cross high-level image features using pre-trained CNNs to enhance the image metadata description. In the third phase, we perform a semantic segmentation based on the Wu-Palmer similarity measure. This segmentation limits the search area and gives better control over runtime and complexity. The fourth phase consists of analyzing the query using an LSTM to match concepts with queries. The final phase, which is based on doc2sequence, aims at retrieving the images that answer the query.
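The Wu-Palmer measure named in the third phase scores two concepts as 2 * depth(LCS) / (depth(a) + depth(b)), where LCS is their deepest common ancestor in an is-a taxonomy. The sketch below is only an illustration of that formula, not the authors' implementation: the `PARENT` taxonomy, the `segment` helper, the greedy grouping strategy and the 0.7 threshold are all assumptions introduced here, and the record does not say how the paper actually groups concepts.

```python
# Hypothetical toy is-a taxonomy (child -> parent); not taken from the paper.
PARENT = {
    "entity": None,
    "food": "entity",
    "place": "entity",
    "fruit": "food",
    "dish": "food",
    "apple": "fruit",
    "banana": "fruit",
    "pizza": "dish",
    "kitchen": "place",
}


def path_to_root(concept):
    """Return [concept, parent, ..., root]."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = PARENT[concept]
    return path


def depth(concept):
    """Depth counted in nodes, so the root has depth 1."""
    return len(path_to_root(concept))


def wu_palmer(a, b):
    """Wu-Palmer similarity: 2 * depth(LCS) / (depth(a) + depth(b))."""
    ancestors_a = set(path_to_root(a))
    # Walking upward from b, the first node that is also an ancestor of a
    # is the deepest common subsumer (LCS).
    lcs = next(c for c in path_to_root(b) if c in ancestors_a)
    return 2 * depth(lcs) / (depth(a) + depth(b))


def segment(concepts, threshold=0.7):
    """Assumed greedy grouping: join the first group whose seed is similar enough."""
    groups = []
    for concept in concepts:
        for group in groups:
            if wu_palmer(group[0], concept) >= threshold:
                group.append(concept)
                break
        else:
            groups.append([concept])
    return groups


if __name__ == "__main__":
    print(wu_palmer("apple", "banana"))
    print(segment(["apple", "banana", "pizza", "kitchen"]))
```

In this toy taxonomy, "apple" and "banana" meet at "fruit" and score 0.75, while "apple" and "pizza" only meet at "food" and score 0.5, so a 0.7 threshold keeps the two fruits in one segment and leaves the other concepts in segments of their own. Restricting a search to the segment closest to the query concept is one plausible reading of how such a segmentation could limit the search area.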
CODEN IEEPAD
ContentType Conference Proceeding
Discipline Engineering
Sciences (General)
EISBN 1538666502
9781538666500
EISSN 2577-1655
EndPage 1354
ExternalDocumentID 8616232
Genre orig-research
IsPeerReviewed false
IsScholarly true
Language English
PageCount 7
PublicationCentury 2000
PublicationDate 2018-Oct
PublicationDateYYYYMMDD 2018-10-01
PublicationDate_xml – month: 10
  year: 2018
  text: 2018-Oct
PublicationDecade 2010
PublicationTitle Conference proceedings - IEEE International Conference on Systems, Man, and Cybernetics
PublicationTitleAbbrev SMC
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
SourceID ieee
SourceType Publisher
StartPage 1348
SubjectTerms Convolutional Neural Network
Deep learning
Feature extraction
Image retrieval
Image segmentation
Lifelog
Long Short-Term Memory
Noise measurement
Retrieval
Semantic Similarity
Semantics
Task analysis
Word Embedding
Title Multilevel Deep Learning-Based Processing for Lifelog Image Retrieval Enhancement
URI https://ieeexplore.ieee.org/document/8616232