Attention Prediction in Egocentric Video Using Motion and Visual Saliency

We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual saliency are often employed to predict human attention; however, its mechanism and effectiveness have not been fully explored in egocentric vi...

Full description

Saved in:
Bibliographic Details
Published inAdvances in Image and Video Technology Vol. 7087; pp. 277 - 288
Main Authors Yamada, Kentaro, Sugano, Yusuke, Okabe, Takahiro, Sato, Yoichi, Sugimoto, Akihiro, Hiraki, Kazuo
Format Book Chapter
LanguageEnglish
Published Germany Springer Berlin / Heidelberg 2011
Springer Berlin Heidelberg
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN9783642253669
3642253660
ISSN0302-9743
1611-3349
DOI10.1007/978-3-642-25367-6_25

Cover

Abstract We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual saliency are often employed to predict human attention; however, its mechanism and effectiveness have not been fully explored in egocentric vision. The purpose of our framework is to compute attention maps from an egocentric video that can be used to infer a person’s visual attention. In addition to a standard visual saliency model, two kinds of attention maps are computed based on a camera’s rotation velocity and direction of movement. These rotation-based and translation-based attention maps are aggregated with a bottom-up saliency map to enhance the accuracy with which the person’s gaze positions can be predicted. The efficiency of the proposed framework was examined in real environments by using a head-mounted gaze tracker, and we found that the egomotion-based attention maps contributed to accurately predicting human visual attention.
AbstractList We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual saliency are often employed to predict human attention; however, its mechanism and effectiveness have not been fully explored in egocentric vision. The purpose of our framework is to compute attention maps from an egocentric video that can be used to infer a person’s visual attention. In addition to a standard visual saliency model, two kinds of attention maps are computed based on a camera’s rotation velocity and direction of movement. These rotation-based and translation-based attention maps are aggregated with a bottom-up saliency map to enhance the accuracy with which the person’s gaze positions can be predicted. The efficiency of the proposed framework was examined in real environments by using a head-mounted gaze tracker, and we found that the egomotion-based attention maps contributed to accurately predicting human visual attention.
Author Okabe, Takahiro
Hiraki, Kazuo
Sugimoto, Akihiro
Sato, Yoichi
Sugano, Yusuke
Yamada, Kentaro
Author_xml – sequence: 1
  givenname: Kentaro
  surname: Yamada
  fullname: Yamada, Kentaro
  email: yamada@iis.u-tokyo.ac.jp
  organization: The University of Tokyo, Tokyo, Japan
– sequence: 2
  givenname: Yusuke
  surname: Sugano
  fullname: Sugano, Yusuke
  email: sugano@iis.u-tokyo.ac.jp
  organization: The University of Tokyo, Tokyo, Japan
– sequence: 3
  givenname: Takahiro
  surname: Okabe
  fullname: Okabe, Takahiro
  email: takahiro@iis.u-tokyo.ac.jp
  organization: The University of Tokyo, Tokyo, Japan
– sequence: 4
  givenname: Yoichi
  surname: Sato
  fullname: Sato, Yoichi
  email: ysato@iis.u-tokyo.ac.jp
  organization: The University of Tokyo, Tokyo, Japan
– sequence: 5
  givenname: Akihiro
  surname: Sugimoto
  fullname: Sugimoto, Akihiro
  email: sugimoto@nii.ac.jp
  organization: National Institute of Informatics, Tokyo, Japan
– sequence: 6
  givenname: Kazuo
  surname: Hiraki
  fullname: Hiraki, Kazuo
  email: khiraki@idea.c.u-tokyo.ac.jp
  organization: The University of Tokyo, Tokyo, Japan
BookMark eNpVkE1OwzAQhQ0URCm9AYtcwDD2xHa8rCp-KhWBBGVruc60pFRJidMFt8dN2TALj_XGb_T8XbFB3dTE2I2AWwFg7qwpOHKdSy4VasO1k-qEjZOMSew1fcqGQgvBEXN79m-m7YANAUFya3K8YEOjldUgc7hk4xg3kEqDhaIYstmk66juqqbOXlsqq9Bfqzq7XzchDdoqZB9VSU22iFW9zp6b_oGvyyTHvd9mb35bUR1-rtn5ym8jjf_6iC0e7t-nT3z-8jibTuZ8I2WueDrIFGFpgMjTqtRGIglp0SrlQ2kx96rUwWigoHKDoMMSCsKUuVArEXDE5HFv3LUpEbVu2TRf0QlwB3YugXDoEgrXc3IHdsmUH027tvneU-wcHVz9D_02fPpdR210CAasRSdtcluBv85Xbdw
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2011
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2011
DBID FFUUA
DEWEY 621.367
DOI 10.1007/978-3-642-25367-6_25
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Visual Arts
Engineering
Computer Science
EISBN 9783642253676
3642253679
EISSN 1611-3349
Editor Ho, Yo-Sung
Editor_xml – sequence: 1
  fullname: Ho, Yo-Sung
EndPage 288
ExternalDocumentID EBC3070993_292_291
GroupedDBID 089
0D6
0DA
2HV
38.
A4J
AABBV
AAFYB
AAPKO
ABBVZ
ABFCL
ABFCV
ABMNI
ABTMC
ADWNV
AEDXK
AEHWL
AEJLV
AEKFX
AETDV
AEZAY
AIJHZ
AIMOO
ALMA_UNASSIGNED_HOLDINGS
AZZ
BBABE
CZZ
FFUUA
I4C
IEZ
IX.
MA.
PH7
PI1
SBO
TPJZQ
TSXQS
Z7Z
Z81
Z83
Z88
-DT
-GH
-~X
1SB
29L
2HA
5QI
875
AASHB
ACGFS
ADCXD
AEFIE
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-j2245-224e78cb70eeaefd6723e1293955acd934a5d6c760ec547306cb08e300085f1c3
ISBN 9783642253669
3642253660
ISSN 0302-9743
IngestDate Wed Sep 17 04:00:41 EDT 2025
Mon Apr 07 02:09:25 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCallNum QA76.575
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-j2245-224e78cb70eeaefd6723e1293955acd934a5d6c760ec547306cb08e300085f1c3
OCLC 765960240
PQID EBC3070993_292_291
PageCount 12
ParticipantIDs springer_books_10_1007_978_3_642_25367_6_25
proquest_ebookcentralchapters_3070993_292_291
PublicationCentury 2000
PublicationDate 2011-00-00
PublicationDateYYYYMMDD 2011-01-01
PublicationDate_xml – year: 2011
  text: 2011-00-00
PublicationDecade 2010
PublicationPlace Germany
PublicationPlace_xml – name: Germany
– name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle 5th Pacific Rim Symposium, PSIVT 2011, Gwangju, South Korea, November 20-23, 2011, Proceedings, Part I
PublicationTitle Advances in Image and Video Technology
PublicationYear 2011
Publisher Springer Berlin / Heidelberg
Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin / Heidelberg
– name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0000609088
ssj0002792
Score 1.7195395
Snippet We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual...
SourceID springer
proquest
SourceType Publisher
StartPage 277
SubjectTerms camera motion estimation
first-person vision
visual attention
Visual saliency
Title Attention Prediction in Egocentric Video Using Motion and Visual Saliency
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3070993&ppg=291
http://link.springer.com/10.1007/978-3-642-25367-6_25
Volume 7087
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PT9swFLZGdxk7MNjQOhjygRvy5CaOnR4ZKgIE0ySg4mYljoOKtFZqswt_Pe_5R5OWXuBQq4lqy_LnuC_vfd97hBzzXOVCKM6GQiVM1NywQgwqpupaFLBBMusi-Dd_5MW9uHrIHtpiik5d0pS_zPNGXcl7UIV7gCuqZN-A7HJQuAHfAV9oAWFo14zfVTerpxf76L3js17-Q-oN-sDHk8rOXjnMQ4-mCeTGv3OMz0Se4-hx5jiaExO6ex7BzayJXOXxZIEyk1sw2lGq2XUVOMFc11UQXYUnIZMW5kXflE3LKTsEPOip9GVU4kmpuP9zfHXsdpkWEvU-0FUxqb2meTXLdeKrc61luR79PsPTB6wlnQwT-MAb7RZsox75eDq6uh4v_WZccqRmoUwnzjGk7mrn3JFIbprTysvEWvzbmRV3X8hnlJpQ1IDALHfJBzvdIzux0AYN5-4e2e5kjYQrjwY9nTeLr-RyiSptUaWTKW1RpQ5V6lClHlUKqNIwTkT1G7k_H92dXbBQH4M9geGVMWisyk2puLWFrSupktSi_TbMssJUw1QUWSWNktwaLDHNpSl5blNnZ9cDk-6T3nQ2td8JHVS1MtJmRiUYyS2LvM7BFK4yU0oBVn-fsLhk2kXxA3XY-AVa6DXw-uQkrqvGny90TI8NgOhUAyDaAaIRkB9vHP2AfGq39yHpNfP_9ifYhk15FLbLC9iYXLk
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Advances+in+Image+and+Video+Technology&rft.atitle=Attention+Prediction+in+Egocentric+Video+Using+Motion+and+Visual+Saliency&rft.date=2011-01-01&rft.pub=Springer+Berlin+%2F+Heidelberg&rft.isbn=9783642253669&rft.volume=7087&rft_id=info:doi/10.1007%2F978-3-642-25367-6_25&rft.externalDBID=291&rft.externalDocID=EBC3070993_292_291
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F3070993-l.jpg