Attention Prediction in Egocentric Video Using Motion and Visual Saliency

Bibliographic Details
Published in	Advances in Image and Video Technology, Vol. 7087, pp. 277-288
Main Authors	Yamada, Kentaro; Sugano, Yusuke; Okabe, Takahiro; Sato, Yoichi; Sugimoto, Akihiro; Hiraki, Kazuo
Format	Book Chapter
Language	English
Published	Germany: Springer Berlin Heidelberg, 2011
Series	Lecture Notes in Computer Science
Subjects	camera motion estimation; first-person vision; visual attention; visual saliency
ISBN	9783642253669, 3642253660
ISSN	0302-9743, 1611-3349
DOI	10.1007/978-3-642-25367-6_25

Abstract	We propose a method of predicting human egocentric visual attention using bottom-up visual saliency and egomotion information. Computational models of visual saliency are often employed to predict human attention; however, their mechanisms and effectiveness have not been fully explored in egocentric vision. The purpose of our framework is to compute attention maps from an egocentric video that can be used to infer a person’s visual attention. In addition to a standard visual saliency model, two kinds of attention maps are computed, based on the camera’s rotation velocity and direction of movement. These rotation-based and translation-based attention maps are aggregated with a bottom-up saliency map to improve the accuracy of gaze position prediction. The effectiveness of the proposed framework was examined in real environments using a head-mounted gaze tracker, and we found that the egomotion-based attention maps contributed to accurate prediction of human visual attention.
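
The aggregation step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the three maps are combined, so the weighted sum below is an assumption, not the authors' method. The Gaussian stand-ins for the rotation-based and translation-based maps, and all names (`gaussian_map`, `normalize_map`, `combine_maps`, `predict_gaze`), are likewise hypothetical and chosen only for illustration.

```python
import math


def gaussian_map(height, width, center_row, center_col, sigma):
    """Return a height x width map with a Gaussian bump at the given center.

    Used here only as a hypothetical stand-in for any attention map.
    """
    return [
        [
            math.exp(
                -((r - center_row) ** 2 + (c - center_col) ** 2)
                / (2.0 * sigma ** 2)
            )
            for c in range(width)
        ]
        for r in range(height)
    ]


def normalize_map(attention_map):
    """Rescale a map linearly to [0, 1]; a constant map becomes all zeros."""
    flat = [v for row in attention_map for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0.0 for _ in row] for row in attention_map]
    return [[(v - lo) / (hi - lo) for v in row] for row in attention_map]


def combine_maps(maps, weights):
    """Weighted sum of same-sized maps (an assumed aggregation rule).

    Each map is normalized before weighting, and the sum is normalized again.
    """
    if not maps or len(maps) != len(weights):
        raise ValueError("need one weight per map and at least one map")
    normalized = [normalize_map(m) for m in maps]
    height, width = len(maps[0]), len(maps[0][0])
    combined = [
        [
            sum(w * m[r][c] for m, w in zip(normalized, weights))
            for c in range(width)
        ]
        for r in range(height)
    ]
    return normalize_map(combined)


def predict_gaze(attention_map):
    """Return the (row, col) of the first maximum of the map."""
    best = max(
        ((v, r, c) for r, row in enumerate(attention_map) for c, v in enumerate(row)),
        key=lambda item: item[0],
    )
    return best[1], best[2]


if __name__ == "__main__":
    # Hypothetical 9x9 frame: saliency peaks top-left, egomotion maps peak right.
    saliency = gaussian_map(9, 9, 2, 2, 1.5)
    rotation = gaussian_map(9, 9, 4, 6, 1.5)
    translation = gaussian_map(9, 9, 4, 6, 1.5)
    combined = combine_maps([saliency, rotation, translation], [1.0, 1.0, 1.0])
    print(predict_gaze(combined))
```

In this sketch the weights control how much each cue contributes. Equal weights are an arbitrary choice for illustration; the abstract does not report how the maps were weighted.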
Author details
1. Yamada, Kentaro, The University of Tokyo, Tokyo, Japan (yamada@iis.u-tokyo.ac.jp)
2. Sugano, Yusuke, The University of Tokyo, Tokyo, Japan (sugano@iis.u-tokyo.ac.jp)
3. Okabe, Takahiro, The University of Tokyo, Tokyo, Japan (takahiro@iis.u-tokyo.ac.jp)
4. Sato, Yoichi, The University of Tokyo, Tokyo, Japan (ysato@iis.u-tokyo.ac.jp)
5. Sugimoto, Akihiro, National Institute of Informatics, Tokyo, Japan (sugimoto@nii.ac.jp)
6. Hiraki, Kazuo, The University of Tokyo, Tokyo, Japan (khiraki@idea.c.u-tokyo.ac.jp)
Copyright	Springer-Verlag Berlin Heidelberg 2011
DEWEY	621.367
Discipline	Visual Arts; Engineering; Computer Science
EISBN	9783642253676, 3642253679
Editor	Ho, Yo-Sung
LCCallNum	QA76.575
OCLC	765960240
PublicationSubtitle	5th Pacific Rim Symposium, PSIVT 2011, Gwangju, South Korea, November 20-23, 2011, Proceedings, Part I
RelatedPersons
1. Hutchison, David, Lancaster University, Lancaster, UK
2. Kanade, Takeo, Carnegie Mellon University, Pittsburgh, USA
3. Kittler, Josef, University of Surrey, Guildford, UK
4. Kleinberg, Jon M., Cornell University, Ithaca, USA
5. Mattern, Friedemann, ETH Zurich, Zurich, Switzerland
6. Mitchell, John C., Stanford University, Stanford, USA
7. Naor, Moni, Weizmann Institute of Science, Rehovot, Israel
8. Nierstrasz, Oscar, University of Bern, Bern, Switzerland
9. Pandu Rangan, C., Indian Institute of Technology, Madras, India
10. Steffen, Bernhard, University of Dortmund, Dortmund, Germany
11. Sudan, Madhu, Massachusetts Institute of Technology, USA
12. Terzopoulos, Demetri, University of California, Los Angeles, USA
13. Tygar, Doug, University of California, Berkeley, USA
14. Vardi, Moshe Y., Rice University, Houston, USA
15. Weikum, Gerhard, Max-Planck Institute of Computer Science, Saarbrücken, Germany
URI	http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3070993&ppg=291 http://link.springer.com/10.1007/978-3-642-25367-6_25