Alternative Semantic Representations for Zero-Shot Human Action Recognition
A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from sti...
Saved in:
Published in | Machine Learning and Knowledge Discovery in Databases Vol. 10534; pp. 87 - 102 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2017
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3319712489 9783319712482 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-319-71249-9_6 |
Cover
Loading…
Abstract | A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost, which paves a new way in gaining side information for large-scale zero-shot human action recognition. We investigate different encoding methods to generate semantic representations for human actions from such side information. Based on our zero-shot visual recognition method, we conducted experiments on UCF101 and HMDB51 to evaluate two proposed semantic representations. The results suggest that our proposed text- and image-based semantic representations outperform traditional attributes and word vectors considerably for zero-shot human action recognition. In particular, the image-based semantic representations yield the favourable performance even though the representation is extracted from a small number of images per class.
Code related to this chapter is available at: http://staff.cs.manchester.ac.uk/~kechen/BiDiLEL/
Data related to this chapter are available at: http://staff.cs.manchester.ac.uk/~kechen/ASRHAR/ |
---|---|
AbstractList | A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost, which paves a new way in gaining side information for large-scale zero-shot human action recognition. We investigate different encoding methods to generate semantic representations for human actions from such side information. Based on our zero-shot visual recognition method, we conducted experiments on UCF101 and HMDB51 to evaluate two proposed semantic representations. The results suggest that our proposed text- and image-based semantic representations outperform traditional attributes and word vectors considerably for zero-shot human action recognition. In particular, the image-based semantic representations yield the favourable performance even though the representation is extracted from a small number of images per class.
Code related to this chapter is available at: http://staff.cs.manchester.ac.uk/~kechen/BiDiLEL/
Data related to this chapter are available at: http://staff.cs.manchester.ac.uk/~kechen/ASRHAR/ |
Author | Wang, Qian Chen, Ke |
Author_xml | – sequence: 1 givenname: Qian surname: Wang fullname: Wang, Qian email: qian.wang@manchester.ac.uk – sequence: 2 givenname: Ke surname: Chen fullname: Chen, Ke |
BookMark | eNqNkMtOwzAQRQ0URFv6BWzyA4YZTxLby6oCiqiERGHDxkpcpw9KXOKU78ehwJrVWHfuGclnwHq1rx1jlwhXCCCvtVScOKHmEkWquTb5ERvFlGL2Helj1scckROl-oQNfhdK91gfCATXMqUzNkBAlUqBuTpnoxA2AICaACX02cN427qmLtr1p0vm7r2o27VNntyuccHVbcx9HZLKN8mrazyfr3ybTPexloxtt4tV65f1untfsNOq2AY3-plD9nJ78zyZ8tnj3f1kPOMb0qrlQmSupEVJpc1yuxBWQ5GSWFRyUbnc5laJXBJUVQYlKZkK6VRRliCsFRUIR0OGh7th16zrpWtM6f1bMAimU2eiJEMm2jDfnkxUF5n0wOwa_7F3oTWug2z8Y1Ns7arYRQ0hFoEE5AYxM5iK_2JZJiWA-sO-AI0AgWI |
ContentType | Book Chapter |
Copyright | Springer International Publishing AG 2017 |
Copyright_xml | – notice: Springer International Publishing AG 2017 |
DBID | FFUUA |
DEWEY | 006.31 |
DOI | 10.1007/978-3-319-71249-9_6 |
DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9783319712499 3319712497 |
EISSN | 1611-3349 |
Editor | Vens, Celine Hollmén, Jaakko Dzeroski, Saso Todorovski, Ljupčo Ceci, Michelangelo |
Editor_xml | – sequence: 1 fullname: Todorovski, Ljupčo – sequence: 2 fullname: Ceci, Michelangelo – sequence: 3 fullname: Vens, Celine – sequence: 4 fullname: Hollmén, Jaakko – sequence: 5 fullname: Dzeroski, Saso |
EndPage | 102 |
ExternalDocumentID | EBC6303206_115_142 EBC5577008_115_142 |
GroupedDBID | 0D6 0DA 38. AABBV AALVI ABBVZ ABHTH ABQUB ACDJR AEDXK AEJLV AEKFX AEZAY AGIGN AGYGE AIODD ALBAV ALMA_UNASSIGNED_HOLDINGS AZZ BATQV BBABE CVWCR CZZ FFUUA I4C IEZ SBO SWYDZ TPJZQ TSXQS Z5O Z7R Z7U Z7W Z7X Z7Z Z81 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-LOGICAL-j398t-225eb3db3bc56cd2c90a432df7dfe6c6c826730ff50b387427e8abb02cc2f02e3 |
ISBN | 3319712489 9783319712482 |
ISSN | 0302-9743 |
IngestDate | Tue Jul 29 20:20:36 EDT 2025 Wed May 28 23:51:48 EDT 2025 Wed May 28 23:38:32 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
LCCallNum | QA76.9.D343Q334-342T |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-j398t-225eb3db3bc56cd2c90a432df7dfe6c6c826730ff50b387427e8abb02cc2f02e3 |
OCLC | 1018472168 |
OpenAccessLink | https://www.research.manchester.ac.uk/portal/en/publications/alternative-semantic-representations-for-zeroshot-human-action-recognition(6a30151c-69f8-4a90-928c-c575b36caddf).html |
PQID | EBC5577008_115_142 |
PageCount | 16 |
ParticipantIDs | springer_books_10_1007_978_3_319_71249_9_6 proquest_ebookcentralchapters_6303206_115_142 proquest_ebookcentralchapters_5577008_115_142 |
PublicationCentury | 2000 |
PublicationDate | 2017 |
PublicationDateYYYYMMDD | 2017-01-01 |
PublicationDate_xml | – year: 2017 text: 2017 |
PublicationDecade | 2010 |
PublicationPlace | Switzerland |
PublicationPlace_xml | – name: Switzerland – name: Cham |
PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | European Conference, ECML PKDD 2017, Skopje, Macedonia, September 18-22, 2017, Proceedings, Part I |
PublicationTitle | Machine Learning and Knowledge Discovery in Databases |
PublicationYear | 2017 |
Publisher | Springer International Publishing AG Springer International Publishing |
Publisher_xml | – name: Springer International Publishing AG – name: Springer International Publishing |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard |
SSID | ssj0001930170 ssj0002792 |
Score | 2.3746626 |
Snippet | A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic... |
SourceID | springer proquest |
SourceType | Publisher |
StartPage | 87 |
SubjectTerms | Fisher Vector Human action recognition Image deep representation Semantic representation Textual description representation Zero-shot learning |
Title | Alternative Semantic Representations for Zero-Shot Human Action Recognition |
URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5577008&ppg=142 http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6303206&ppg=142 http://link.springer.com/10.1007/978-3-319-71249-9_6 |
Volume | 10534 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELa2ywVx4C3e8oETlVFiO3F84LAqRVW3VAJaqLhYseOoQrBbdVMO_HrGryQbkFC5RKvI2mTn887LM98g9FLnrKHMZITLvCK8MZZosMtEGp03wpGMefri98flwSk_PCvOZrOLUdXSVadfm19_7Sv5H1ThHuDqumSvgWz_pXADPgO-cAWE4TpxfrfTrHHCkCuDtIkhNbQaLlOKzNFqGlee6fv63tZd7exV70B_iWniD6PdsRcbNZZb-2jxPaYMf1pQLD8ACF96fzG0La08pcPuV3u5Jp_O1108GFiEIeQfU4VSxN8Jxm7eHMWzi-N150vCdtN4iaRtxumIXEzSESkdOUloDjm1rfiVgQIQ4GFU4xQnAx0NUU5Qezao5dKRLbJAbhpVbbTTwWjnvm37T3swLgFx7VruYZJIVe6gHVEVc3RjsX949HnIyknmCIV6W-7oFcM5VHgn1x2U3lkG_qbhN_SkVoG3ePLErRBmcurunZmTO-iWa3DBrvMEhHcXzezqHrqd5I-j_O-j5Qh6nKDHE-gxQI976LGHHgfo8Qj6B-j03f7J3gGJszfINyarjoCat5o1mmlTlKahRmY1Z7RpRdPa0pQGwlIwDm1bZJpVglNhq1rrjBpD24xa9hDNV-uVfYRwayVra3Dz87riYDM0bzV4-U3FtOANLx4jkgSjfIVALEs2QQwbVRRCuGGpELxApEr_ub4Et4xm5bD-VZK2css3KlF1A0qKKUBJeZQUoPTkOoufopvDX-AZmneXV_Y5-KidfhE31m9evosz |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Machine+Learning+and+Knowledge+Discovery+in+Databases&rft.au=Wang%2C+Qian&rft.au=Chen%2C+Ke&rft.atitle=Alternative+Semantic+Representations+for+Zero-Shot+Human+Action+Recognition&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2017-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319712482&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=87&rft.epage=102&rft_id=info:doi/10.1007%2F978-3-319-71249-9_6 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F5577008-l.jpg http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6303206-l.jpg |