DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition

Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-en...

Full description

Saved in:
Bibliographic Details
Published in2014 IEEE Conference on Computer Vision and Pattern Recognition pp. 2625 - 2632
Main Authors Sun, Lin, Jia, Kui, Chan, Tsung-Han, Fang, Yuqiang, Wang, Gang, Yan, Shuicheng
Format Conference Proceeding Journal Article
LanguageEnglish
Published IEEE 01.06.2014
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling.
AbstractList Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling.
Author Kui Jia
Lin Sun
Gang Wang
Shuicheng Yan
Yuqiang Fang
Tsung-Han Chan
Author_xml – sequence: 1
  givenname: Lin
  surname: Sun
  fullname: Sun, Lin
– sequence: 2
  givenname: Kui
  surname: Jia
  fullname: Jia, Kui
– sequence: 3
  givenname: Tsung-Han
  surname: Chan
  fullname: Chan, Tsung-Han
– sequence: 4
  givenname: Yuqiang
  surname: Fang
  fullname: Fang, Yuqiang
– sequence: 5
  givenname: Gang
  surname: Wang
  fullname: Wang, Gang
– sequence: 6
  givenname: Shuicheng
  surname: Yan
  fullname: Yan, Shuicheng
BookMark eNpNjL9Lw0AYQE-pYK0dnVxudEm9n9_1cwupVSGgtOoaLslXOUiTmkuR_vcqdXB6b3i8CzZqu5YYu5JiJqXA2-z9ZTVTQpqZ1nDCpujm0jhEK-XcnrKxFKATQImjf37OpjGGUihwYKyGMcsWebJepnd8QbRrDklOvm-p5uum--JL8sO-J562vjnEEPmm63laDaFr-Yqq7qMNv37Jzja-iTT944S9Le9fs8ckf354ytI8CcqZIfHgUciqJucqxNqVGpVWgF6hsWVZoRe2JCANyggH4GoDVhFqj7K0FvWE3Ry_u7773FMcim2IFTWNb6nbx0KCcyiNnMuf9PqYBiIqdn3Y-v5QAAp0WulvC5JaIg
CODEN IEEPAD
ContentType Conference Proceeding
Journal Article
DBID 6IE
6IH
CBEJK
RIE
RIO
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/CVPR.2014.336
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Technology Research Database

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
Computer Science
EISBN 9781479951185
1479951188
EISSN 1063-6919
2575-7075
EndPage 2632
ExternalDocumentID 6909732
Genre orig-research
GroupedDBID 23M
29F
29O
6IE
6IH
6IK
ABDPE
ACGFS
ALMA_UNASSIGNED_HOLDINGS
CBEJK
IPLJI
M43
RIE
RIO
RNS
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593
IEDL.DBID RIE
ISSN 1063-6919
IngestDate Fri Jul 11 06:27:42 EDT 2025
Wed Aug 27 04:30:17 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Conference-1
ObjectType-Feature-3
content type line 23
SourceType-Conference Papers & Proceedings-2
PQID 1677914181
PQPubID 23500
PageCount 8
ParticipantIDs ieee_primary_6909732
proquest_miscellaneous_1677914181
PublicationCentury 2000
PublicationDate 20140601
PublicationDateYYYYMMDD 2014-06-01
PublicationDate_xml – month: 06
  year: 2014
  text: 20140601
  day: 01
PublicationDecade 2010
PublicationTitle 2014 IEEE Conference on Computer Vision and Pattern Recognition
PublicationTitleAbbrev CVPR
PublicationYear 2014
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssib026764536
ssj0023720
ssj0003211698
Score 2.2955396
Snippet Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are...
SourceID proquest
ieee
SourceType Aggregation Database
Publisher
StartPage 2625
SubjectTerms Abstracts
action recognition
Computer vision
Convolution
deep learning
Feature extraction
Feature recognition
Invariants
Kernel
Learning
Pattern recognition
Recognition
Representations
Sampling
slow feature analysis
Three dimensional
Three-dimensional displays
Video sequences
Visualization
Title DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition
URI https://ieeexplore.ieee.org/document/6909732
https://www.proquest.com/docview/1677914181
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA7bnnzy18TfRPDRdLZNk8W3sTlEVMZ04ttIkxuIox26IvrXe0nbCeqDbyUQaJNrvu9yd98RcuoUw2VoE4bgqRm3WjKNyMrAzoxBeEaMclcDt3fiasKvn5KnBjlb1cIAgE8-g8A9-li-zU3hrso66Mk5cZkmaaLjVtZq1bYTCSl4Uvbu9qdwjJ6NUKuIQuS6sfjIp4iZUKH61tvs9B9HY5fkxYPYKTX7Liu_jmaPN8N1clu_aZlm8hIUyzQwnz9EHP_7KRuk_V3ZR0crzNokDci2yHpFRWn1o7_hUN3toR7bJv3BDbsf9i7oAGAx_2BemdVNmufv1DHJ4hVorXFCkQvTnq-ZoOM6RynP2mQyvHzoX7GqBQN7Rnd1ybTQSBiMBSmNUlamSKeQ8CkdKZ6kqVH6PElBAMIgeoZCSMtFEoGKtQpTdFbiHdLK8gx2CY0kRErMtOxaxY3rtK4iG3U18j8Tc2v2yLZbpumiVNmYViu0R07qjZii5btwhs4gL96moZBShRwpyv7fUw_ImtvVMrHrkLSWrwUcIYVYpsfedr4AkMO-9Q
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEB58HPTkG99G8GhW26aJ8Sary6q7Ir7wVtJkBFFa0S2iv95Jul1BPXgrgUCbTPN9k5n5BmDHK4aryKWcwNNw4YzihpCVo3uwluCZMMpfDfQvZPdWnN2n92OwO6qFQcSQfIYt_xhi-a60lb8q2yNPzovLjMMk4X4a1dVajfXEUkmR1t27wzmckG8j9SimEPt-LCH2KRMudaS_FTf32neXVz7NS7QSr9Uc-qz8OpwD4nRmoN-8a51o8tSqBnnLfv6Qcfzvx8zC4ndtH7scodYcjGExDzNDMsqGv_obDTX9HpqxBWgf9_h15-iQHSO-PH_woM3qJz2X78xzyeoVWaNywogNs6NQNcGumiylsliE287JTbvLh00Y-CM5rANupCHKYB0qZbV2KidCRZRPm1iLNM-tNvtpjhIJCMk3lFI5IdMYdWJ0lJO7kizBRFEWuAwsVhhr-WDUgdPC-l7rOnbxgSEGaBPh7Aos-GXKXmqdjWy4Qiuw3WxERrbvAxqmwLJ6yyKplI4EkZTVv6duwVT3pt_LeqcX52sw7Xe4TvNah4nBa4UbRCgG-Wawoy-5rcI-
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=DL-SFA%3A+Deeply-Learned+Slow+Feature+Analysis+for+Action+Recognition&rft.au=Lin+Sun&rft.au=Kui+Jia&rft.au=Tsung-Han+Chan&rft.au=Yuqiang+Fang&rft.date=2014-06-01&rft.pub=IEEE&rft.issn=1063-6919&rft.eissn=1063-6919&rft.spage=2625&rft.epage=2632&rft_id=info:doi/10.1109%2FCVPR.2014.336&rft.externalDocID=6909732
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6919&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6919&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6919&client=summon