DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition
Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-en...
Saved in:
Published in | 2014 IEEE Conference on Computer Vision and Pattern Recognition pp. 2625 - 2632 |
---|---|
Main Authors | , , , , , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
IEEE
01.06.2014
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling. |
---|---|
AbstractList | Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling. |
Author | Kui Jia Lin Sun Gang Wang Shuicheng Yan Yuqiang Fang Tsung-Han Chan |
Author_xml | – sequence: 1 givenname: Lin surname: Sun fullname: Sun, Lin – sequence: 2 givenname: Kui surname: Jia fullname: Jia, Kui – sequence: 3 givenname: Tsung-Han surname: Chan fullname: Chan, Tsung-Han – sequence: 4 givenname: Yuqiang surname: Fang fullname: Fang, Yuqiang – sequence: 5 givenname: Gang surname: Wang fullname: Wang, Gang – sequence: 6 givenname: Shuicheng surname: Yan fullname: Yan, Shuicheng |
BookMark | eNpNjL9Lw0AYQE-pYK0dnVxudEm9n9_1cwupVSGgtOoaLslXOUiTmkuR_vcqdXB6b3i8CzZqu5YYu5JiJqXA2-z9ZTVTQpqZ1nDCpujm0jhEK-XcnrKxFKATQImjf37OpjGGUihwYKyGMcsWebJepnd8QbRrDklOvm-p5uum--JL8sO-J562vjnEEPmm63laDaFr-Yqq7qMNv37Jzja-iTT944S9Le9fs8ckf354ytI8CcqZIfHgUciqJucqxNqVGpVWgF6hsWVZoRe2JCANyggH4GoDVhFqj7K0FvWE3Ry_u7773FMcim2IFTWNb6nbx0KCcyiNnMuf9PqYBiIqdn3Y-v5QAAp0WulvC5JaIg |
CODEN | IEEPAD |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IH CBEJK RIE RIO 7SC 7SP 8FD JQ2 L7M L~C L~D |
DOI | 10.1109/CVPR.2014.336 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences Computer Science |
EISBN | 9781479951185 1479951188 |
EISSN | 1063-6919 2575-7075 |
EndPage | 2632 |
ExternalDocumentID | 6909732 |
Genre | orig-research |
GroupedDBID | 23M 29F 29O 6IE 6IH 6IK ABDPE ACGFS ALMA_UNASSIGNED_HOLDINGS CBEJK IPLJI M43 RIE RIO RNS 7SC 7SP 8FD JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593 |
IEDL.DBID | RIE |
ISSN | 1063-6919 |
IngestDate | Fri Jul 11 06:27:42 EDT 2025 Wed Aug 27 04:30:17 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2 |
PQID | 1677914181 |
PQPubID | 23500 |
PageCount | 8 |
ParticipantIDs | ieee_primary_6909732 proquest_miscellaneous_1677914181 |
PublicationCentury | 2000 |
PublicationDate | 20140601 |
PublicationDateYYYYMMDD | 2014-06-01 |
PublicationDate_xml | – month: 06 year: 2014 text: 20140601 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2014 IEEE Conference on Computer Vision and Pattern Recognition |
PublicationTitleAbbrev | CVPR |
PublicationYear | 2014 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssib026764536 ssj0023720 ssj0003211698 |
Score | 2.2955396 |
Snippet | Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are... |
SourceID | proquest ieee |
SourceType | Aggregation Database Publisher |
StartPage | 2625 |
SubjectTerms | Abstracts action recognition Computer vision Convolution deep learning Feature extraction Feature recognition Invariants Kernel Learning Pattern recognition Recognition Representations Sampling slow feature analysis Three dimensional Three-dimensional displays Video sequences Visualization |
Title | DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition |
URI | https://ieeexplore.ieee.org/document/6909732 https://www.proquest.com/docview/1677914181 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA7bnnzy18TfRPDRdLZNk8W3sTlEVMZ04ttIkxuIox26IvrXe0nbCeqDbyUQaJNrvu9yd98RcuoUw2VoE4bgqRm3WjKNyMrAzoxBeEaMclcDt3fiasKvn5KnBjlb1cIAgE8-g8A9-li-zU3hrso66Mk5cZkmaaLjVtZq1bYTCSl4Uvbu9qdwjJ6NUKuIQuS6sfjIp4iZUKH61tvs9B9HY5fkxYPYKTX7Liu_jmaPN8N1clu_aZlm8hIUyzQwnz9EHP_7KRuk_V3ZR0crzNokDci2yHpFRWn1o7_hUN3toR7bJv3BDbsf9i7oAGAx_2BemdVNmufv1DHJ4hVorXFCkQvTnq-ZoOM6RynP2mQyvHzoX7GqBQN7Rnd1ybTQSBiMBSmNUlamSKeQ8CkdKZ6kqVH6PElBAMIgeoZCSMtFEoGKtQpTdFbiHdLK8gx2CY0kRErMtOxaxY3rtK4iG3U18j8Tc2v2yLZbpumiVNmYViu0R07qjZii5btwhs4gL96moZBShRwpyv7fUw_ImtvVMrHrkLSWrwUcIYVYpsfedr4AkMO-9Q |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEB58HPTkG99G8GhW26aJ8Sary6q7Ir7wVtJkBFFa0S2iv95Jul1BPXgrgUCbTPN9k5n5BmDHK4aryKWcwNNw4YzihpCVo3uwluCZMMpfDfQvZPdWnN2n92OwO6qFQcSQfIYt_xhi-a60lb8q2yNPzovLjMMk4X4a1dVajfXEUkmR1t27wzmckG8j9SimEPt-LCH2KRMudaS_FTf32neXVz7NS7QSr9Uc-qz8OpwD4nRmoN-8a51o8tSqBnnLfv6Qcfzvx8zC4ndtH7scodYcjGExDzNDMsqGv_obDTX9HpqxBWgf9_h15-iQHSO-PH_woM3qJz2X78xzyeoVWaNywogNs6NQNcGumiylsliE287JTbvLh00Y-CM5rANupCHKYB0qZbV2KidCRZRPm1iLNM-tNvtpjhIJCMk3lFI5IdMYdWJ0lJO7kizBRFEWuAwsVhhr-WDUgdPC-l7rOnbxgSEGaBPh7Aos-GXKXmqdjWy4Qiuw3WxERrbvAxqmwLJ6yyKplI4EkZTVv6duwVT3pt_LeqcX52sw7Xe4TvNah4nBa4UbRCgG-Wawoy-5rcI- |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=DL-SFA%3A+Deeply-Learned+Slow+Feature+Analysis+for+Action+Recognition&rft.au=Lin+Sun&rft.au=Kui+Jia&rft.au=Tsung-Han+Chan&rft.au=Yuqiang+Fang&rft.date=2014-06-01&rft.pub=IEEE&rft.issn=1063-6919&rft.eissn=1063-6919&rft.spage=2625&rft.epage=2632&rft_id=info:doi/10.1109%2FCVPR.2014.336&rft.externalDocID=6909732 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6919&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6919&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6919&client=summon |