DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition

Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-en...

Full description

Saved in:

Bibliographic Details
Published in	2014 IEEE Conference on Computer Vision and Pattern Recognition pp. 2625 - 2632
Main Authors	Sun, Lin, Jia, Kui, Chan, Tsung-Han, Fang, Yuqiang, Wang, Gang, Yan, Shuicheng
Format	Conference Proceeding Journal Article
Language	English
Published	IEEE 01.06.2014
Subjects	Abstracts action recognition Computer vision Convolution deep learning Feature extraction Feature recognition Invariants Kernel Learning Pattern recognition Recognition Representations Sampling slow feature analysis Three dimensional Three-dimensional displays Video sequences Visualization
Online Access	Get full text

Cover

Loading…

Abstract	Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling.
AbstractList	Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are implemented sophisticatedly and difficult to be extended to other sensor modalities. Recent studies discover that there are no universally best hand-engineered features for all datasets, and learning features directly from the data may be more advantageous. One such endeavor is Slow Feature Analysis (SFA) proposed by Wiskott and Sejnowski [33]. SFA can learn the invariant and slowly varying features from input signals and has been proved to be valuable in human action recognition [34]. It is also observed that the multi-layer feature representation has succeeded remarkably in widespread machine learning applications. In this paper, we propose to combine SFA with deep learning techniques to learn hierarchical representations from the video data itself. Specifically, we use a two-layered SFA learning structure with 3D convolution and max pooling operations to scale up the method to large inputs and capture abstract and structural features from the video. Thus, the proposed method is suitable for action recognition. At the same time, sharing the same merits of deep learning, the proposed method is generic and fully automated. Our classification results on Hollywood2, KTH and UCF Sports are competitive with previously published results. To highlight some, on the KTH dataset, our recognition rate shows approximately 1% improvement in comparison to state-of-the-art methods even without supervision or dense sampling.
Author	Kui Jia Lin Sun Gang Wang Shuicheng Yan Yuqiang Fang Tsung-Han Chan
Author_xml	– sequence: 1 givenname: Lin surname: Sun fullname: Sun, Lin – sequence: 2 givenname: Kui surname: Jia fullname: Jia, Kui – sequence: 3 givenname: Tsung-Han surname: Chan fullname: Chan, Tsung-Han – sequence: 4 givenname: Yuqiang surname: Fang fullname: Fang, Yuqiang – sequence: 5 givenname: Gang surname: Wang fullname: Wang, Gang – sequence: 6 givenname: Shuicheng surname: Yan fullname: Yan, Shuicheng
BookMark	eNpNjL9Lw0AYQE-pYK0dnVxudEm9n9_1cwupVSGgtOoaLslXOUiTmkuR_vcqdXB6b3i8CzZqu5YYu5JiJqXA2-z9ZTVTQpqZ1nDCpujm0jhEK-XcnrKxFKATQImjf37OpjGGUihwYKyGMcsWebJepnd8QbRrDklOvm-p5uum--JL8sO-J562vjnEEPmm63laDaFr-Yqq7qMNv37Jzja-iTT944S9Le9fs8ckf354ytI8CcqZIfHgUciqJucqxNqVGpVWgF6hsWVZoRe2JCANyggH4GoDVhFqj7K0FvWE3Ry_u7773FMcim2IFTWNb6nbx0KCcyiNnMuf9PqYBiIqdn3Y-v5QAAp0WulvC5JaIg
CODEN	IEEPAD
ContentType	Conference Proceeding Journal Article
DBID	6IE 6IH CBEJK RIE RIO 7SC 7SP 8FD JQ2 L7M L~C L~D
DOI	10.1109/CVPR.2014.336
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional
DatabaseTitle	Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional
DatabaseTitleList	Technology Research Database
Database_xml	– sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Applied Sciences Computer Science
EISBN	9781479951185 1479951188
EISSN	1063-6919 2575-7075
EndPage	2632
ExternalDocumentID	6909732
Genre	orig-research
GroupedDBID	23M 29F 29O 6IE 6IH 6IK ABDPE ACGFS ALMA_UNASSIGNED_HOLDINGS CBEJK IPLJI M43 RIE RIO RNS 7SC 7SP 8FD JQ2 L7M L~C L~D
ID	FETCH-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593
IEDL.DBID	RIE
ISSN	1063-6919
IngestDate	Fri Jul 11 06:27:42 EDT 2025 Wed Aug 27 04:30:17 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i274t-a6a901cde77c99d7b3923269a2945bbc9a05be6e362407667d4652e93a91b5593
Notes	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2
PQID	1677914181
PQPubID	23500
PageCount	8
ParticipantIDs	ieee_primary_6909732 proquest_miscellaneous_1677914181
PublicationCentury	2000
PublicationDate	20140601
PublicationDateYYYYMMDD	2014-06-01
PublicationDate_xml	– month: 06 year: 2014 text: 20140601 day: 01
PublicationDecade	2010
PublicationTitle	2014 IEEE Conference on Computer Vision and Pattern Recognition
PublicationTitleAbbrev	CVPR
PublicationYear	2014
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssib026764536 ssj0023720 ssj0003211698
Score	2.2955396
Snippet	Most of the previous work on video action recognition use complex hand-designed local features, such as SIFT, HOG and SURF, but these approaches are...
SourceID	proquest ieee
SourceType	Aggregation Database Publisher
StartPage	2625
SubjectTerms	Abstracts action recognition Computer vision Convolution deep learning Feature extraction Feature recognition Invariants Kernel Learning Pattern recognition Recognition Representations Sampling slow feature analysis Three dimensional Three-dimensional displays Video sequences Visualization
Title	DL-SFA: Deeply-Learned Slow Feature Analysis for Action Recognition
URI	https://ieeexplore.ieee.org/document/6909732 https://www.proquest.com/docview/1677914181
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwEA7bnnzy18TfRPDRdLZNk8W3sTlEVMZ04ttIkxuIox26IvrXe0nbCeqDbyUQaJNrvu9yd98RcuoUw2VoE4bgqRm3WjKNyMrAzoxBeEaMclcDt3fiasKvn5KnBjlb1cIAgE8-g8A9-li-zU3hrso66Mk5cZkmaaLjVtZq1bYTCSl4Uvbu9qdwjJ6NUKuIQuS6sfjIp4iZUKH61tvs9B9HY5fkxYPYKTX7Liu_jmaPN8N1clu_aZlm8hIUyzQwnz9EHP_7KRuk_V3ZR0crzNokDci2yHpFRWn1o7_hUN3toR7bJv3BDbsf9i7oAGAx_2BemdVNmufv1DHJ4hVorXFCkQvTnq-ZoOM6RynP2mQyvHzoX7GqBQN7Rnd1ybTQSBiMBSmNUlamSKeQ8CkdKZ6kqVH6PElBAMIgeoZCSMtFEoGKtQpTdFbiHdLK8gx2CY0kRErMtOxaxY3rtK4iG3U18j8Tc2v2yLZbpumiVNmYViu0R07qjZii5btwhs4gL96moZBShRwpyv7fUw_ImtvVMrHrkLSWrwUcIYVYpsfedr4AkMO-9Q
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEB58HPTkG99G8GhW26aJ8Sary6q7Ir7wVtJkBFFa0S2iv95Jul1BPXgrgUCbTPN9k5n5BmDHK4aryKWcwNNw4YzihpCVo3uwluCZMMpfDfQvZPdWnN2n92OwO6qFQcSQfIYt_xhi-a60lb8q2yNPzovLjMMk4X4a1dVajfXEUkmR1t27wzmckG8j9SimEPt-LCH2KRMudaS_FTf32neXVz7NS7QSr9Uc-qz8OpwD4nRmoN-8a51o8tSqBnnLfv6Qcfzvx8zC4ndtH7scodYcjGExDzNDMsqGv_obDTX9HpqxBWgf9_h15-iQHSO-PH_woM3qJz2X78xzyeoVWaNywogNs6NQNcGumiylsliE287JTbvLh00Y-CM5rANupCHKYB0qZbV2KidCRZRPm1iLNM-tNvtpjhIJCMk3lFI5IdMYdWJ0lJO7kizBRFEWuAwsVhhr-WDUgdPC-l7rOnbxgSEGaBPh7Aos-GXKXmqdjWy4Qiuw3WxERrbvAxqmwLJ6yyKplI4EkZTVv6duwVT3pt_LeqcX52sw7Xe4TvNah4nBa4UbRCgG-Wawoy-5rcI-
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2014+IEEE+Conference+on+Computer+Vision+and+Pattern+Recognition&rft.atitle=DL-SFA%3A+Deeply-Learned+Slow+Feature+Analysis+for+Action+Recognition&rft.au=Lin+Sun&rft.au=Kui+Jia&rft.au=Tsung-Han+Chan&rft.au=Yuqiang+Fang&rft.date=2014-06-01&rft.pub=IEEE&rft.issn=1063-6919&rft.eissn=1063-6919&rft.spage=2625&rft.epage=2632&rft_id=info:doi/10.1109%2FCVPR.2014.336&rft.externalDocID=6909732
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6919&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6919&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6919&client=summon