A hybrid unsupervised-weakly supervised method for video anomaly detection

Video anomaly detection has recently become a critical field of research because of its potential applications in autonomous surveillance systems. We propose Hybrid-ConvATFM, a novel method that uniquely operates in both unsupervised and weakly supervised settings, making several key contributions....

Full description

Saved in:
Bibliographic Details
Published inPattern analysis and applications : PAA Vol. 28; no. 3
Main Authors Rakhmonov, Akhrorjon Akhmadjon Ugli, Varnousefaderani, Bahar Amirian, Kim, Jeonghong
Format Journal Article
LanguageEnglish
Published London Springer London 01.09.2025
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Video anomaly detection has recently become a critical field of research because of its potential applications in autonomous surveillance systems. We propose Hybrid-ConvATFM, a novel method that uniquely operates in both unsupervised and weakly supervised settings, making several key contributions. First, it alleviates the human annotation problem by employing a GMM-based divisive clustering algorithm to generate video-level pseudo-labels, enabling fully unsupervised operation when frame-level or video-level annotations are unavailable. Second, Hybrid-ConvATFM excels at capturing the complex temporal dynamics of anomalous events through its innovative architecture that combines convolutional autoencoders with attention modules, extracting enhanced local and global temporal dependencies of video segments. Third, our approach incorporates a temporal feature magnitude learning technique that effectively identifies and classifies abnormal segments with high precision. The versatility and effectiveness of Hybrid-ConvATFM is demonstrated through extensive experimentation on benchmark datasets. In the weakly supervised setting, our method achieves area under the curve scores of 97.70% on ShanghaiTech, 84.91% on UCF-Crime, and 83.51% on XD-Violence. In the fully unsupervised setting, Hybrid-ConvATFM attains 89.50% on ShanghaiTech, 79.61% on UCF-Crime, and 78.43% on XD-Violence, establishing new performance benchmarks in video anomaly detection across multiple operational scenarios.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1433-7541
1433-755X
DOI:10.1007/s10044-025-01515-9