A hybrid unsupervised-weakly supervised method for video anomaly detection

Video anomaly detection has recently become a critical field of research because of its potential applications in autonomous surveillance systems. We propose Hybrid-ConvATFM, a novel method that uniquely operates in both unsupervised and weakly supervised settings, making several key contributions....

Full description

Saved in:

Bibliographic Details
Published in	Pattern analysis and applications : PAA Vol. 28; no. 3
Main Authors	Rakhmonov, Akhrorjon Akhmadjon Ugli, Varnousefaderani, Bahar Amirian, Kim, Jeonghong
Format	Journal Article
Language	English
Published	London Springer London 01.09.2025 Springer Nature B.V
Subjects	Annotations Anomalies Benchmarks Clustering Computer Science Crime Critical field (superconductivity) Original Article Pattern Recognition Segments Surveillance systems Violence Video anomaly detection Autoencoder Pseudo-label Weakly supervised learning Unsupervised learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Video anomaly detection has recently become a critical field of research because of its potential applications in autonomous surveillance systems. We propose Hybrid-ConvATFM, a novel method that uniquely operates in both unsupervised and weakly supervised settings, making several key contributions. First, it alleviates the human annotation problem by employing a GMM-based divisive clustering algorithm to generate video-level pseudo-labels, enabling fully unsupervised operation when frame-level or video-level annotations are unavailable. Second, Hybrid-ConvATFM excels at capturing the complex temporal dynamics of anomalous events through its innovative architecture that combines convolutional autoencoders with attention modules, extracting enhanced local and global temporal dependencies of video segments. Third, our approach incorporates a temporal feature magnitude learning technique that effectively identifies and classifies abnormal segments with high precision. The versatility and effectiveness of Hybrid-ConvATFM is demonstrated through extensive experimentation on benchmark datasets. In the weakly supervised setting, our method achieves area under the curve scores of 97.70% on ShanghaiTech, 84.91% on UCF-Crime, and 83.51% on XD-Violence. In the fully unsupervised setting, Hybrid-ConvATFM attains 89.50% on ShanghaiTech, 79.61% on UCF-Crime, and 78.43% on XD-Violence, establishing new performance benchmarks in video anomaly detection across multiple operational scenarios.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1433-7541 1433-755X
DOI:	10.1007/s10044-025-01515-9