Detecting video anomalies by jointly utilizing appearance and skeleton information

A Video anomaly detection detects abnormal content that does not appear in the training set, and unsupervised methods such as generative methods are applied because the training set only offers normal content. Recently, skeleton-based features have been leveraged to alleviate the background distract...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 246; p. 123135
Main Authors Pang, Wenfeng, He, Qianhua, Li, Yanxiong, Ahmed, Noman
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 15.07.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A Video anomaly detection detects abnormal content that does not appear in the training set, and unsupervised methods such as generative methods are applied because the training set only offers normal content. Recently, skeleton-based features have been leveraged to alleviate the background distractions meanwhile outstanding human motion. However, we regard the appearance and skeleton information as complementary, and propose a generative method consisting of an appearance branch implemented by a 3D U-Net and a skeleton branch implemented by a novel Skeleton-Transformer. Moreover, a Multi-head Co-attention-based fusion module is proposed to fuse the intermediate features extracted from the appearance and skeleton branches, and then transfer the fusion information to each branch. This fusion module addresses the challenge of maintaining the feature structure of each branch during the fusion process, which is essential in the generative model. Experimental results show that the fusion module improves the performances of the appearance branch and skeleton branch, and their combination achieves state-of-the-art performance on HR-ShanghaiTech and Corridor datasets. •Fusing appearance and skeleton information is helpful in anomaly detection.•Multi-head co-attention module fuses multimodal features with structures unchanged.•Different fusion structures affect the model performance.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2023.123135