Cross-attention-based hybrid ViT-CNN fusion network for action recognition in visible and infrared videos
Human action recognition (HAR) in videos is a critical task in computer vision, but traditional methods relying solely on visible (RGB) data face challenges in low-light or occluded scenarios. Infrared (IR) imagery offers robustness in such conditions, yet effectively fusing IR and visible modalitie...
Saved in:
Published in | Pattern analysis and applications : PAA Vol. 28; no. 3 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Heidelberg
Springer Nature B.V
01.09.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 1433-7541 1433-755X |
DOI | 10.1007/s10044-025-01493-y |
Cover
Loading…