MultiSubjects: A multi-subject video dataset for single-person basketball action recognition from basketball gym

Computer vision technology is becoming a research focus in the field of basketball. Despite the abundance of datasets centered on basketball games, there remains a significant gap in the availability of a large-scale, multi-subject, and fine-grained dataset for the recognition of basketball actions...

Full description

Saved in:

Bibliographic Details
Published in	Computer vision and image understanding Vol. 249; p. 104193
Main Authors	Han, Zhijie, Qin, Wansong, Wang, Yalu, Wang, Qixiang, Shi, Yongbin
Format	Journal Article
Language	English
Published	Elsevier Inc 01.12.2024
Subjects	Basketball Civilian basketball gym Dataset MultiSubjects Skeleton-based action recognition Video action recognition Skeleton-based action recognition Civilian basketball gym Dataset 41A10 Video action recognition 65D05 MultiSubjects 65D17 Basketball 41A05
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Computer vision technology is becoming a research focus in the field of basketball. Despite the abundance of datasets centered on basketball games, there remains a significant gap in the availability of a large-scale, multi-subject, and fine-grained dataset for the recognition of basketball actions in real-world sports scenarios, particularly for amateur players. Such datasets are crucial for advancing the application of computer vision tasks in the real world. To address this gap, we deployed multi-view cameras in a civilian basketball gym, constructed a real basketball data acquisition platform, and acquired a challenging multi-subject video dataset, named MultiSubjects. The MultiSubjects v1.0 dataset features a variety of ages, body types, attire, genders, and basketball actions, providing researchers with a high-quality and diverse resource of basketball action data. We collected a total of 1,000 distinct subjects from video data between September and December 2023, classified and labeled three basic basketball actions, and assigned a unique identity ID to each subject, provided a total of 6,144 video clips, 436,460 frames, and labeled 6,144 instances of actions with clear temporal boundaries using 436,460 human body bounding boxes. Additionally, complete frame-wise skeleton keypoint coordinates for the entire action are provided. We used some representative video action recognition algorithms as well as skeleton-based action recognition algorithms on the MultiSubjects v1.0 dataset and analyzed the results. The results confirm that the quality of our dataset surpasses that of popular video action recognition datasets, it also presents that skeleton-based action recognition remains a challenging task. The link to our dataset is: https://huggingface.co/datasets/Henu-Software/Henu-MultiSubjects. •We introduce a novel dataset named MultiSubjects for amateur basketball action recognition.•We collected 6,144 action video clips from 1,000 different subjects in a real civilian basketball gym.•We provide action labels, identity IDs for 1,000 subjects, 25fps bounding boxes, and skeleton keypoints.•We demonstrate MultiSubjects’ applicability in video action recognition and its challenges for skeleton-based methods.
ISSN:	1077-3142
DOI:	10.1016/j.cviu.2024.104193