Sports Video Classification Method Based on Improved Deep Learning

Classifying sports videos is complex due to their dynamic nature. Traditional methods, like optical flow and the Histogram of Oriented Gradient (HOG), are limited by their need for expertise and lack of universality. Deep learning, particularly Convolutional Neural Networks (CNNs), offers more effec...

Full description

Saved in:

Bibliographic Details
Published in	Applied sciences Vol. 14; no. 2; p. 948
Main Authors	Gao, Tianhao, Zhang, Meng, Zhu, Yifan, Zhang, Youjian, Pang, Xiangsheng, Ying, Jing, Liu, Wenming
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.01.2024
Subjects	Accuracy Algorithms Classification Convolutional Neural Networks (CNN) Datasets Deep learning Evaluation Football Fourier transforms image processing Methods Neural networks Noise control sports video analysis Velocity
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Classifying sports videos is complex due to their dynamic nature. Traditional methods, like optical flow and the Histogram of Oriented Gradient (HOG), are limited by their need for expertise and lack of universality. Deep learning, particularly Convolutional Neural Networks (CNNs), offers more effective feature recognition in sports videos, but standard CNNs struggle with fast-paced or low-resolution sports videos. Our novel neural network model addresses these challenges. It begins by selecting important frames from sports footage and applying a fuzzy noise reduction algorithm to enhance video quality. The model then uses a bifurcated neural network to extract detailed features, leading to a densely connected neural network with a specific activation function for categorizing videos. We tested our model on a High-Definition Sports Video Dataset covering over 20 sports and a low-resolution dataset. Our model outperformed established classifiers like DenseNet, VggNet, Inception v3, and ResNet-50. It achieved high precision (0.9718), accuracy (0.9804), F-score (0.9761), and recall (0.9723) on the high-resolution dataset, and significantly better precision (0.8725) on the low-resolution dataset. Correspondingly, the highest values on the matrix of four traditional models are: precision (0.9690), accuracy (0.9781), F-score (0.9670), recall (0.9681) on the high-resolution dataset, and precision (0.8627) on the low-resolution dataset. This demonstrates our model’s superior performance in sports video classification under various conditions, including rapid motion and low resolution. It marks a significant step forward in sports data analytics and content categorization.
ISSN:	2076-3417 2076-3417
DOI:	10.3390/app14020948