ASHN for Multi-Human Pose Estimation

Due to the diversity of human body posture, there are problems such as occlusion of key points, difference of target scale and background blur among people. Therefore, multi-human pose estimation is still a challenging task. The existing deep learning-based multi-body pose estimation methods are mai...

Full description

Saved in:

Bibliographic Details
Published in	2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT) pp. 1 - 6
Main Authors	Gao, Pan, Hu, Zhuhua
Format	Conference Proceeding
Language	English
Published	IEEE 09.12.2022
Subjects	Artificial intelligence attention-containing stacked Hourglass network convolutional block attention module Convolutional neural networks focal 12 loss Multi-person human pose estimation Pose estimation Task analysis
Online Access	Get full text
DOI	10.1109/ACAIT56212.2022.10137930

Cover

More Information
Summary:	Due to the diversity of human body posture, there are problems such as occlusion of key points, difference of target scale and background blur among people. Therefore, multi-human pose estimation is still a challenging task. The existing deep learning-based multi-body pose estimation methods are mainly divided into top-down and bottom-up, but most of them do not make full use of local features in the network. In this paper, convolutional block attention module(CBAM) and Focal L2 Loss were used to process the context information of convolutional neural network and consolidate local features. Specifically, we propose attention-containing stacked hourglass network (ASHN). ASHN is based on a stacked hourglass network, with the addition of a convolutional block attention module (CBAM) module to improve performance, combined with Focal L2 Loss in the model. Compared with the existing methods, our method achieves competitive performance, achieving 66.8% AP, 72.1% AP75 and 65.4% APM on COCO data sets.
DOI:	10.1109/ACAIT56212.2022.10137930