Person re-identification in video surveillance systems by feature replacement of occluded parts of human figures

Тhe algorithm for re-identifying people in intelligent video surveillance systems is proposed. It is based on the construction of a compound neural network descriptor and replacing features of occluded parts of a human figure. Composite descriptors are generated for all images in a gallery and recor...

Full description

Saved in:

Bibliographic Details
Published in	Pattern analysis and applications : PAA Vol. 28; no. 2
Main Authors	Ye, Shiping, Bohush, R., Ihnatsyeva, S., Ablameyko, S.
Format	Journal Article
Language	English
Published	London Springer London 01.06.2025 Springer Nature B.V
Subjects	Algorithms Computer Science Feature extraction Original Article Pattern Recognition Similarity Surveillance Surveillance systems Database for re-identification Image augmentation Convolution neural networks Compound descriptor
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Тhe algorithm for re-identifying people in intelligent video surveillance systems is proposed. It is based on the construction of a compound neural network descriptor and replacing features of occluded parts of a human figure. Composite descriptors are generated for all images in a gallery and recorded in a table. They characterize the global and local features of each person, considering his individual parts’ visibility. Detection and selection of areas of interest for the formation of local descriptors is carried out based on detecting key points of the human body. If a person is partially occluded by other people or objects, then the corresponding region is classified as invisible, and the compound descriptor of respective component will be invalid and equal to zero. For images whose feature vector has zero components, the feature table is ranked according to the cosine similarity metric for each visible local fragment. Based on the feature table rankings, the k-nearest neighbors are determined and the k1-best are selected from them. The corresponding k 1 -nearest neighbors’ component average value of the feature vector is used to replace the zero descriptor components. The feature table is then updated for the generated vectors and ranking is performed according to the query using the cosine similarity metric. ResNet-50 and DenseNet-121 were used as backbone CNNs for feature extraction, and testing was performed using Market-1501, DukeMTMC-ReID, Occluded-Duke, MSMT17, and PolReID1077 datasets.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1433-7541 1433-755X
DOI:	10.1007/s10044-025-01482-1