Diffusion Model-Based Pedestrian De-Occlusion Method Using a Monocular Camera Sensor for Indoor Visual Localization

In monocular camera-based visual localization methods, query images are compared with reference images in the database using an image retrieval algorithm. Once the best matching image is obtained, the pose of online users can be estimated using algorithms such as epipolar geometry or EpnP. However,...

Full description

Saved in:

Bibliographic Details
Published in	IEEE sensors journal Vol. 23; no. 22; pp. 28421 - 28429
Main Authors	Yang, Songxiang, Ma, Lin
Format	Journal Article
Language	English
Published	New York IEEE 15.11.2023 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms Cameras Diffusion model Image enhancement image inpainting Image quality Image restoration Image retrieval Image segmentation indoor localization Localization Location awareness Occlusion Pedestrians Queries Semantic segmentation Semantics Task analysis
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In monocular camera-based visual localization methods, query images are compared with reference images in the database using an image retrieval algorithm. Once the best matching image is obtained, the pose of online users can be estimated using algorithms such as epipolar geometry or EpnP. However, in practical applications, the quality of query images may be affected by environmental and device factors, resulting in decreased image quality. This article primarily focuses on addressing the challenges posed by pedestrians, which are commonly encountered obstacles in indoor visual positioning scenes. It proposes an automatic approach for performing semantic segmentation and occlusion removal to enhance query image quality and improve positioning accuracy. Extensive experimental analysis and comparisons of positioning errors demonstrate that the proposed method significantly enhances the performance of visual localization techniques. To the best of our knowledge, this is the first attempt to address occlusion interference in vision-based localization through an active removal approach.
ISSN:	1530-437X 1558-1748
DOI:	10.1109/JSEN.2023.3321724