Global Optical and SAR Image Registration Method Based on Local Distortion Division

Bibliographic Details
Published in: Remote Sensing (Basel, Switzerland), Vol. 17, No. 9, p. 1642
Main Authors: Li, Bangjie; Guan, Dongdong; Xie, Yuzhen; Zheng, Xiaolong; Chen, Zhengsheng; Pan, Lefei; Zhao, Weiheng; Xiang, Deliang
Format: Journal Article
Language: English
Published: Basel: MDPI AG, 01.05.2025

Summary: Variations in terrain elevation cause images acquired under different imaging modalities to deviate from a linear mapping relationship. This effect is particularly pronounced between optical and SAR images, where the range-based imaging mechanism of SAR sensors leads to significant local geometric distortions, such as foreshortening and occlusion. As a result, it becomes difficult to represent the spatial correspondence between optical and SAR images using a single geometric model. To address this challenge, we propose a global optical-SAR image registration method that leverages local distortion characteristics. Specifically, we introduce a Superpixel-based Local Distortion Division (SLDD) method, which defines superpixel region features and segments the image into local distortion and normal regions by computing the Mahalanobis distance between superpixel features. We further design a Multi-Feature Fusion Capsule Network (MFFCN) that integrates shallow salient features with deep structural details, reconstructing the dimensions of digit capsules to generate feature descriptors encompassing texture, phase, structure, and amplitude information. This design effectively mitigates the information loss and feature degradation problems caused by pooling operations in conventional convolutional neural networks (CNNs). Additionally, a hard negative mining loss is incorporated to further enhance feature discriminability. Feature descriptors are extracted separately from regions with different distortion levels, and corresponding transformation models are built for local registration. Finally, the local registration results are fused to generate a globally aligned image. Experimental results on public datasets demonstrate that the proposed method achieves superior performance over state-of-the-art (SOTA) approaches in terms of Root Mean Squared Error (RMSE), Correct Match Number (CMN), Distribution of Matched Points (Scat), Edge Fidelity (EF), and overall visual quality.
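The superpixel-based division step described in the summary can be illustrated concretely. Below is a minimal sketch, assuming SLIC superpixels on a grayscale image and a hypothetical three-dimensional region feature (mean intensity, intensity standard deviation, mean gradient magnitude); the paper's actual feature definition and decision threshold are not given in the abstract.

```python
# Sketch of a superpixel-based local distortion division: each superpixel's
# feature vector is scored by its squared Mahalanobis distance from the
# global feature distribution; high-distance regions are flagged as distorted.
import numpy as np
from skimage.segmentation import slic
from skimage.filters import sobel

def divide_regions(image, n_segments=400, chi2_thresh=7.81):
    """Label each superpixel of a 2-D grayscale image as distorted or normal."""
    labels = slic(image, n_segments=n_segments, channel_axis=None)
    grad = sobel(image)
    ids = np.unique(labels)
    feats = []
    for sp in ids:
        mask = labels == sp
        # Hypothetical 3-D region feature; the paper's definition may differ.
        feats.append([image[mask].mean(), image[mask].std(), grad[mask].mean()])
    feats = np.asarray(feats)
    diff = feats - feats.mean(axis=0)
    cov_inv = np.linalg.pinv(np.cov(feats, rowvar=False))
    # Squared Mahalanobis distance of each region from the global distribution
    d2 = np.einsum('ij,jk,ik->i', diff, cov_inv, diff)
    distorted = d2 > chi2_thresh  # e.g. chi-square 95% quantile for 3 dof
    return labels, dict(zip(ids.tolist(), distorted.tolist()))
```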
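Likewise, the hard negative mining loss mentioned in the summary can be sketched as a batch-hard triplet loss over paired optical/SAR descriptors, in the spirit of HardNet; the margin value and the Euclidean distance used here are assumptions, since the abstract does not state the exact formulation.

```python
# Sketch of a batch-hard negative mining loss: for each matching pair,
# the closest non-matching descriptor in the batch serves as the negative.
import torch
import torch.nn.functional as F

def hard_negative_loss(opt_desc, sar_desc, margin=1.0):
    """opt_desc, sar_desc: (B, D) descriptors of B matching optical/SAR pairs."""
    dist = torch.cdist(opt_desc, sar_desc)   # (B, B) pairwise distances
    pos = dist.diag()                        # distances of the matching pairs
    # Exclude the positives, then take the hardest negative for each anchor,
    # searching both directions (optical->SAR and SAR->optical).
    eye = torch.eye(len(dist), device=dist.device, dtype=torch.bool)
    neg = dist.masked_fill(eye, float('inf'))
    hardest = torch.minimum(neg.min(dim=0).values, neg.min(dim=1).values)
    return F.relu(margin + pos - hardest).mean()
```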
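Finally, the per-region registration and fusion step might look like the following sketch, assuming matched keypoints are already available per region and that each region is registered with an affine model estimated via RANSAC; the model family and the weighted-overlap fusion rule are illustrative assumptions, not the paper's stated scheme.

```python
# Sketch of region-wise registration and fusion: estimate a local affine
# transform per distortion region, warp the SAR image with each transform,
# and average the warped results where their valid areas overlap.
import numpy as np
import cv2

def register_by_region(sar_img, matches_per_region):
    """sar_img: 2-D grayscale; matches_per_region: {id: (src_pts, dst_pts)},
    each an (N, 2) float32 array of SAR and optical coordinates."""
    h, w = sar_img.shape[:2]
    fused = np.zeros((h, w), dtype=np.float32)
    weight = np.zeros((h, w), dtype=np.float32)
    for region_id, (src, dst) in matches_per_region.items():
        M, inliers = cv2.estimateAffine2D(src, dst, method=cv2.RANSAC)
        if M is None:
            continue  # too few consistent matches in this region
        warped = cv2.warpAffine(sar_img.astype(np.float32), M, (w, h))
        # Warp an all-ones mask to track where this region's warp is valid
        valid = cv2.warpAffine(np.ones((h, w), np.float32), M, (w, h))
        fused += warped * valid
        weight += valid
    return fused / np.maximum(weight, 1e-6)
```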
ISSN: 2072-4292
DOI: 10.3390/rs17091642