A Multiscale Framework With Unsupervised Learning for Remote Sensing Image Registration

Yuanxin Ye(Southwest Jiaotong University), Tengfeng Tang(Southwest Jiaotong University), Bai Zhu(Southwest Jiaotong University), Chao Yang(Southwest Jiaotong University), Bo Li(Northwestern Polytechnical University), Siyuan Hao(Qingdao University of Technology)
IEEE Transactions on Geoscience and Remote Sensing
January 1, 2022
Cited by 158

Abstract

Registration for multisensor or multimodal image pairs with a large degree of distortions is a fundamental task for many remote sensing applications. To achieve accurate and low-cost remote sensing image registration, we propose a multiscale framework with unsupervised learning, named MU-Net. Without costly ground truth labels, MU-Net directly learns the end-to-end mapping from the image pairs to their transformation parameters. MU-Net stacks several deep neural network (DNN) models on multiple scales to generate a coarse-to-fine registration pipeline, which prevents the backpropagation from falling into a local extremum and resists significant image distortions. We design a novel loss function paradigm based on structural similarity, which makes MU-Net suitable for various types of multimodal images. MU-Net is compared with traditional feature-based and area-based methods, as well as supervised and other unsupervised learning methods on the optical-optical, optical-infrared, optical-synthetic aperture radar (SAR), and optical-map datasets. Experimental results show that MU-Net achieves more comprehensive and accurate registration performance between these image pairs with geometric and radiometric distortions. We share the code implemented by Pytorch at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/yeyuanxin110/MU-Net</uri> .


Related Papers

No related papers found

Powered by citation graph analysis