Pyramid Attention "Zero-Shot" Network for Single-Image Superresolution

Single-image superresolution (SISR) is one of the requisite image processing methods used to reconstructs a high-resolution (HR) image from a low-resolution (LR) observation. Existing SISR methods mostly rely on supervised learning and are confined to specific training data, where implicit conformit...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on network science and engineering Vol. 9; no. 6; pp. 4028 - 4039
Main Authors Han, Xianjun, Wang, Huabin, Li, Xuejun, Yang, Hongyu
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.11.2022
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Single-image superresolution (SISR) is one of the requisite image processing methods used to reconstructs a high-resolution (HR) image from a low-resolution (LR) observation. Existing SISR methods mostly rely on supervised learning and are confined to specific training data, where implicit conformity matching of the LR images with their high-resolution (HR) counterparts is conducted. However, real LR images rarely obey these constraints, and the degradation process is much more intricate and unknown. In this paper, we proposed a pyramid attention zero-shot (PAZS) network for SISR that sufficiently explores the information hidden in an image itself by learning the patch distribution at a different scale of the image. The proposed pyramid generative model learns the patch distribution in the internal patch learning stage. Then, based on the learned intrinsic property, we explore the corresponding superresolution (SR) image by integrating the intrinsic information into a self-attention mechanism with a progressive generation style at test time. This mechanism consists of internal-external attention and a cross-scale guided fusion module as the connecting passageway. Thus, the PAZS architecture maintains both the global structure and the fine textures of the SR image. This allows SR to be performed without other training datasets and can be adapted to different settings for each image. Experiments on diverse datasets demonstrate that the proposed method outperforms other methods based on single-image training.
ISSN:2327-4697
2334-329X
DOI:10.1109/TNSE.2022.3192471