Deep Light Field Super-Resolution Using Frequency Domain Analysis and Semantic Prior

Light field (LF) camera can simultaneously capture the intensity and direction information of light rays, which has been widely concerned. However, limited by the size of the imaging sensor, the captured LF image (LFI) has a trade-off between spatial and angular resolutions. To this end, this paper...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on multimedia Vol. 24; pp. 3722 - 3737
Main Authors	Chen, Yeyao, Jiang, Gangyi, Jiang, Zhidi, Yu, Mei, Ho, Yo-Sung
Format	Journal Article
Language	English
Published	Piscataway IEEE 2022 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Artificial neural networks convolutional neural network Decomposition Estimation Frequency analysis Frequency domain analysis frequency domain transformation Image restoration Light field super-resolution Light fields Luminous intensity Quality assessment Restoration semantic prior Semantics Superresolution Three-dimensional displays Transformations (mathematics) Visual perception
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Light field (LF) camera can simultaneously capture the intensity and direction information of light rays, which has been widely concerned. However, limited by the size of the imaging sensor, the captured LF image (LFI) has a trade-off between spatial and angular resolutions. To this end, this paper proposes a new LF super-resolution method using frequency domain analysis and semantic prior, which designs a two-stage learning framework to enhance the spatial and angular resolutions of LFI. Specifically, the proposed method first decomposes the spatial and angular information to explore the 4D structure of LFI by using frequency domain transformation, and formulates the LF super-resolution as a frequency restoration process. Then, the decomposed frequency components are recovered in a progressive restoration manner, with new cascaded 2D and 3D convolutional neural networks. To further improve the quality of the reconstructed LFI, especially at the object boundary, the semantic prior is incorporated into the designed network to enhance its representation ability. Finally, the super-resolved LFI is reconstructed by inverse frequency domain transformation. Experimental results show that the proposed method can effectively generate high-resolution LFI, and outperforms other state-of-the-art methods in terms of both subjective visual perception and objective quality evaluation. Moreover, the proposed method can enhance the performance of LF applications such as depth estimation.
ISSN:	1520-9210 1941-0077
DOI:	10.1109/TMM.2021.3106775