PSC diffusion: patch-based simplified conditional diffusion model for low-light image enhancement


Bibliographic Details
Published in: Multimedia Systems, Vol. 30, no. 4
Main Authors: Wan, Fei; Xu, Bingxin; Pan, Weiguo; Liu, Hongzhe
Format: Journal Article
Language: English
Published: Berlin/Heidelberg: Springer Berlin Heidelberg (Springer Nature B.V.), 01.08.2024

Summary: Low-light image enhancement is pivotal for augmenting the utility and recognition of visuals captured under inadequate lighting conditions. Previous methods based on Generative Adversarial Networks (GANs) suffer from mode collapse and pay little attention to the inherent characteristics of low-light images. Motivated by the outstanding performance of diffusion models in image generation, this paper proposes the Patch-based Simplified Conditional Diffusion Model (PSC Diffusion) for low-light image enhancement. Specifically, recognizing the potential for vanishing gradients in extremely low-light images due to their small pixel values, we design a simplified U-Net architecture with a SimpleGate and Parameter-free attention (SimPF) block to predict noise. This architecture uses a parameter-free attention mechanism and fewer convolutional layers to reduce multiplication operations across feature maps, yielding a 12–51% reduction in parameters compared to the U-Nets used in several prominent diffusion models, which also accelerates sampling. In addition, intricate image details are preserved during the diffusion process through a patch-based diffusion strategy integrated with global structure-aware regularization, which effectively enhances the overall quality of the enhanced images. Experiments show that the proposed method achieves richer image details and better perceptual quality, while sampling over 35% faster than comparable diffusion-model-based methods.
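The abstract's two named ingredients can be sketched in a few lines. The following is a minimal NumPy illustration, assuming SimpleGate works as in NAFNet (split the channels in half and multiply the halves) and that the parameter-free attention is SimAM-style energy-based weighting; the exact composition of the paper's SimPF block is not specified in the record, so `simpf_block` is a hypothetical arrangement, not the authors' implementation.

```python
import numpy as np

def simple_gate(x):
    """SimpleGate (as in NAFNet): split channels in half and multiply the
    halves elementwise, replacing a nonlinear activation with a gate.
    x has shape (C, H, W); output has shape (C // 2, H, W)."""
    c = x.shape[0]
    x1, x2 = x[: c // 2], x[c // 2 :]
    return x1 * x2

def simam_attention(x, lam=1e-4):
    """SimAM-style parameter-free attention (assumption: the SimPF block
    uses an energy measure like this). Each position is reweighted by a
    sigmoid of its inverse energy, computed from per-channel spatial
    statistics; no learned parameters are involved."""
    mu = x.mean(axis=(1, 2), keepdims=True)          # per-channel mean
    t = (x - mu) ** 2                                # squared deviation
    n = x.shape[1] * x.shape[2] - 1                  # spatial size minus 1
    e_inv = t / (4 * (t.sum(axis=(1, 2), keepdims=True) / n + lam)) + 0.5
    return x * (1.0 / (1.0 + np.exp(-e_inv)))        # sigmoid weighting

def simpf_block(x):
    """Hypothetical SimPF composition: gate the features, then apply the
    parameter-free attention. Output has shape (C // 2, H, W)."""
    return simam_attention(simple_gate(x))

# Example: an 8-channel, 16x16 feature map is gated down to 4 channels.
feat = np.random.default_rng(0).standard_normal((8, 16, 16))
out = simpf_block(feat)
print(out.shape)  # (4, 16, 16)
```

Because neither stage introduces trainable weights, a block built this way contributes no parameters of its own, which is consistent with the abstract's claim of a 12–51% parameter reduction relative to conventional U-Net blocks.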
ISSN: 0942-4962, 1432-1882
DOI: 10.1007/s00530-024-01391-z