SCNet: A Dual‐Branch Network for Strong Noisy Image Denoising Based on Swin Transformer and ConvNeXt

ABSTRACT Image denoising plays a vital role in restoring high‐quality images from noisy inputs and directly impacts downstream vision tasks. Traditional methods often fail under strong noise, causing detail loss or excessive smoothing. While recent Convolutional Neural Networks‐based and Transformer...

Full description

Saved in:
Bibliographic Details
Published inComputer animation and virtual worlds Vol. 36; no. 3
Main Authors Lin, Chuchao, Zou, Changjun, Xu, Hangbin
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.05.2025
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:ABSTRACT Image denoising plays a vital role in restoring high‐quality images from noisy inputs and directly impacts downstream vision tasks. Traditional methods often fail under strong noise, causing detail loss or excessive smoothing. While recent Convolutional Neural Networks‐based and Transformer‐based models have shown progress, they struggle to jointly capture global structure and preserve local details. To address this, we propose SCNet, a dual‐branch fusion network tailored for strong‐noise denoising. It combines a Swin Transformer branch for global context modeling and a ConvNeXt branch for fine‐grained local feature extraction. Their outputs are adaptively merged via a Feature Fusion Block using joint spatial and channel attention, ensuring semantic consistency and texture fidelity. A multi‐scale upsampling module and the Charbonnier loss further improve structural accuracy and visual quality. Extensive experiments on four benchmark datasets show that SCNet outperforms state‐of‐the‐art methods, especially under severe noise, and proves effective in real‐world tasks such as mural image restoration. SCNet is a dual‐branch denoising framework that integrates Swin Transformer for global context modeling and ConvNeXt for local detail preservation. Through attention‐based feature fusion and multi‐scale reconstruction, SCNet demonstrates strong robustness under severe noise and exhibits promising performance in real‐world mural restoration.
Bibliography:This work was supported by Research Project on Innovation and Entrepreneurship Education of East China Jiaotong University (24hjct18), College Student Innovation and Entrepreneurship Training Program Project, National Natural Science Foundation of China (62162027), Humanities and Social Sciences Research Project in Jiangxi Province's Universities (JC24205).
Funding
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1546-4261
1546-427X
DOI:10.1002/cav.70030