Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution

Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods inc...

Full description

Saved in:

Bibliographic Details
Main Authors	Li, Jianze, Cao, Jiezhang, Zou, Zichen, Su, Xiongfei, Yuan, Xin, Zhang, Yulun, Guo, Yong, Yang, Xiaokang
Format	Journal Article
Language	English
Published	05.10.2024
Subjects	Computer Science - Computer Vision and Pattern Recognition
Online Access	Get full text

Cover

Loading…

Abstract	Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods incur substantial training costs and may constrain the performance of the student model by the teacher's limitations. To tackle these issues, we propose DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a noise-aware discriminator (NAD) to participate in adversarial training, further enhancing the authenticity of the generated content. Additionally, we improve the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's ability to generate fine details. Our experiments demonstrate that, compared with previous diffusion-based methods requiring dozens or even hundreds of steps, our DFOSD attains comparable or even superior results in both quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher performance and efficiency compared with other one-step diffusion methods. We will release code and models at https://github.com/JianzeLi-114/DFOSD.
AbstractList	Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods incur substantial training costs and may constrain the performance of the student model by the teacher's limitations. To tackle these issues, we propose DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a noise-aware discriminator (NAD) to participate in adversarial training, further enhancing the authenticity of the generated content. Additionally, we improve the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's ability to generate fine details. Our experiments demonstrate that, compared with previous diffusion-based methods requiring dozens or even hundreds of steps, our DFOSD attains comparable or even superior results in both quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher performance and efficiency compared with other one-step diffusion methods. We will release code and models at https://github.com/JianzeLi-114/DFOSD.
Author	Su, Xiongfei Yuan, Xin Zhang, Yulun Yang, Xiaokang Zou, Zichen Cao, Jiezhang Li, Jianze Guo, Yong
Author_xml	– sequence: 1 givenname: Jianze surname: Li fullname: Li, Jianze – sequence: 2 givenname: Jiezhang surname: Cao fullname: Cao, Jiezhang – sequence: 3 givenname: Zichen surname: Zou fullname: Zou, Zichen – sequence: 4 givenname: Xiongfei surname: Su fullname: Su, Xiongfei – sequence: 5 givenname: Xin surname: Yuan fullname: Yuan, Xin – sequence: 6 givenname: Yulun surname: Zhang fullname: Zhang, Yulun – sequence: 7 givenname: Yong surname: Guo fullname: Guo, Yong – sequence: 8 givenname: Xiaokang surname: Yang fullname: Yang, Xiaokang
BackLink	https://doi.org/10.48550/arXiv.2410.04224$$DView paper in arXiv
BookMark	eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIBChiYGBmZcDJ4uWQWl2Tm5CSWZObn6boVpaYq-Oel6gaXpBYouGSmpZUWA8UV0vKLFIJSE3N0w_OLclIUPHMT01MVgksLUot0g1KL83NKQbp5GFjTEnOKU3mhNDeDvJtriLOHLtjW-IKizNzEosp4kO3xYNuNCasAAIzqO7k
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	AKY GOX
DOI	10.48550/arxiv.2410.04224
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	2410_04224
GroupedDBID	AKY GOX
ID	FETCH-arxiv_primary_2410_042243
IEDL.DBID	GOX
IngestDate	Sat Oct 12 12:27:35 EDT 2024
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_2410_042243
OpenAccessLink	https://arxiv.org/abs/2410.04224
ParticipantIDs	arxiv_primary_2410_04224
PublicationCentury	2000
PublicationDate	2024-10-05
PublicationDateYYYYMMDD	2024-10-05
PublicationDate_xml	– month: 10 year: 2024 text: 2024-10-05 day: 05
PublicationDecade	2020
PublicationYear	2024
Score	3.8680432
SecondaryResourceType	preprint
Snippet	Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Computer Vision and Pattern Recognition
Title	Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution
URI	https://arxiv.org/abs/2410.04224
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdVxNTwMhEJ20PXkxGjX1ew5eUVwooUdjXWsTbeJHsrdN2R2SJto0a9f48x2gRi-9AgECM8yDeTyAC6eGbLpWicroKqQZjXAmq4SqvSfJcE5S-I38-GTGb3pSDIoO4O9fmFnzPf9K-sDu84rDi7wMKlW6C90sC5St-2mRkpNRimvd_q8dY8xY9C9I5DuwvUZ3eJO2Yxc6tNiDySj40XsinYm8IcLpgkTgV-Fo7n0bHqyQwSM-M2oTkd6CDx_s6PjSLqkR4Yk9Gcg-nOd3r7djEUcvl0kqogwTK-PE1AH0-EJPfUCjbSVnik8ZstrW1npF1jrta3nt1bA-hP6mXo42Vx3DVsYBNxLNBifQWzUtnXLAXLmzuGo_umxvMw
link.rule.ids	228,230,783,888
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Distillation-Free+One-Step+Diffusion+for+Real-World+Image+Super-Resolution&rft.au=Li%2C+Jianze&rft.au=Cao%2C+Jiezhang&rft.au=Zou%2C+Zichen&rft.au=Su%2C+Xiongfei&rft.date=2024-10-05&rft_id=info:doi/10.48550%2Farxiv.2410.04224&rft.externalDocID=2410_04224