Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution

Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods inc...

Full description

Saved in:
Bibliographic Details
Main Authors Li, Jianze, Cao, Jiezhang, Zou, Zichen, Su, Xiongfei, Yuan, Xin, Zhang, Yulun, Guo, Yong, Yang, Xiaokang
Format Journal Article
LanguageEnglish
Published 05.10.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods incur substantial training costs and may constrain the performance of the student model by the teacher's limitations. To tackle these issues, we propose DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a noise-aware discriminator (NAD) to participate in adversarial training, further enhancing the authenticity of the generated content. Additionally, we improve the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's ability to generate fine details. Our experiments demonstrate that, compared with previous diffusion-based methods requiring dozens or even hundreds of steps, our DFOSD attains comparable or even superior results in both quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher performance and efficiency compared with other one-step diffusion methods. We will release code and models at https://github.com/JianzeLi-114/DFOSD.
AbstractList Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods incur substantial training costs and may constrain the performance of the student model by the teacher's limitations. To tackle these issues, we propose DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a noise-aware discriminator (NAD) to participate in adversarial training, further enhancing the authenticity of the generated content. Additionally, we improve the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's ability to generate fine details. Our experiments demonstrate that, compared with previous diffusion-based methods requiring dozens or even hundreds of steps, our DFOSD attains comparable or even superior results in both quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher performance and efficiency compared with other one-step diffusion methods. We will release code and models at https://github.com/JianzeLi-114/DFOSD.
Author Su, Xiongfei
Yuan, Xin
Zhang, Yulun
Yang, Xiaokang
Zou, Zichen
Cao, Jiezhang
Li, Jianze
Guo, Yong
Author_xml – sequence: 1
  givenname: Jianze
  surname: Li
  fullname: Li, Jianze
– sequence: 2
  givenname: Jiezhang
  surname: Cao
  fullname: Cao, Jiezhang
– sequence: 3
  givenname: Zichen
  surname: Zou
  fullname: Zou, Zichen
– sequence: 4
  givenname: Xiongfei
  surname: Su
  fullname: Su, Xiongfei
– sequence: 5
  givenname: Xin
  surname: Yuan
  fullname: Yuan, Xin
– sequence: 6
  givenname: Yulun
  surname: Zhang
  fullname: Zhang, Yulun
– sequence: 7
  givenname: Yong
  surname: Guo
  fullname: Guo, Yong
– sequence: 8
  givenname: Xiaokang
  surname: Yang
  fullname: Yang, Xiaokang
BackLink https://doi.org/10.48550/arXiv.2410.04224$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIBChiYGBmZcDJ4uWQWl2Tm5CSWZObn6boVpaYq-Oel6gaXpBYouGSmpZUWA8UV0vKLFIJSE3N0w_OLclIUPHMT01MVgksLUot0g1KL83NKQbp5GFjTEnOKU3mhNDeDvJtriLOHLtjW-IKizNzEosp4kO3xYNuNCasAAIzqO7k
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2410.04224
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2410_04224
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2410_042243
IEDL.DBID GOX
IngestDate Sat Oct 12 12:27:35 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2410_042243
OpenAccessLink https://arxiv.org/abs/2410.04224
ParticipantIDs arxiv_primary_2410_04224
PublicationCentury 2000
PublicationDate 2024-10-05
PublicationDateYYYYMMDD 2024-10-05
PublicationDate_xml – month: 10
  year: 2024
  text: 2024-10-05
  day: 05
PublicationDecade 2020
PublicationYear 2024
Score 3.8680432
SecondaryResourceType preprint
Snippet Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Title Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution
URI https://arxiv.org/abs/2410.04224
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdVxNTwMhEJ20PXkxGjX1ew5eUVwooUdjXWsTbeJHsrdN2R2SJto0a9f48x2gRi-9AgECM8yDeTyAC6eGbLpWicroKqQZjXAmq4SqvSfJcE5S-I38-GTGb3pSDIoO4O9fmFnzPf9K-sDu84rDi7wMKlW6C90sC5St-2mRkpNRimvd_q8dY8xY9C9I5DuwvUZ3eJO2Yxc6tNiDySj40XsinYm8IcLpgkTgV-Fo7n0bHqyQwSM-M2oTkd6CDx_s6PjSLqkR4Yk9Gcg-nOd3r7djEUcvl0kqogwTK-PE1AH0-EJPfUCjbSVnik8ZstrW1npF1jrta3nt1bA-hP6mXo42Vx3DVsYBNxLNBifQWzUtnXLAXLmzuGo_umxvMw
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Distillation-Free+One-Step+Diffusion+for+Real-World+Image+Super-Resolution&rft.au=Li%2C+Jianze&rft.au=Cao%2C+Jiezhang&rft.au=Zou%2C+Zichen&rft.au=Su%2C+Xiongfei&rft.date=2024-10-05&rft_id=info:doi/10.48550%2Farxiv.2410.04224&rft.externalDocID=2410_04224