Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution
Diffusion models have been achieving excellent performance for real-world image super-resolution (Real-ISR) with considerable computational costs. Current approaches are trying to derive one-step diffusion models from multi-step counterparts through knowledge distillation. However, these methods inc...
Saved in:
Main Authors | , , , , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
05.10.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Diffusion models have been achieving excellent performance for real-world
image super-resolution (Real-ISR) with considerable computational costs.
Current approaches are trying to derive one-step diffusion models from
multi-step counterparts through knowledge distillation. However, these methods
incur substantial training costs and may constrain the performance of the
student model by the teacher's limitations. To tackle these issues, we propose
DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a
noise-aware discriminator (NAD) to participate in adversarial training, further
enhancing the authenticity of the generated content. Additionally, we improve
the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's
ability to generate fine details. Our experiments demonstrate that, compared
with previous diffusion-based methods requiring dozens or even hundreds of
steps, our DFOSD attains comparable or even superior results in both
quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher
performance and efficiency compared with other one-step diffusion methods. We
will release code and models at https://github.com/JianzeLi-114/DFOSD. |
---|---|
AbstractList | Diffusion models have been achieving excellent performance for real-world
image super-resolution (Real-ISR) with considerable computational costs.
Current approaches are trying to derive one-step diffusion models from
multi-step counterparts through knowledge distillation. However, these methods
incur substantial training costs and may constrain the performance of the
student model by the teacher's limitations. To tackle these issues, we propose
DFOSD, a Distillation-Free One-Step Diffusion model. Specifically, we propose a
noise-aware discriminator (NAD) to participate in adversarial training, further
enhancing the authenticity of the generated content. Additionally, we improve
the perceptual loss with edge-aware DISTS (EA-DISTS) to enhance the model's
ability to generate fine details. Our experiments demonstrate that, compared
with previous diffusion-based methods requiring dozens or even hundreds of
steps, our DFOSD attains comparable or even superior results in both
quantitative metrics and qualitative evaluations. Our DFOSD also abtains higher
performance and efficiency compared with other one-step diffusion methods. We
will release code and models at https://github.com/JianzeLi-114/DFOSD. |
Author | Su, Xiongfei Yuan, Xin Zhang, Yulun Yang, Xiaokang Zou, Zichen Cao, Jiezhang Li, Jianze Guo, Yong |
Author_xml | – sequence: 1 givenname: Jianze surname: Li fullname: Li, Jianze – sequence: 2 givenname: Jiezhang surname: Cao fullname: Cao, Jiezhang – sequence: 3 givenname: Zichen surname: Zou fullname: Zou, Zichen – sequence: 4 givenname: Xiongfei surname: Su fullname: Su, Xiongfei – sequence: 5 givenname: Xin surname: Yuan fullname: Yuan, Xin – sequence: 6 givenname: Yulun surname: Zhang fullname: Zhang, Yulun – sequence: 7 givenname: Yong surname: Guo fullname: Guo, Yong – sequence: 8 givenname: Xiaokang surname: Yang fullname: Yang, Xiaokang |
BackLink | https://doi.org/10.48550/arXiv.2410.04224$$DView paper in arXiv |
BookMark | eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIBChiYGBmZcDJ4uWQWl2Tm5CSWZObn6boVpaYq-Oel6gaXpBYouGSmpZUWA8UV0vKLFIJSE3N0w_OLclIUPHMT01MVgksLUot0g1KL83NKQbp5GFjTEnOKU3mhNDeDvJtriLOHLtjW-IKizNzEosp4kO3xYNuNCasAAIzqO7k |
ContentType | Journal Article |
Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | AKY GOX |
DOI | 10.48550/arxiv.2410.04224 |
DatabaseName | arXiv Computer Science arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 2410_04224 |
GroupedDBID | AKY GOX |
ID | FETCH-arxiv_primary_2410_042243 |
IEDL.DBID | GOX |
IngestDate | Sat Oct 12 12:27:35 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-arxiv_primary_2410_042243 |
OpenAccessLink | https://arxiv.org/abs/2410.04224 |
ParticipantIDs | arxiv_primary_2410_04224 |
PublicationCentury | 2000 |
PublicationDate | 2024-10-05 |
PublicationDateYYYYMMDD | 2024-10-05 |
PublicationDate_xml | – month: 10 year: 2024 text: 2024-10-05 day: 05 |
PublicationDecade | 2020 |
PublicationYear | 2024 |
Score | 3.8680432 |
SecondaryResourceType | preprint |
Snippet | Diffusion models have been achieving excellent performance for real-world
image super-resolution (Real-ISR) with considerable computational costs.
Current... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Computer Vision and Pattern Recognition |
Title | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution |
URI | https://arxiv.org/abs/2410.04224 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdVxNTwMhEJ20PXkxGjX1ew5eUVwooUdjXWsTbeJHsrdN2R2SJto0a9f48x2gRi-9AgECM8yDeTyAC6eGbLpWicroKqQZjXAmq4SqvSfJcE5S-I38-GTGb3pSDIoO4O9fmFnzPf9K-sDu84rDi7wMKlW6C90sC5St-2mRkpNRimvd_q8dY8xY9C9I5DuwvUZ3eJO2Yxc6tNiDySj40XsinYm8IcLpgkTgV-Fo7n0bHqyQwSM-M2oTkd6CDx_s6PjSLqkR4Yk9Gcg-nOd3r7djEUcvl0kqogwTK-PE1AH0-EJPfUCjbSVnik8ZstrW1npF1jrta3nt1bA-hP6mXo42Vx3DVsYBNxLNBifQWzUtnXLAXLmzuGo_umxvMw |
link.rule.ids | 228,230,783,888 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Distillation-Free+One-Step+Diffusion+for+Real-World+Image+Super-Resolution&rft.au=Li%2C+Jianze&rft.au=Cao%2C+Jiezhang&rft.au=Zou%2C+Zichen&rft.au=Su%2C+Xiongfei&rft.date=2024-10-05&rft_id=info:doi/10.48550%2Farxiv.2410.04224&rft.externalDocID=2410_04224 |