AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
With recent text-to-image models, anyone can generate deceptively realistic images with arbitrary contents, fueling the growing threat of visual disinformation. A key enabler for generating high-resolution images with low computational cost has been the development of latent diffusion models (LDMs)....
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , , |
Format | Paper |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
27.03.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | With recent text-to-image models, anyone can generate deceptively realistic images with arbitrary contents, fueling the growing threat of visual disinformation. A key enabler for generating high-resolution images with low computational cost has been the development of latent diffusion models (LDMs). In contrast to conventional diffusion models, LDMs perform the denoising process in the low-dimensional latent space of a pre-trained autoencoder (AE) instead of the high-dimensional image space. Despite their relevance, the forensic analysis of LDMs is still in its infancy. In this work we propose AEROBLADE, a novel detection method which exploits an inherent component of LDMs: the AE used to transform images between image and latent space. We find that generated images can be more accurately reconstructed by the AE than real images, allowing for a simple detection approach based on the reconstruction error. Most importantly, our method is easy to implement and does not require any training, yet nearly matches the performance of detectors that rely on extensive training. We empirically demonstrate that AEROBLADE is effective against state-of-the-art LDMs, including Stable Diffusion and Midjourney. Beyond detection, our approach allows for the qualitative analysis of images, which can be leveraged for identifying inpainted regions. We release our code and data at https://github.com/jonasricker/aeroblade . |
---|---|
AbstractList | With recent text-to-image models, anyone can generate deceptively realistic images with arbitrary contents, fueling the growing threat of visual disinformation. A key enabler for generating high-resolution images with low computational cost has been the development of latent diffusion models (LDMs). In contrast to conventional diffusion models, LDMs perform the denoising process in the low-dimensional latent space of a pre-trained autoencoder (AE) instead of the high-dimensional image space. Despite their relevance, the forensic analysis of LDMs is still in its infancy. In this work we propose AEROBLADE, a novel detection method which exploits an inherent component of LDMs: the AE used to transform images between image and latent space. We find that generated images can be more accurately reconstructed by the AE than real images, allowing for a simple detection approach based on the reconstruction error. Most importantly, our method is easy to implement and does not require any training, yet nearly matches the performance of detectors that rely on extensive training. We empirically demonstrate that AEROBLADE is effective against state-of-the-art LDMs, including Stable Diffusion and Midjourney. Beyond detection, our approach allows for the qualitative analysis of images, which can be leveraged for identifying inpainted regions. We release our code and data at https://github.com/jonasricker/aeroblade . |
Author | Lukovnikov, Denis Ricker, Jonas Fischer, Asja |
Author_xml | – sequence: 1 givenname: Jonas surname: Ricker fullname: Ricker, Jonas – sequence: 2 givenname: Denis surname: Lukovnikov fullname: Lukovnikov, Denis – sequence: 3 givenname: Asja surname: Fischer fullname: Fischer, Asja |
BookMark | eNqNi9sKgkAURYco6PoPAz0L05jd3iyVAiEQexbRYxh1Tp2Z-f-M-oCeNqy19lj0kRB6YqR9f-FtlloPxcyYm1JKr9Y6CPyRqMM4O-_TMIp3MueyxRavXsIAMgILlW0JJTUyLS2glVHbNM582OlRXsHIi-l6GTpLgBXVwDKDitBYdt9vzEw8FYOmvBuY_XYi5kmcH47ek-nlwNjiRo6xU4XearVVarEK_P-qN8zXRxM |
ContentType | Paper |
Copyright | 2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
Copyright_xml | – notice: 2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College ProQuest Central SciTech Premium Collection ProQuest Engineering Collection Engineering Database Publicly Available Content Database ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection |
DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest One Academic Engineering Collection |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2331-8422 |
Genre | Working Paper/Pre-Print |
GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PIMPY PQEST PQQKQ PQUKI PRINS PTHSS |
ID | FETCH-proquest_journals_29209001653 |
IEDL.DBID | BENPR |
IngestDate | Fri Nov 08 20:57:06 EST 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-proquest_journals_29209001653 |
OpenAccessLink | https://www.proquest.com/docview/2920900165?pq-origsite=%requestingapplication% |
PQID | 2920900165 |
PQPubID | 2050157 |
ParticipantIDs | proquest_journals_2920900165 |
PublicationCentury | 2000 |
PublicationDate | 20240327 |
PublicationDateYYYYMMDD | 2024-03-27 |
PublicationDate_xml | – month: 03 year: 2024 text: 20240327 day: 27 |
PublicationDecade | 2020 |
PublicationPlace | Ithaca |
PublicationPlace_xml | – name: Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2024 |
Publisher | Cornell University Library, arXiv.org |
Publisher_xml | – name: Cornell University Library, arXiv.org |
SSID | ssj0002672553 |
Score | 3.5253384 |
SecondaryResourceType | preprint |
Snippet | With recent text-to-image models, anyone can generate deceptively realistic images with arbitrary contents, fueling the growing threat of visual... |
SourceID | proquest |
SourceType | Aggregation Database |
SubjectTerms | Image reconstruction Image resolution Qualitative analysis Training |
Title | AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error |
URI | https://www.proquest.com/docview/2920900165 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NS8MwFH-4FsGbn_gxR0CvxZGkaetFNtc6ZZtjTNhtLOkLeLCdbXf1bzcpnTsIO4ZAeHmE3-995T2AexSSSmEst4hi1-OMKk8apvWkSLV5T1HkM_vBeTwRww_-tvAXTcCtbMoqt5hYA3WaKxsjf7BTlSJroPhP62_PTo2y2dVmhEYLXGo8ha4Dbj-eTGd_URYqAmMzs39AW7NHcgzudLXG4gQOMDuFw7roUpVnkPbi2Xt_ZHTxSObNqAYvKRDJAKu6RCojuSYjYw5mFRl8ar2xsS3y-mVAoCR1tp_0NlVum1GmWBDrS-46wpK4KPLiHO6SeP489LaiLZvnUy53l2UX4GR5hpdA1IpzzbRWSihOAwwZ-ir0OdO-1imXV9Ded9L1_u0bOKKGr215FQ3a4BhZ8dbwbSU70AqTl06jWrMa_8S_13SLeQ |
link.rule.ids | 783,787,12777,21400,33385,33756,43612,43817 |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8NAEB60RfTmEx9VF_QalH2l8SLVJqaaVpEIvYVmMws9mNQk_f_uhtQehJ4XltlhmG92Xh_ALcqUptJEbh7Fe4czqpzUIK2Tykwbe_I8weyA83giwy_-OhXTNuFWtW2VK5_YOOqsUDZHfmdZlTwboIjHxY9jWaNsdbWl0NiGLmcGq-2kePDyl2Oh0jURM_vnZhvsCPah-zFbYHkAW5gfwk7TcqmqI8gG_uf7U2Q08UDilqjBCUpEMsS6aZDKSaFJZILBvCbDudZLm9kio2_jAirS1PrJYFkXdhVlhiWxP8n1Pljil2VRHsNN4MfPobMSLWmNp0rWT2Un0MmLHE-BqBnnmmmtlFScuthnKFRfcKaF1hlPz6C36abzzcfXsBvG4yiJRpO3C9ijBrltoxV1e9AxcuOlQd46vWrU-wuh7Irt |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=AEROBLADE%3A+Training-Free+Detection+of+Latent+Diffusion+Images+Using+Autoencoder+Reconstruction+Error&rft.jtitle=arXiv.org&rft.au=Ricker%2C+Jonas&rft.au=Lukovnikov%2C+Denis&rft.au=Fischer%2C+Asja&rft.date=2024-03-27&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422 |