AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error

Bibliographic Details
Published in	arXiv.org
Main Authors	Ricker, Jonas; Lukovnikov, Denis; Fischer, Asja
Format	Paper
Language	English
Published	Ithaca: Cornell University Library, arXiv.org, 27.03.2024
Subjects	Image reconstruction; Image resolution; Qualitative analysis; Training

Abstract	With recent text-to-image models, anyone can generate deceptively realistic images with arbitrary contents, fueling the growing threat of visual disinformation. A key enabler for generating high-resolution images with low computational cost has been the development of latent diffusion models (LDMs). In contrast to conventional diffusion models, LDMs perform the denoising process in the low-dimensional latent space of a pre-trained autoencoder (AE) instead of the high-dimensional image space. Despite their relevance, the forensic analysis of LDMs is still in its infancy. In this work, we propose AEROBLADE, a novel detection method that exploits an inherent component of LDMs: the AE used to transform images between image and latent space. We find that generated images can be more accurately reconstructed by the AE than real images, allowing for a simple detection approach based on the reconstruction error. Most importantly, our method is easy to implement and does not require any training, yet nearly matches the performance of detectors that rely on extensive training. We empirically demonstrate that AEROBLADE is effective against state-of-the-art LDMs, including Stable Diffusion and Midjourney. Beyond detection, our approach allows for the qualitative analysis of images, which can be leveraged for identifying inpainted regions. We release our code and data at https://github.com/jonasricker/aeroblade.
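
The detection idea described in the abstract (pass an image through the autoencoder and check how well it is reconstructed) can be sketched with a toy example. This is only an illustration under loud assumptions and not the authors' implementation: `encode` and `decode` are hypothetical stand-ins (average pooling and nearest-neighbour upsampling) for an LDM's pre-trained AE, the error is plain mean squared error, and the `threshold` value is arbitrary. The released repository linked above is the authoritative reference.

```python
"""Toy sketch of reconstruction-error detection (NOT the AEROBLADE code)."""


def encode(img):
    """Hypothetical encoder: 2x2 average pooling into a smaller 'latent' grid."""
    h, w = len(img), len(img[0])
    return [
        [(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
         for x in range(0, w, 2)]
        for y in range(0, h, 2)
    ]


def decode(latent):
    """Hypothetical decoder: nearest-neighbour upsampling back to image size."""
    out = []
    for row in latent:
        upsampled = [v for v in row for _ in range(2)]
        out.append(upsampled)
        out.append(list(upsampled))
    return out


def reconstruction_error(img):
    """Mean squared error between an image and its AE reconstruction."""
    rec = decode(encode(img))
    n = len(img) * len(img[0])
    return sum(
        (a - b) ** 2
        for row_a, row_b in zip(img, rec)
        for a, b in zip(row_a, row_b)
    ) / n


def is_generated(img, threshold=0.01):
    """Flag an image as generated if the AE reconstructs it almost perfectly.

    The threshold is an arbitrary illustrative value (assumption).
    """
    return reconstruction_error(img) < threshold


# A "generated" image: it came out of the decoder, so it lies in the
# decoder's output range and is reconstructed without loss.
generated = decode([[0.25, 0.75], [0.5, 1.0]])

# A "real" image: a checkerboard whose fine detail is lost in the latent.
real = [[float((x + y) % 2) for x in range(4)] for y in range(4)]

print(reconstruction_error(generated), is_generated(generated))
print(reconstruction_error(real), is_generated(real))
```

In this toy setting the decoder output is reconstructed exactly, while the checkerboard loses all of its detail. No training is involved: the only free parameter is the decision threshold, which in practice would have to be chosen on reference data.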
Copyright	2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Discipline	Physics
EISSN	2331-8422
Genre	Working Paper/Pre-Print
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false