HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memo...

Full description

Saved in:
Bibliographic Details
Published inProceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 6527 - 6536
Main Authors Ruiz, Nataniel, Li, Yuanzhen, Jampani, Varun, Wei, Wei, Hou, Tingbo, Pritch, Yael, Wadhwa, Neal, Rubinstein, Michael, Aberman, Kfir
Format Conference Proceeding
LanguageEnglish
Published IEEE 16.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth-a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10,000x smaller than a normal DreamBooth model.
AbstractList Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth-a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10,000x smaller than a normal DreamBooth model.
Author Rubinstein, Michael
Ruiz, Nataniel
Hou, Tingbo
Li, Yuanzhen
Wadhwa, Neal
Aberman, Kfir
Wei, Wei
Jampani, Varun
Pritch, Yael
Author_xml – sequence: 1
  givenname: Nataniel
  surname: Ruiz
  fullname: Ruiz, Nataniel
  organization: Google Research
– sequence: 2
  givenname: Yuanzhen
  surname: Li
  fullname: Li, Yuanzhen
  organization: Google Research
– sequence: 3
  givenname: Varun
  surname: Jampani
  fullname: Jampani, Varun
  organization: Google Research
– sequence: 4
  givenname: Wei
  surname: Wei
  fullname: Wei, Wei
  organization: Google Research
– sequence: 5
  givenname: Tingbo
  surname: Hou
  fullname: Hou, Tingbo
  organization: Google Research
– sequence: 6
  givenname: Yael
  surname: Pritch
  fullname: Pritch, Yael
  organization: Google Research
– sequence: 7
  givenname: Neal
  surname: Wadhwa
  fullname: Wadhwa, Neal
  organization: Google Research
– sequence: 8
  givenname: Michael
  surname: Rubinstein
  fullname: Rubinstein, Michael
  organization: Google Research
– sequence: 9
  givenname: Kfir
  surname: Aberman
  fullname: Aberman, Kfir
  organization: Google Research
BookMark eNotj9FKwzAUhqMoOGffYBd5gdaTnCZtvNPp3GC6IdPbkbanWt2akQR0Pr1Dvfrg4-OH_5yd9K4nxkYCMiHAXI5flk9KFoiZBJlnAFrmRywxhSlRASo8mGM2EKAx1UaYM5aE8A4AKIXQphywxXS_I3_ryW5vnItvV_xXPFL8dP4j8NZ5PrEh8iX54Hq76b5t7FzPXctX9BXT6NLZ1r4Sf3ANbcIFO23tJlDyzyF7ntytxtN0vrifja_naScKHdOiNrmWUgvS0uhc5gVQ3VBb5pVRFWioDkGpqqYuSgCJppKIbSNQ2BaUNDhko7_djojWO99trd-vD0eV1gLxB6APUUE
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/CVPR52733.2024.00624
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL) - NZ
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL) - NZ
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
EISBN 9798350353006
EISSN 1063-6919
EndPage 6536
ExternalDocumentID 10656613
Genre orig-research
GroupedDBID 6IE
6IH
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
OCL
RIE
RIL
RIO
ID FETCH-LOGICAL-i176t-7c9462261e629642470ecdef84b95b060bc9485bdc7800239b233fd131af05293
IEDL.DBID RIE
IngestDate Wed Aug 27 02:00:49 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i176t-7c9462261e629642470ecdef84b95b060bc9485bdc7800239b233fd131af05293
PageCount 10
ParticipantIDs ieee_primary_10656613
PublicationCentury 2000
PublicationDate 2024-June-16
PublicationDateYYYYMMDD 2024-06-16
PublicationDate_xml – month: 06
  year: 2024
  text: 2024-June-16
  day: 16
PublicationDecade 2020
PublicationTitle Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online)
PublicationTitleAbbrev CVPR
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003211698
Score 2.5379434
Snippet Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles,...
SourceID ieee
SourceType Publisher
StartPage 6527
SubjectTerms Computer vision
diffusion
diffusion models
DreamBooth
Face recognition
GAN
Generative AI
generative models
Graphics processing units
HyperNetworks
Memory management
personalization
Semantics
subject driven personalization
Text to image
Title HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
URI https://ieeexplore.ieee.org/document/10656613
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELagE1N5FPGWB1aXJE7smJFCVZAoFWpRtyp-RELQBjXpwq_nzkkLQkJiszw4kU_2fZ_vvjtCLm3i0kiajCmrOYtl4pjOZcLAO2WBDbmMfC-Cx6EYTOKHaTJtxOpeC-Oc88lnrotDH8u3hVnhUxmccEQf2KN2G5hbLdbaPKhwoDJCpY08LgzUVe9l9Iz1xTjQwAiLZAvUtf9oouJ9SL9Nhuuv16kjb91Vpbvm81dhxn__3i7pfMv16GjjiPbIllvsk3aDL2lzessD8jQA0rm8BZg4vynAQtfUTwzrTPCSAn6l_ays6GiN0GuNJi1yOkaGXBXsfg4XEMUOau9lh0z6d-PegDUNFdhrKEXFpFGxALwVOoHR1iiWgTPW5WmsVaIDEWiDxWK0NTL1qlcdcZ6DxcIsx4ggPyStRbFwR4QqB8QsMCbKUxvzPFUmAhtbJeHu1LDAMengBs0-6poZs_XenPwxf0p20EiYhBWKM9Kqlit3Du6-0hfezF_KpqlM
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZQGWDiVcQbD6wpiZ3YMSOFKoU2VKhF3ar6EQlBG9SkC7-es5MWhITEZnlwIp_s-z7ffXcIXenIxISrqSe0pF7II-PJjEceeKeprwPKietF0E9ZMgofxtG4Fqs7LYwxxiWfmZYduli-ztXSPpXBCbfow_ao3QTHH5FKrrV-UqFAZpiIa4Fc4Ivr9svg2VYYo0AEiS2Tzayy_UcbFedFOjsoXX2_Sh55ay1L2VKfv0oz_vsHd1HzW7CHB2tXtIc2zHwf7dQIE9fntzhATwnQzsUdAMXZbQ42usFuIq1ywQsMCBZ3pkWJByuMXqk0cZ7hoeXIZe51Z3AFYdtD7b1oolHnfthOvLqlgvcacFZ6XImQAeIKDLPxVhJy3yhtsjiUIpI-86Wy5WKkVjx2uldJKM3AZsE0szFBeoga83xujhAWBqiZrxTJYh3SLBaKgJW14HB7SljgGDXtBk0-qqoZk9XenPwxf4m2kmG_N-l108dTtG0NZlOyAnaGGuViac7B-Zfywpn8C9JqrJY
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=HyperDreamBooth%3A+HyperNetworks+for+Fast+Personalization+of+Text-to-Image+Models&rft.au=Ruiz%2C+Nataniel&rft.au=Li%2C+Yuanzhen&rft.au=Jampani%2C+Varun&rft.au=Wei%2C+Wei&rft.date=2024-06-16&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=6527&rft.epage=6536&rft_id=info:doi/10.1109%2FCVPR52733.2024.00624&rft.externalDocID=10656613