HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memo...
Saved in:
Published in | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 6527 - 6536 |
---|---|
Main Authors | , , , , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
16.06.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth-a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10,000x smaller than a normal DreamBooth model. |
---|---|
AbstractList | Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles, while retaining high-fidelity to their identities. However, the process of personalization presents inherent challenges in terms of time and memory requirements. Fine-tuning each personalized model needs considerable GPU time investment, and storing a personalized model per subject can be demanding in terms of storage capacity. To overcome these challenges, we propose HyperDreamBooth-a hypernetwork capable of efficiently generating a small set of personalized weights from a single image of a person. By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications. Our method achieves personalization on faces in roughly 20 seconds, 25x faster than DreamBooth and 125x faster than Textual Inversion, using as few as one reference image, with the same quality and style diversity as DreamBooth. Also our method yields a model that is 10,000x smaller than a normal DreamBooth model. |
Author | Rubinstein, Michael Ruiz, Nataniel Hou, Tingbo Li, Yuanzhen Wadhwa, Neal Aberman, Kfir Wei, Wei Jampani, Varun Pritch, Yael |
Author_xml | – sequence: 1 givenname: Nataniel surname: Ruiz fullname: Ruiz, Nataniel organization: Google Research – sequence: 2 givenname: Yuanzhen surname: Li fullname: Li, Yuanzhen organization: Google Research – sequence: 3 givenname: Varun surname: Jampani fullname: Jampani, Varun organization: Google Research – sequence: 4 givenname: Wei surname: Wei fullname: Wei, Wei organization: Google Research – sequence: 5 givenname: Tingbo surname: Hou fullname: Hou, Tingbo organization: Google Research – sequence: 6 givenname: Yael surname: Pritch fullname: Pritch, Yael organization: Google Research – sequence: 7 givenname: Neal surname: Wadhwa fullname: Wadhwa, Neal organization: Google Research – sequence: 8 givenname: Michael surname: Rubinstein fullname: Rubinstein, Michael organization: Google Research – sequence: 9 givenname: Kfir surname: Aberman fullname: Aberman, Kfir organization: Google Research |
BookMark | eNotj9FKwzAUhqMoOGffYBd5gdaTnCZtvNPp3GC6IdPbkbanWt2akQR0Pr1Dvfrg4-OH_5yd9K4nxkYCMiHAXI5flk9KFoiZBJlnAFrmRywxhSlRASo8mGM2EKAx1UaYM5aE8A4AKIXQphywxXS_I3_ryW5vnItvV_xXPFL8dP4j8NZ5PrEh8iX54Hq76b5t7FzPXctX9BXT6NLZ1r4Sf3ANbcIFO23tJlDyzyF7ntytxtN0vrifja_naScKHdOiNrmWUgvS0uhc5gVQ3VBb5pVRFWioDkGpqqYuSgCJppKIbSNQ2BaUNDhko7_djojWO99trd-vD0eV1gLxB6APUUE |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/CVPR52733.2024.00624 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) - NZ IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) - NZ url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences |
EISBN | 9798350353006 |
EISSN | 1063-6919 |
EndPage | 6536 |
ExternalDocumentID | 10656613 |
Genre | orig-research |
GroupedDBID | 6IE 6IH 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP OCL RIE RIL RIO |
ID | FETCH-LOGICAL-i176t-7c9462261e629642470ecdef84b95b060bc9485bdc7800239b233fd131af05293 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 02:00:49 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i176t-7c9462261e629642470ecdef84b95b060bc9485bdc7800239b233fd131af05293 |
PageCount | 10 |
ParticipantIDs | ieee_primary_10656613 |
PublicationCentury | 2000 |
PublicationDate | 2024-June-16 |
PublicationDateYYYYMMDD | 2024-06-16 |
PublicationDate_xml | – month: 06 year: 2024 text: 2024-June-16 day: 16 |
PublicationDecade | 2020 |
PublicationTitle | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) |
PublicationTitleAbbrev | CVPR |
PublicationYear | 2024 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0003211698 |
Score | 2.5379434 |
Snippet | Personalization has emerged as a prominent aspect within the field of generative AI, enabling the synthesis of individuals in diverse contexts and styles,... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 6527 |
SubjectTerms | Computer vision diffusion diffusion models DreamBooth Face recognition GAN Generative AI generative models Graphics processing units HyperNetworks Memory management personalization Semantics subject driven personalization Text to image |
Title | HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models |
URI | https://ieeexplore.ieee.org/document/10656613 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELagE1N5FPGWB1aXJE7smJFCVZAoFWpRtyp-RELQBjXpwq_nzkkLQkJiszw4kU_2fZ_vvjtCLm3i0kiajCmrOYtl4pjOZcLAO2WBDbmMfC-Cx6EYTOKHaTJtxOpeC-Oc88lnrotDH8u3hVnhUxmccEQf2KN2G5hbLdbaPKhwoDJCpY08LgzUVe9l9Iz1xTjQwAiLZAvUtf9oouJ9SL9Nhuuv16kjb91Vpbvm81dhxn__3i7pfMv16GjjiPbIllvsk3aDL2lzessD8jQA0rm8BZg4vynAQtfUTwzrTPCSAn6l_ays6GiN0GuNJi1yOkaGXBXsfg4XEMUOau9lh0z6d-PegDUNFdhrKEXFpFGxALwVOoHR1iiWgTPW5WmsVaIDEWiDxWK0NTL1qlcdcZ6DxcIsx4ggPyStRbFwR4QqB8QsMCbKUxvzPFUmAhtbJeHu1LDAMengBs0-6poZs_XenPwxf0p20EiYhBWKM9Kqlit3Du6-0hfezF_KpqlM |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZQGWDiVcQbD6wpiZ3YMSOFKoU2VKhF3ar6EQlBG9SkC7-es5MWhITEZnlwIp_s-z7ffXcIXenIxISrqSe0pF7II-PJjEceeKeprwPKietF0E9ZMgofxtG4Fqs7LYwxxiWfmZYduli-ztXSPpXBCbfow_ao3QTHH5FKrrV-UqFAZpiIa4Fc4Ivr9svg2VYYo0AEiS2Tzayy_UcbFedFOjsoXX2_Sh55ay1L2VKfv0oz_vsHd1HzW7CHB2tXtIc2zHwf7dQIE9fntzhATwnQzsUdAMXZbQ42usFuIq1ywQsMCBZ3pkWJByuMXqk0cZ7hoeXIZe51Z3AFYdtD7b1oolHnfthOvLqlgvcacFZ6XImQAeIKDLPxVhJy3yhtsjiUIpI-86Wy5WKkVjx2uldJKM3AZsE0szFBeoga83xujhAWBqiZrxTJYh3SLBaKgJW14HB7SljgGDXtBk0-qqoZk9XenPwxf4m2kmG_N-l108dTtG0NZlOyAnaGGuViac7B-Zfywpn8C9JqrJY |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=HyperDreamBooth%3A+HyperNetworks+for+Fast+Personalization+of+Text-to-Image+Models&rft.au=Ruiz%2C+Nataniel&rft.au=Li%2C+Yuanzhen&rft.au=Jampani%2C+Varun&rft.au=Wei%2C+Wei&rft.date=2024-06-16&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=6527&rft.epage=6536&rft_id=info:doi/10.1109%2FCVPR52733.2024.00624&rft.externalDocID=10656613 |