Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation

Diffusion models have recently achieved remarkable progress in generating realistic images. However, chal-lenges remain in accurately understanding and synthesizing the layout requirements in the textual prompts. To align the generated image with layout instructions, we present a training-free layou...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) pp. 6624 - 6634
Main Authors Gong, Biao, Huang, Siteng, Feng, Yutong, Zhang, Shiwei, Li, Yuyuan, Liu, Yu
Format Conference Proceeding
LanguageEnglish
Published IEEE 16.06.2024
Subjects
Online AccessGet full text

Cover

Loading…