SCENE-BASED TEXT-TO-IMAGE GENERATION WITH HUMAN PRIORS
In one embodiment, a method includes accessing a text input and a scene input corresponding to the text input, wherein the scene input comprises semantic segmentations, generating text tokens for the text input and scene tokens for the scene input by machine-learning models, generating predicted ima...
Format | Patent |
---|---|
Language | English, French |
Published | 11.07.2024 |