A new approach for personalizing text-to-image diffusion models to user needs has been proposed, enabling the synthesis of novel images of a specific subject in different contexts. This method involves fine-tuning a pretrained model with a few input images of the subject and using a unique identifier to embed the subject in the output domain. As a result, fully-novel photorealistic images of the subject can be synthesized in diverse scenes, poses, views, and lighting conditions that are not present in the reference images.
A new approach for personalizing text-to-image diffusion models to user needs has been proposed, enabling the synthesis of novel images of a specific subject in different contexts. This method involves fine-tuning a pretrained model with a few input images of the subject and using a unique identifier to embed the subject in the output domain. As a result, fully-novel photorealistic images of the subject can be synthesized in diverse scenes, poses, views, and lighting conditions that are not present in the reference images.
instance_prompt
stringThe prompt used to describe your training images. Should be something like:'a photo of [identifier] [class noun]'. where [identifier] should be something unique
class_prompt
stringThe prompt used to describe your class. Should be something like:'a photo of a [class noun]'. where [class_noun] describe a class of instance
center_crop
booleanWhether to center crop the images before resizing to resolution
Default value: false
learning_rate
numberInitial learning rate (after the potential warmup period) to use.
Default value: 0.000001
lr_scheduler
stringThe scheduler type to use during training.
Default value: constant
Allowed values: linear
cosine
cosine_with_restarts
polynomial
constant
constant_with_warmup
gradient_checkpointing
booleanWhether or not to use gradient checkpointing to save memory at the expense of slower backward pass.
Default value: false
webhook
fileThe webhook to call when inference is done, by default you will get the output in the response of your inference request