deepinfra/dreambooth cover image

deepinfra/dreambooth

A new approach for personalizing text-to-image diffusion models to user needs has been proposed, enabling the synthesis of novel images of a specific subject in different contexts. This method involves fine-tuning a pretrained model with a few input images of the subject and using a unique identifier to embed the subject in the output domain. As a result, fully-novel photorealistic images of the subject can be synthesized in diverse scenes, poses, views, and lighting conditions that are not present in the reference images.

A new approach for personalizing text-to-image diffusion models to user needs has been proposed, enabling the synthesis of novel images of a specific subject in different contexts. This method involves fine-tuning a pretrained model with a few input images of the subject and using a unique identifier to embed the subject in the output domain. As a result, fully-novel photorealistic images of the subject can be synthesized in diverse scenes, poses, views, and lighting conditions that are not present in the reference images.

Public
$0.0005/sec

HTTP/cURL API

 

Input fields

instance_promptstring

The prompt used to describe your training images. Should be something like:'a photo of [identifier] [class noun]'. where [identifier] should be something unique


class_promptstring

The prompt used to describe your class. Should be something like:'a photo of a [class noun]'. where [class_noun] describe a class of instance


instance_datastring

zip file containing images of instance you want to train on


output_model_namestring

The name of the model to be saved. Must start with your github username


max_train_stepsinteger

The maximum number of training steps to run

Default value: 1000


train_batch_sizeinteger

The batch size to use during training

Default value: 1


center_cropboolean

Whether to center crop the images before resizing to resolution

Default value: false


learning_ratenumber

Initial learning rate (after the potential warmup period) to use.

Default value: 0.000001


lr_schedulerstring

The scheduler type to use during training.

Default value: constant

Allowed values: linearcosinecosine_with_restartspolynomialconstantconstant_with_warmup


lr_warmup_stepsinteger

Number of steps for the warmup in the lr scheduler

Default value: 0


use_8bit_adamboolean

Whether or not to use 8-bit Adam from bitsandbytes.

Default value: false


adam_beta1number

The beta1 parameter for the Adam optimizer.

Default value: 0.9


adam_beta2number

The beta2 parameter for the Adam optimizer.

Default value: 0.999


adam_weight_decaynumber

Weight decay to use

Default value: 0.01


adam_epsilonnumber

Epsilon value for the Adam optimizer

Default value: 1e-8


max_grad_normnumber

Max gradient norm.

Default value: 1


gradient_checkpointingboolean

Whether or not to use gradient checkpointing to save memory at the expense of slower backward pass.

Default value: false


webhookfile

The webhook to call when inference is done, by default you will get the output in the response of your inference request

Input Schema

Output Schema


© 2023 Deep Infra. All rights reserved.

Discord Logo