CompVis/stable-diffusion-v1-4 cover image

CompVis/stable-diffusion-v1-4

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.

Public

HTTP/cURL API

You can use cURL or any other http client to run inferences:

curl -X POST \
    -H "Authorization: bearer $DEEPINFRA_TOKEN"  \
    -F 'prompt=A photo of an astronaut riding a horse on Mars.'  \
    'https://api.deepinfra.com/v1/inference/CompVis/stable-diffusion-v1-4'

which will give you back something similar to:

{
  "images": [
    "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAIAAACQd1PeAAAADElEQVQI12PQz3wAAAJDAXkkWn+MAAAAAElFTkSuQmCC"
  ],
  "nsfw_content_detected": [
    false
  ],
  "seed": 42,
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0
  }
}

Input fields

promptstring

text prompt


negative_promptstring

negative text prompt

Default value: ""


imagestring

starting image to use


num_imagesinteger

number of images to generate

Default value: 1

Range: 1 ≤ num_images ≤ 4


num_inference_stepsinteger

number of denoising steps

Default value: 25

Range: 1 ≤ num_inference_steps ≤ 512


guidance_scalenumber

classifier-free guidance, higher means follow prompt more closely

Default value: 7.5

Range: 1 ≤ guidance_scale ≤ 20


strengthnumber

how much to follow the input image. 1 means ignore the image, 0 means follow the image exactly

Default value: 0.8

Range: 0 ≤ strength ≤ 1


widthinteger

image width in px

Default value: 768

Allowed values: 1282563844485125766407047688328969601024


heightinteger

image height in px

Default value: 768

Allowed values: 1282563844485125766407047688328969601024


seedinteger

random seed, empty means random

Range: 0 ≤ seed


use_compelboolean

use compel library for weighted prompts

Default value: false


lorastring

Name of LoRA weights


lora_scalenumber

LoRA coefficients to apply

Default value: 1


webhookfile

The webhook to call when inference is done, by default you will get the output in the response of your inference request

Input Schema

Output Schema