FastVideo/
$0.0360 / second
LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.
You can use cURL or any other HTTP client to run inference:
curl -X POST \
-d '{"prompt": "A young man gently holds out a single white daisy to a young woman in a sunlit park, her face brightening into a quiet smile as she reaches to take it. Bright daylight, lush green trees behind them. Sharp focus, vivid colors."}' \
-H "Authorization: bearer $DEEPINFRA_TOKEN" \
-H 'Content-Type: application/json' \
'https://api.deepinfra.com/v1/inference/FastVideo/LTX2-Distilled-Diffusers'
The response will look similar to this:
{
  "video_url": "/model/inference/pyramid_sample.mp4",
  "seed": "12345",
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0,
    "output_length": 0
  }
}
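For readers who prefer Python to cURL, the same request can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper names `build_request`, `summarize_response`, and `generate_video` are hypothetical, the 600-second timeout is an assumed value, and the response fields are taken only from the sample response above, so a live response may differ.

```python
import json
import urllib.request

# Endpoint copied from the cURL example above.
ENDPOINT = "https://api.deepinfra.com/v1/inference/FastVideo/LTX2-Distilled-Diffusers"


def build_request(prompt: str, token: str) -> urllib.request.Request:
    """Build the same POST request as the cURL example (hypothetical helper)."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        ENDPOINT,
        data=body,
        headers={
            "Authorization": f"bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def summarize_response(raw: str) -> dict:
    """Pick out the fields shown in the sample response (hypothetical helper)."""
    payload = json.loads(raw)
    status = payload.get("inference_status") or {}
    return {
        "video_url": payload.get("video_url"),
        "seed": payload.get("seed"),
        "status": status.get("status"),
        "cost": status.get("cost"),
    }


def generate_video(prompt: str, token: str, timeout_s: float = 600.0) -> dict:
    """Send the request and summarize the reply.

    Calling this performs a real network request against the endpoint.
    The default timeout is an assumption, not a documented value.
    """
    request = build_request(prompt, token)
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return summarize_response(response.read().decode("utf-8"))
```

To run it, read the token from the `DEEPINFRA_TOKEN` environment variable (as the cURL example does) and call `generate_video(prompt, token)`. Note that `video_url` in the sample response is a relative path, so how to resolve it to a downloadable URL is not specified here.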
© 2026 DeepInfra. All rights reserved.