Nemotron 3 Nano Omni — the first multimodal model in the Nemotron 3 family, now on DeepInfra!
ByteDance/
$4.300
/ 1M tokens
*A new-generation professional-grade multimodal video creation model developed, supports video generation with multimodal reference inputs including images, videos and audio.

You can use cURL or any other http client to run inferences:
curl -X POST \
-d '{"prompt": "A kitten is yawning at the camera"}' \
-H "Authorization: bearer $DEEPINFRA_TOKEN" \
-H 'Content-Type: application/json' \
'https://api.deepinfra.com/v1/inference/ByteDance/Seedance-2.0'
which will give you back something similar to:
{
"video_url": "/model/inference/seedance_sample.mp4",
"status": "ok",
"out_tokens": 1000,
"request_id": null,
"inference_status": {
"status": "unknown",
"runtime_ms": 0,
"cost": 0.0,
"tokens_generated": 0,
"tokens_input": 0,
"output_length": 0
}
}
© 2026 Deep Infra. All rights reserved.