Nemotron 3 Nano Omni — the first multimodal model in the Nemotron 3 family, now on DeepInfra!

PrunaAI/
$0.025 / second
*Pruna's talking head video generation model. Provide a portrait image and either a speech script or an audio file, and the model generates a realistic video of the person speaking. Supports multiple voices, languages, and output resolutions.
You can use cURL or any other http client to run inferences:
curl -X POST \
-d '{"image": "https://example.com/portrait.jpg"}' \
-H "Authorization: bearer $DEEPINFRA_TOKEN" \
-H 'Content-Type: application/json' \
'https://api.deepinfra.com/v1/inference/PrunaAI/p-video-avatar'
which will give you back something similar to:
{
"video_url": "https://api.pruna.ai/v1/predictions/delivery/abc/output.mp4",
"request_id": null,
"inference_status": {
"status": "unknown",
"runtime_ms": 0,
"cost": 0.0,
"tokens_generated": 0,
"tokens_input": 0,
"output_length": 0
}
}
© 2026 Deep Infra. All rights reserved.