Bria/
$0.14 / second
Identify and segment objects across video frames using coordinate key points. Mark a point on the object you want, and the model determines which object to mask.

You can use cURL or any other HTTP client to run inference:
curl -X POST \
  -d '{"video": "https://example.com/video.mp4", "key_points": [{"type": "positive", "x": 765, "y": 344}, {"type": "negative", "x": 200, "y": 200}]}' \
  -H "Authorization: bearer $DEEPINFRA_TOKEN" \
  -H 'Content-Type: application/json' \
  'https://api.deepinfra.com/v1/inference/Bria/video_mask_by_key_points'
The response will look similar to:
{
  "status": "COMPLETED",
  "video_url": null,
  "mask_url": null,
  "request_id": null,
  "inference_status": {
    "status": "unknown",
    "runtime_ms": 0,
    "cost": 0.0,
    "tokens_generated": 0,
    "tokens_input": 0
  }
}
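The same request can also be sent from Python. The following is a minimal sketch using only the standard library, assuming the endpoint, headers, and JSON fields shown in the cURL example above. The helper names `build_request` and `run_inference` are illustrative and not part of any official client, and error handling is omitted.

```python
import json
import urllib.request

# Endpoint copied from the cURL example above.
API_URL = "https://api.deepinfra.com/v1/inference/Bria/video_mask_by_key_points"


def build_request(video_url, key_points, token):
    """Build the POST request (hypothetical helper mirroring the cURL call)."""
    payload = {"video": video_url, "key_points": key_points}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def run_inference(video_url, key_points, token):
    """Send the request and return the decoded JSON response as a dict."""
    request = build_request(video_url, key_points, token)
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Example usage (needs network access and a valid token):
#
#   import os
#   points = [
#       {"type": "positive", "x": 765, "y": 344},
#       {"type": "negative", "x": 200, "y": 200},
#   ]
#   result = run_inference(
#       "https://example.com/video.mp4", points, os.environ["DEEPINFRA_TOKEN"]
#   )
#   print(result["status"], result["mask_url"])
```

Keeping request construction separate from sending makes it possible to inspect the payload and headers without making a network call.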
© 2026 Deep Infra. All rights reserved.