We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

DeepInfra raises $107M Series B to scale the inference cloud — read the announcement

nvidia logo

nvidia/

Nemotron-3.5-ASR-Streaming-Multilingual-0.6b

$0.00020

/ minute

Nemotron 3.5 ASR Streaming Multilingual is an open 0.6B-parameter prompt-conditioned cache-aware FastConformer-RNNT model, engineered for low-latency streaming transcription across 40+ languages. It powers real-time captioning, voice agents, and multilingual transcription pipelines—replacing separate per-language Whisper deployments with a single inference pass.

nvidia/Nemotron-3.5-ASR-Streaming-Multilingual-0.6b cover image
demoapi

8CDGt7cf

2026-06-03T12:07:11+00:00