We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

Nemotron 3 Nano Omni — the first multimodal model in the Nemotron 3 family, now on DeepInfra!

nvidia logo

nvidia/

Nemotron-3-Nano-Omni-30B-A3B-Reasoning

$0.20

in

$0.80

out

/ 1M tokens

Nemotron 3 Nano Omni is an open multimodal model built on a hybrid Mixture-of-Experts (MoE) architecture, engineered for high efficiency and strong accuracy across image, video, audio, and text inputs. It powers always-on sub-agents for computer use, document intelligence, and audio-video understanding—replacing fragmented vision, speech, and language pipelines with a single unified inference pass.

Deploy Private Endpoint
Public
bfloat16
262,144
JSON
Function
Multimodal
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning cover image
demoapi

1dRGuMiK

2026-04-28T05:29:29+00:00