We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

🚀 New models by Bria.ai, generate and edit images at scale 🚀

nvidia logo

nvidia/

NVIDIA-Nemotron-Nano-12B-v2-VL

$0.20

in

$0.60

out

The model is an auto-regressive vision language model that uses an optimized transformer architecture. The model enables multi-image reasoning and video understanding, along with strong document intelligence, visual Q&A and summarization capabilities.

Deploy Private Endpoint
Public
fp8
131,072
Function
Multimodal
Project
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL cover image
demoapi

9rfoqv3r

2025-10-28T19:24:26+00:00