We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

🚀 New model available: DeepSeek-V3.1 🚀

nvidia logo

nvidia/

NVIDIA-Nemotron-Nano-9B-v2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Deploy Private Endpoint
Public
$0.04/$0.16 in/out Mtoken
bfloat16
131,072
Function
Project
nvidia/NVIDIA-Nemotron-Nano-9B-v2 cover image
demoapi

OHN3ULM0

2025-09-09T17:16:35+00:00