nvidia/
$0.06 in / $0.24 out per 1M tokens
NVIDIA Nemotron 3 Nano is an open reasoning model optimized for fast, cost-efficient inference. Built with a hybrid MoE and Mamba architecture and trained on NVIDIA-curated synthetic reasoning data, it delivers strong multi-step reasoning with stable latency and predictable performance for agentic and production workloads.
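
As a rough illustration of how the listed per-token pricing translates into request cost, here is a minimal sketch. It assumes the rates shown on this page ($0.06 per 1M input tokens, $0.24 per 1M output tokens) and simple linear billing; the function name `estimate_cost` and the constants are hypothetical and not part of any Deep Infra API, and actual billing may differ.

```python
from decimal import Decimal

# Prices taken from the listing above (USD per 1M tokens).
# Linear billing with no minimums or discounts is an assumption.
INPUT_PRICE_PER_M = Decimal("0.06")
OUTPUT_PRICE_PER_M = Decimal("0.24")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a request with 2,000 prompt tokens and 500 completion tokens.
    print(estimate_cost(2_000, 500))
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens costs about $0.00024, with input and output contributing equally because output tokens are priced at four times the input rate.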

© 2025 Deep Infra. All rights reserved.