nvidia/
$0.10 in / $0.50 out / $0.04 cached, per 1M tokens
NVIDIA Nemotron 3 Super is a hybrid Mixture-of-Experts (MoE) model engineered for high compute efficiency and accuracy in multi-agent applications and specialized agentic systems. It is optimized to run many collaborating agents per application on a single GPU, delivering high accuracy for reasoning, tool use, and instruction following.
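
As a rough illustration of how the per-1M-token prices listed above translate into a request cost, here is a minimal sketch. The helper name `estimate_cost` is hypothetical, and it assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate; the actual billing rules may differ.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICE_IN = 0.10
PRICE_OUT = 0.50
PRICE_CACHED = 0.04
TOKENS_PER_UNIT = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Hypothetical helper: estimate the cost of a request in USD.

    Assumption (not confirmed by the listing): cached_tokens counts the
    portion of input_tokens served from cache, billed at the cached rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_IN
        + cached_tokens * PRICE_CACHED
        + output_tokens * PRICE_OUT
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 2M input tokens (1M of them cached) and 500k output tokens.
    print(f"${estimate_cost(2_000_000, 500_000, cached_tokens=1_000_000):.2f}")
```

Under these assumptions, output tokens dominate the cost for generation-heavy workloads, while a high cache-hit ratio mainly reduces the cost of long, repeated prompts.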

© 2026 Deep Infra. All rights reserved.