We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

🚀 New models by Bria.ai, generate and edit images at scale 🚀

nvidia logo

nvidia/

Llama-3.3-Nemotron-Super-49B-v1.5

$0.10

in

$0.40

out

Llama-3.3-Nemotron-Super-49B-v1.5 is a large language model (LLM) optimized for advanced reasoning, conversational interactions, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta's Llama-3.3-70B-Instruct, it employs a Neural Architecture Search (NAS) approach, significantly enhancing efficiency and reducing memory requirements.

Deploy Private Endpoint
Public
fp8
131,072
JSON
Function
ProjectNemotron
nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 cover image
demoapi

BUnlCImb

2025-09-09T21:56:23+00:00