We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

🚀 New model available: DeepSeek-V3.1 🚀

nvidia logo

nvidia/

Llama-3.3-Nemotron-Super-49B-v1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a large language model (LLM) optimized for advanced reasoning, conversational interactions, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta's Llama-3.3-70B-Instruct, it employs a Neural Architecture Search (NAS) approach, significantly enhancing efficiency and reducing memory requirements.

Deploy Private Endpoint
Public
$0.10/$0.40 in/out Mtoken
fp8
131,072
Function
Project
nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 cover image