We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

NVIDIA Nemotron 3 Super - blazing-fast agentic AI, ready to deploy today!

Qwen logo

Qwen/

Qwen3.5-0.8B

$0.01

in

$0.05

out

/ 1M tokens

Qwen3.5-0.8B is Alibaba's smallest model in the Qwen3.5 series, featuring a hybrid Gated Delta Networks and sparse Mixture-of-Experts architecture. Despite its compact size, it supports a 262K token context window, 201 languages, thinking/reasoning mode, and tool calling. Ideal for edge deployments, resource-constrained environments, and lightweight inference tasks.

Deploy Private Endpoint
Public
fp8
262,144
Function
Multimodal
ProjectPaperLicense
Qwen/Qwen3.5-0.8B cover image
demoapi

2fc06364715b967f1860aea9cf38778875588b17

2026-03-20T23:26:37+00:00