We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

GLM-5.1 - state-of-the-art agentic engineering, now available on DeepInfra!

Qwen logo

Qwen/

Qwen3.5-35B-A3B

$0.20

in

$0.95

out

$0.10

cached

/ 1M tokens

Qwen3.5-35B-A3B is an efficient Mixture-of-Experts model from Alibaba's Qwen3.5 series with 35B total parameters and only 3B activated per token. It features a 262K token context window (extensible to 1M with YaRN), thinking/reasoning mode, tool calling, and support for 201 languages. Delivers strong performance on reasoning, coding, and vision-language tasks at a fraction of the compute cost.

Deploy Private Endpoint
Public
fp8
262,144
JSON
Function
Multimodal
ProjectPaperLicense
Qwen/Qwen3.5-35B-A3B cover image