


Qwen/Qwen3.5-9B

$0.04 in / $0.15 out per 1M tokens

Qwen3.5-9B is a high-performance model from Alibaba's Qwen3.5 series, built on a hybrid architecture that combines Gated Delta Networks with sparse mixture-of-experts (MoE). It features a 262K-token context window, a thinking/reasoning mode, tool calling, multi-token prediction, and support for 201 languages. It excels at reasoning, coding, instruction following, and long-context tasks.
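To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the prices and context length listed on this page. The function names are hypothetical, and the sketch assumes that prompt and completion tokens share the same 262,144-token window and that billing is strictly linear in token counts; actual invoices may differ.

```python
# Values copied from this page; everything else is illustrative.
INPUT_PRICE_PER_M = 0.04    # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.15   # USD per 1M output tokens
CONTEXT_WINDOW = 262_144    # tokens


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the context window.

    Assumption: input and output tokens count against the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    # A long-context request: 200K prompt tokens, 4K completion tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")  # prints $0.0086
    print(fits_context(200_000, 4_000))                 # prints True
```

Under these assumptions, a request that nearly fills the context window costs under one cent, with the output side priced at almost four times the input side per token.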

Public · bfloat16 · 262,144-token context · JSON · Function calling · Multimodal