DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
Qwen/
$0.04
in
$0.15
out
/ 1M tokens
Qwen3.5-9B is a high-performance model from Alibaba's Qwen3.5 series with a hybrid Gated Delta Networks and sparse MoE architecture. It features a 262K token context window, thinking/reasoning mode, tool calling, multi-token prediction, and support for 201 languages. Excels at reasoning, coding, instruction following, and long-context tasks.

© 2026 DeepInfra. All rights reserved.