DeepInfra raises $107M Series B to scale the inference cloud — read the announcement
Qwen/
$0.01
in
$0.05
out
/ 1M tokens
Qwen3.5-0.8B is Alibaba's smallest model in the Qwen3.5 series, featuring a hybrid Gated Delta Networks and sparse Mixture-of-Experts architecture. Despite its compact size, it supports a 262K token context window, 201 languages, thinking/reasoning mode, and tool calling. Ideal for edge deployments, resource-constrained environments, and lightweight inference tasks.

© 2026 DeepInfra. All rights reserved.