We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic…

Nemotron 3 Nano Omni — the first multimodal model in the Nemotron 3 family, now on DeepInfra!

Qwen logo

Qwen/

Qwen3.5-122B-A10B

$0.29

in

$2.40

out

/ 1M tokens

Qwen3.5-122B-A10B is a large Mixture-of-Experts model from Alibaba's Qwen3.5 series with 122B total parameters and 10B activated per token. It features a 262K token context window (extensible to 1M with YaRN), thinking/reasoning mode, tool calling, and support for 201 languages. Excels at complex reasoning, coding, multimodal understanding, and agentic tasks with the efficiency of sparse activation.

Deploy Private Endpoint
Public
fp8
262,144
JSON
Function
Multimodal
ProjectPaperLicense
Qwen/Qwen3.5-122B-A10B cover image