Qwen/
$0.01 in / $0.05 out per 1M tokens
Qwen3.5-0.8B is Alibaba's smallest model in the Qwen3.5 series. It uses a hybrid architecture that combines Gated Delta Networks with sparse Mixture-of-Experts. Despite its compact size, it supports a 262K-token context window, 201 languages, a thinking/reasoning mode, and tool calling. It is well suited to edge deployments, resource-constrained environments, and lightweight inference tasks.
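
For budgeting, the listed per-token prices can be turned into a rough per-request estimate. The sketch below is illustrative only: the helper name `estimate_cost` is hypothetical, the prices are copied from this listing and may change, and treating "262K" as 262,144 tokens is an assumption.

```python
# Prices copied from the listing, in USD per 1M tokens (may change over time).
INPUT_PRICE_PER_M = 0.01
OUTPUT_PRICE_PER_M = 0.05


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper, not an official API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Assumption: a "262K" context window means 262,144 tokens.
    full_context = 262_144
    cost = estimate_cost(input_tokens=full_context, output_tokens=2_000)
    print(f"Estimated cost: ${cost:.6f}")
```

At these rates, even a prompt that fills the whole context window costs well under one cent, so output length and request volume are likely to matter more to the total bill than prompt size.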

2fc06364715b967f1860aea9cf38778875588b17
2026-03-20T23:26:37+00:00
© 2026 Deep Infra. All rights reserved.