NVIDIA Nemotron 3 Super - blazing-fast agentic AI, ready to deploy today!
Qwen/
$0.01
in
$0.05
out
/ 1M tokens
Qwen3.5-0.8B is Alibaba's smallest model in the Qwen3.5 series, featuring a hybrid Gated Delta Networks and sparse Mixture-of-Experts architecture. Despite its compact size, it supports a 262K token context window, 201 languages, thinking/reasoning mode, and tool calling. Ideal for edge deployments, resource-constrained environments, and lightweight inference tasks.

© 2026 Deep Infra. All rights reserved.