Qwen3-VL-235B-A22B-Instruct
Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date. This generation delivers comprehensive upgrades across the board: superior text understanding & generation, deeper visual perception & reasoning, extended context length, enhanced spatial and video dynamics comprehension, and stronger agent interaction capabilities.
fp8
256k
$0.11 cached, $0.20 in, $0.88 out / 1M