qwen
Qwen: Qwen3 VL 32B Instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Input Cost
$0.10
per 1M tokens
Output Cost
$0.42
per 1M tokens
Context Window
131,072
tokens
Developer ID: qwen/qwen3-vl-32b-instruct
Related Models
qwen
$0.20/1M
Qwen: Qwen3 Coder Flash
Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 ...
qwen
$0.20/1M
Qwen: Qwen3 VL 235B A22B Instruct
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text ge...
qwen
$0.16/1M
Qwen: Qwen3.5-35B-A3B
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid archit...