qwen

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Input Cost

$0.10

per 1M tokens

Output Cost

$0.42

per 1M tokens

Context Window

131,072

tokens

Compare vs GPT-4o

                Developer ID: qwen/qwen3-vl-32b-instruct            

Related Models

qwen

$0.20/1M

Qwen: Qwen3 Coder Flash

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 ...

📝 1,000,000 ctx Compare →

qwen

$0.20/1M

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text ge...

📝 262,144 ctx Compare →

qwen

$0.16/1M

Qwen: Qwen3.5-35B-A3B

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid archit...

📝 262,144 ctx Compare →