Qwen3 VL 30B A3B Instruct

Type: Generation
Capabilities: vision

Overview

Meet Qwen3-VL-30B, the smaller model of the Qwen3-VL family, delivering performance similar to GPT-4.1-mini and Claude Sonnet 4. This highly capable mid-size model is suited to cost-constrained tasks and to workloads that require high token volumes. It excels at reasoning, coding, and structured output generation.

Best for:

Production workloads requiring strong performance without frontier model costs
Complex reasoning tasks
Code generation

Pricing

Priority	Input Tokens (per 1M)	Output Tokens (per 1M)
Realtime¹	$0.16	$0.80
Async	$0.07	$0.30
Batch (24h)	$0.05	$0.20
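
To compare the three priorities for a given workload, the per-million-token rates above can be turned into a quick estimate. The following is a minimal sketch, not part of any Doubleword SDK: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced here for illustration, and the rates are copied from the table above.

```python
# Rates in USD per 1M tokens, copied from the pricing table above.
# PRICES and estimate_cost are illustrative names, not an official API.
PRICES = {
    "realtime": {"input": 0.16, "output": 0.80},
    "async": {"input": 0.07, "output": 0.30},
    "batch": {"input": 0.05, "output": 0.20},
}


def estimate_cost(priority: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload at the given priority."""
    rates = PRICES[priority]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 10M input tokens and 2M output tokens.
    for priority in PRICES:
        cost = estimate_cost(priority, 10_000_000, 2_000_000)
        print(f"{priority}: ${cost:.2f}")
```

For this example workload the estimate comes to roughly $3.20 at Realtime, $1.30 at Async, and $0.90 at Batch (24h) priority.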

Playground

Open this model in the Playground.

¹ Realtime availability is limited. Doubleword is primarily a batch API.
