Model Name
Qwen/Qwen3-VL-30B-A3B-Instruct-FP8Qwen3 VL 30B A3B Instruct
- Type: Generation
- Capabilities:
vision
Overview
Meet Qwen3-VL-30B, the smaller model of the Qwen3-VL family, delivering performance similar to GPT-4.1-mini and Claude Sonnet 4. This highly capable mid-size model is suited for tasks that are constrained or require high token volumes. Excels at reasoning, coding, and structured output generation.
Best for:
- Production workloads requiring strong performance without frontier model costs
- Complex reasoning tasks
- Code generation
Pricing
| Priority | Input Tokens (per 1M) | Output Tokens (per 1M) |
|---|---|---|
| Realtime1 | $0.16 | $0.80 |
| High (1h) | $0.07 | $0.30 |
| Standard (24h) | $0.05 | $0.20 |
Playground
Open this model in the Playground.
Footnotes
-
Realtime availability is limited. Doubleword is primarily a batch API. ↩