DoublewordDoubleword

Model Name

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Qwen3 VL 30B A3B Instruct

  • Type: Generation
  • Capabilities: vision

Overview

Meet Qwen3-VL-30B, the smaller model of the Qwen3-VL family, delivering performance similar to GPT-4.1-mini and Claude Sonnet 4. This highly capable mid-size model is suited for tasks that are constrained or require high token volumes. Excels at reasoning, coding, and structured output generation.

Best for:

  • Production workloads requiring strong performance without frontier model costs
  • Complex reasoning tasks
  • Code generation

Pricing

PriorityInput Tokens (per 1M)Output Tokens (per 1M)
Realtime1$0.16$0.80
High (1h)$0.07$0.30
Standard (24h)$0.05$0.20

Playground

Open this model in the Playground.

Footnotes

  1. Realtime availability is limited. Doubleword is primarily a batch API.