Qwen3 30B is a budget large language model from qwen, priced at $0.1/1M input tokens and $0.15/1M output tokens. It is 96% cheaper than the market average and best suited for everyday budget tasks. The 32k context window covers most standard production workloads.

For most production workloads, the cost breakdown is dominated by input tokens (system prompts, context, retrieved documents) rather than output. At this price point, Qwen3 30B is one of the most cost-effective options for high-volume tasks.

Frequently Asked Questions

How much does Qwen3 30B cost per 1,000 tokens?

Qwen3 30B costs $0.0001 per 1,000 input tokens and $0.0001 per 1,000 output tokens.

What is Qwen3 30B's context window?

Qwen3 30B supports a context window of 32k tokens, which is suitable for standard use cases and moderate-length conversations.

How does Qwen3 30B compare to GPT-4o on price?

Qwen3 30B is 96% cheaper than the market average on input tokens. At $0.1/1M input vs $2.50/1M for GPT-4o, the cost difference becomes significant at scale — 10,000 requests/day with 1,000 input tokens each costs $30/month with Qwen3 30B vs $750/month with GPT-4o.

Compare Qwen3 30B with other models

Qwen3 30B vs GPT-4.1 Nano Qwen3 30B vs Gemini 2.5 Flash-Lite Qwen3 30B vs Gemini 2.0 Flash Qwen3 30B vs Mistral Small 3.1 Qwen3 30B vs Mistral Nemo Qwen3 30B vs Gemini 3 Flash-Lite