Is DeepSeek R1 0528 or Qwen3 235B A22B Instruct 2507 cheaper?

Qwen3 235B A22B Instruct 2507 is cheaper on both input tokens ($0.20 vs $0.50 per 1M) and output tokens ($0.30 vs $2.15 per 1M).

Which has the larger context window, DeepSeek R1 0528 or Qwen3 235B A22B Instruct 2507?

Qwen3 235B A22B Instruct 2507 has the larger context window (262K vs 75K tokens).

DeepSeek R1 0528 vs Qwen3 235B A22B Instruct 2507

Qwen3 235B A22B Instruct 2507 is cheaper on both input and output tokens and also offers the larger context window. On the listed specs it leads in every row, so the choice between DeepSeek R1 0528 and Qwen3 235B A22B Instruct 2507 comes down to the benchmarks that matter for your use case.

Spec	DeepSeek R1 0528	Qwen3 235B A22B Instruct 2507
Provider	submodel	submodel
Input / 1M tokens	$0.50	$0.20
Output / 1M tokens	$2.15	$0.30
Context window	75K	262K
Parameters	—	—
Open weights	No	Yes
Released	Aug 2025	Aug 2025
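
To make the price gap concrete, the per-1M-token figures in the table can be turned into a rough cost estimate. The sketch below is illustrative only: the function name `estimate_cost` and the workload size (10M input tokens, 2M output tokens) are assumptions, and the prices are the indicative values listed above.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "DeepSeek R1 0528": {"input": 0.50, "output": 2.15},
    "Qwen3 235B A22B Instruct 2507": {"input": 0.20, "output": 0.30},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a given token volume."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10M input tokens and 2M output tokens.
for model in PRICES:
    cost = estimate_cost(model, 10_000_000, 2_000_000)
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload the estimate comes to about $9.30 for DeepSeek R1 0528 and about $2.60 for Qwen3 235B A22B Instruct 2507. Output-heavy workloads widen the gap, since the output price differs by a larger factor than the input price.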



Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.