Is GLM 4.5 FP8 or Qwen3 235B A22B Instruct 2507 cheaper?

Qwen3 235B A22B Instruct 2507 is cheaper on output tokens ($0.30 vs $0.80 per 1M).

Which has the larger context window, GLM 4.5 FP8 or Qwen3 235B A22B Instruct 2507?

Qwen3 235B A22B Instruct 2507 has the larger context window (262K tokens).

GLM 4.5 FP8 vs Qwen3 235B A22B Instruct 2507

Qwen3 235B A22B Instruct 2507 is cheaper on output tokens, while Qwen3 235B A22B Instruct 2507 offers a larger context window. Choose GLM 4.5 FP8 or Qwen3 235B A22B Instruct 2507 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	GLM 4.5 FP8	Qwen3 235B A22B Instruct 2507
Provider	submodel	submodel
Input / 1M tokens	$0.20	$0.20
Output / 1M tokens	$0.80	$0.30
Context window	131K	262K
Parameters	—	—
Open weights	Yes	Yes
Released	Jul 2025	Aug 2025

GLM 4.5 FP8 details →Qwen3 235B A22B Instruct 2507 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.