Is Qwen3.5 397B A17B Thinking or GLM 4.5 cheaper?

GLM 4.5 is cheaper on both input tokens ($0.30 vs $0.60 per 1M) and output tokens ($1.30 vs $3.60 per 1M).

Which has the larger context window, Qwen3.5 397B A17B Thinking or GLM 4.5?

Qwen3.5 397B A17B Thinking has the larger context window (258K tokens vs 128K for GLM 4.5).

Qwen3.5 397B A17B Thinking vs GLM 4.5

GLM 4.5 is cheaper on both input and output tokens, while Qwen3.5 397B A17B Thinking offers roughly twice the context window (258K vs 128K). Choose between them based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3.5 397B A17B Thinking	GLM 4.5
Provider	NanoGPT	NanoGPT
Input / 1M tokens	$0.60	$0.30
Output / 1M tokens	$3.60	$1.30
Context window	258K	128K
Parameters	—	355B
Open weights	No	No
Released	Feb 2026	Apr 2025
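
To make the price gap concrete, the per-1M-token figures in the table above can be turned into a per-request cost. The following is a minimal sketch, not a billing tool: the `PRICES` dictionary, the `request_cost` helper, and the example workload of 20,000 input and 2,000 output tokens are all hypothetical, and the prices are the indicative values from the table.

```python
# Indicative prices in USD per 1M tokens, copied from the table above.
# The dictionary layout and helper name are illustrative assumptions.
PRICES = {
    "Qwen3.5 397B A17B Thinking": {"input": 0.60, "output": 3.60},
    "GLM 4.5": {"input": 0.30, "output": 1.30},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given model."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    # Hypothetical workload: 20,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        cost = request_cost(model, 20_000, 2_000)
        print(f"{model}: ${cost:.4f} per request")
```

Under these assumptions the example request comes to about $0.0192 on Qwen3.5 397B A17B Thinking and about $0.0086 on GLM 4.5. The gap widens as the share of output tokens grows, since output is where the per-token prices differ most.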


FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.