Is Qwen3.5 397B A17B FP8 or GLM 5 Fast cheaper?

GLM 5 Fast is cheaper on output tokens ($3.60 vs $4.14 per 1M).

Which has the larger context window, Qwen3.5 397B A17B FP8 or GLM 5 Fast?

Qwen3.5 397B A17B FP8 has the larger context window (262K tokens).

Qwen3.5 397B A17B FP8 vs GLM 5 Fast

GLM 5 Fast is cheaper on output tokens, while Qwen3.5 397B A17B FP8 offers a larger context window. Choose Qwen3.5 397B A17B FP8 or GLM 5 Fast based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3.5 397B A17B FP8	GLM 5 Fast
Provider	Neuralwatt	Neuralwatt
Input / 1M tokens	$0.69	$1.10
Output / 1M tokens	$4.14	$3.60
Context window	262K	203K
Parameters	—	—
Open weights	Yes	Yes
Released	Feb 2026	Apr 2026

Qwen3.5 397B A17B FP8 details →GLM 5 Fast details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.