Qwen3.5 397B A17B FP8 vs GLM 5 Fast

GLM 5 Fast is cheaper on output tokens ($3.60 vs. $4.14 per 1M), while Qwen3.5 397B A17B FP8 is cheaper on input tokens ($0.69 vs. $1.10 per 1M) and offers a larger context window (262K vs. 203K). Choose between them based on the trade-off between cost, context, and the benchmarks that matter for your use case.

| Spec | Qwen3.5 397B A17B FP8 | GLM 5 Fast |
| --- | --- | --- |
| Provider | Neuralwatt | Neuralwatt |
| Input / 1M tokens | $0.69 | $1.10 |
| Output / 1M tokens | $4.14 | $3.60 |
| Context window | 262K | 203K |
| Parameters | | |
| Open weights | Yes | Yes |
| Released | Feb 2026 | Apr 2026 |
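
Because each model is cheaper on a different side of the request, which one costs less depends on the mix of input and output tokens in your workload. The following is a minimal Python sketch of that arithmetic, using only the list prices from the table above. The names `PRICES`, `workload_cost`, and `break_even_output_ratio` are hypothetical helpers introduced for illustration; they are not part of any provider API, and real bills may differ (caching, discounts, or rounding are not modeled).

```python
# Indicative prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "Qwen3.5 397B A17B FP8": {"input": 0.69, "output": 4.14},
    "GLM 5 Fast": {"input": 1.10, "output": 3.60},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD for a workload, assuming simple per-token pricing."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def break_even_output_ratio() -> float:
    """Output-to-input token ratio at which both models cost the same.

    Solves q_in + r * q_out == g_in + r * g_out for r.
    """
    q = PRICES["Qwen3.5 397B A17B FP8"]
    g = PRICES["GLM 5 Fast"]
    return (g["input"] - q["input"]) / (q["output"] - g["output"])


if __name__ == "__main__":
    ratio = break_even_output_ratio()
    print(f"Break-even output/input ratio: {ratio:.2f}")
    for name in PRICES:
        # Hypothetical workload: 10M input tokens, 2M output tokens.
        print(name, round(workload_cost(name, 10_000_000, 2_000_000), 2))
```

Under these assumptions the break-even point is about 0.76 output tokens per input token: workloads that generate fewer output tokens than that relative to their input favor Qwen3.5 397B A17B FP8 on price, while more output-heavy workloads favor GLM 5 Fast.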

Pricing is indicative; confirm with the provider before production use. Highlighted values indicate the better figure for that row.