Is L3 8B Stheno V3.2 or GLM 4.6 cheaper?

L3 8B Stheno V3.2 is cheaper on output tokens ($0.05 vs $2.20 per 1M).

Which has the larger context window, L3 8B Stheno V3.2 or GLM 4.6?

GLM 4.6 has the larger context window (205K tokens).

L3 8B Stheno V3.2 vs GLM 4.6

L3 8B Stheno V3.2 is cheaper on output tokens, while GLM 4.6 offers a larger context window. Choose L3 8B Stheno V3.2 or GLM 4.6 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	L3 8B Stheno V3.2	GLM 4.6
Provider	NovitaAI	NovitaAI
Input / 1M tokens	$0.05	$0.55
Output / 1M tokens	$0.05	$2.20
Context window	8K	205K
Parameters	—	357B
Open weights	Yes	Yes
Released	Nov 2024	Sep 2025

L3 8B Stheno V3.2 details →GLM 4.6 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.