Is Nemotron Cascade 2 30B A3B or GLM 5 cheaper?

Nemotron Cascade 2 30B A3B is cheaper on output tokens ($0.80 vs $3.20 per 1M).

Which has the larger context window, Nemotron Cascade 2 30B A3B or GLM 5?

Nemotron Cascade 2 30B A3B has the larger context window (256K tokens).

Nemotron Cascade 2 30B A3B vs GLM 5

Nemotron Cascade 2 30B A3B is cheaper on output tokens, while Nemotron Cascade 2 30B A3B offers a larger context window. Choose Nemotron Cascade 2 30B A3B or GLM 5 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Nemotron Cascade 2 30B A3B	GLM 5
Provider	Venice AI	Venice AI
Input / 1M tokens	$0.14	$1.00
Output / 1M tokens	$0.80	$3.20
Context window	256K	198K
Parameters	—	744B
Open weights	Yes	Yes
Released	Mar 2026	Feb 2026

Nemotron Cascade 2 30B A3B details →GLM 5 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.