Is Llama 3.2 3B Instruct or GLM 4.6 cheaper?

Llama 3.2 3B Instruct is cheaper on output tokens ($0.05 vs $2.20 per 1M).

Which has the larger context window, Llama 3.2 3B Instruct or GLM 4.6?

GLM 4.6 has the larger context window (205K tokens).

Llama 3.2 3B Instruct vs GLM 4.6

Llama 3.2 3B Instruct is cheaper on output tokens, while GLM 4.6 offers a larger context window. Choose Llama 3.2 3B Instruct or GLM 4.6 based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 3.2 3B Instruct	GLM 4.6
Provider	NovitaAI	NovitaAI
Input / 1M tokens	$0.03	$0.55
Output / 1M tokens	$0.05	$2.20
Context window	33K	205K
Parameters	—	357B
Open weights	Yes	Yes
Released	Sep 2024	Sep 2025

Llama 3.2 3B Instruct details →GLM 4.6 details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.