Is Llama 3.2 3B Instruct or GPT OSS 120B cheaper?

Llama 3.2 3B Instruct is cheaper on both input tokens ($0.05 vs $0.35 per 1M) and output tokens ($0.34 vs $0.75 per 1M).

Which has the larger context window, Llama 3.2 3B Instruct or GPT OSS 120B?

GPT OSS 120B has the larger context window (128K tokens vs 80K tokens).

Llama 3.2 3B Instruct vs GPT OSS 120B

Llama 3.2 3B Instruct is cheaper on both input and output tokens, while GPT OSS 120B offers a larger context window (128K vs 80K tokens). Choose between them based on the trade-off between cost, context length, and the benchmarks that matter for your use case.

Spec	Llama 3.2 3B Instruct	GPT OSS 120B
Provider	Cloudflare Workers AI	Cloudflare Workers AI
Input / 1M tokens	$0.05	$0.35
Output / 1M tokens	$0.34	$0.75
Context window	80K	128K
Parameters	—	117B
Open weights	Yes	Yes
Released	Sep 2024	Aug 2025
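
To make the price gap concrete, here is a minimal sketch that estimates the cost of a workload from the per-1M-token prices in the table above. The `PRICES` dictionary, the `estimate_cost` helper, and the sample token counts are hypothetical names and values chosen for illustration; they are not part of any provider API, and the prices are only the indicative figures listed here.

```python
# Indicative prices in USD per 1M tokens, copied from the table above.
# Confirm current pricing with the provider before relying on these.
PRICES = {
    "Llama 3.2 3B Instruct": {"input": 0.05, "output": 0.34},
    "GPT OSS 120B": {"input": 0.35, "output": 0.75},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    # Assumed example workload: 2M input tokens and 500K output tokens.
    for model in PRICES:
        cost = estimate_cost(model, input_tokens=2_000_000, output_tokens=500_000)
        print(f"{model}: ${cost:.3f}")
```

For this assumed workload the estimate comes to roughly $0.27 for Llama 3.2 3B Instruct and about $1.08 for GPT OSS 120B, so the ratio depends on how input-heavy or output-heavy your traffic is.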



Pricing is indicative; confirm with the provider before production use.