Is Llama 3.1 70B Instruct or GPT OSS 120B cheaper?

GPT OSS 120B is cheaper on both input tokens ($0.03 vs $0.40 per 1M) and output tokens ($0.15 vs $0.40 per 1M).

Which has the larger context window, Llama 3.1 70B Instruct or GPT OSS 120B?

Both models list the same context window: 131K tokens.

Llama 3.1 70B Instruct vs GPT OSS 120B

GPT OSS 120B is cheaper on both input and output tokens, and the two models list the same 131K context window. Choose between Llama 3.1 70B Instruct and GPT OSS 120B based on the trade-off between cost and the benchmarks that matter for your use case.

Spec	Llama 3.1 70B Instruct	GPT OSS 120B
Provider	OpenRouter	OpenRouter
Input / 1M tokens	$0.40	$0.03
Output / 1M tokens	$0.40	$0.15
Context window	131K	131K
Parameters	70B	117B
Open weights	Yes	Yes
Released	Jul 2024	Aug 2025
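
To see how the per-token prices translate into a bill, here is a minimal Python sketch that estimates workload cost from the table's figures. The `PRICES` dictionary, the `request_cost` helper, and the monthly token volumes are illustrative assumptions, not part of any provider API, and the prices are the indicative values listed above.

```python
# Prices in USD per 1M tokens, copied from the table above (indicative only).
PRICES = {
    "Llama 3.1 70B Instruct": {"input": 0.40, "output": 0.40},
    "GPT OSS 120B": {"input": 0.03, "output": 0.15},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload for one model (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


# Assumed workload: 10M input tokens and 2M output tokens per month.
for model in PRICES:
    cost = request_cost(model, 10_000_000, 2_000_000)
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload the estimate comes to about $4.80 for Llama 3.1 70B Instruct and about $0.60 for GPT OSS 120B. The gap widens on input-heavy workloads, since the input price differs more than the output price.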



Pricing is indicative; confirm with the provider before production use.