Llama 3.1 70B Instruct vs GPT OSS 120B

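GPT OSS 120B is cheaper on both input tokens ($0.03 vs. $0.40 per 1M) and output tokens ($0.15 vs. $0.40 per 1M), and both models offer the same 131K context window. Choose between Llama 3.1 70B Instruct and GPT OSS 120B based on cost and the benchmarks that matter for your use case.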

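| Spec | Llama 3.1 70B Instruct | GPT OSS 120B |
|---|---|---|
| Provider | OpenRouter | OpenRouter |
| Input / 1M tokens | $0.40 | $0.03 |
| Output / 1M tokens | $0.40 | $0.15 |
| Context window | 131K | 131K |
| Parameters | (not listed) | 117B |
| Open weights | Yes | Yes |
| Released | Jul 2024 | Aug 2025 |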
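
To make the price gap concrete, here is a minimal sketch that turns the per-1M-token prices listed above into a cost estimate for a single workload. The `estimate_cost` helper and the example token counts are hypothetical, chosen only for illustration; the prices are the indicative figures from this page and may not match what the provider currently charges.

```python
# Prices in USD per 1M tokens, copied from the comparison above (indicative only).
PRICES = {
    "Llama 3.1 70B Instruct": {"input": 0.40, "output": 0.40},
    "GPT OSS 120B": {"input": 0.03, "output": 0.15},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


# Assumed workload for illustration: 10M input tokens and 2M output tokens.
for model in PRICES:
    cost = estimate_cost(model, 10_000_000, 2_000_000)
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload the estimate comes to about $4.80 for Llama 3.1 70B Instruct and about $0.60 for GPT OSS 120B. The ratio shifts with the input/output mix, since the input price gap is larger than the output price gap.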

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.