Is Llama 3.1 8B Instruct fp8 or GPT OSS 20B cheaper?

Llama 3.1 8B Instruct fp8 is cheaper on both input tokens ($0.15 vs $0.20 per 1M) and output tokens ($0.29 vs $0.30 per 1M).

Which has the larger context window, Llama 3.1 8B Instruct fp8 or GPT OSS 20B?

GPT OSS 20B has the larger context window (128K tokens vs 32K tokens).

Llama 3.1 8B Instruct fp8 vs GPT OSS 20B

Llama 3.1 8B Instruct fp8 is cheaper on both input and output tokens, while GPT OSS 20B offers a larger context window (128K vs 32K). Choose between them based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 3.1 8B Instruct fp8	GPT OSS 20B
Provider	Cloudflare Workers AI	Cloudflare Workers AI
Input / 1M tokens	$0.15	$0.20
Output / 1M tokens	$0.29	$0.30
Context window	32K	128K
Parameters	—	21B
Open weights	Yes	Yes
Released	Jul 2024	Aug 2025
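
The per-token prices in the table can be turned into a rough monthly estimate. The following is a minimal Python sketch, assuming the indicative prices listed above and a hypothetical workload of 50M input tokens and 10M output tokens; the function name `estimate_cost` and the workload figures are illustrative and do not come from the provider.

```python
# Indicative prices in USD per 1M tokens, copied from the spec table above.
PRICES = {
    "Llama 3.1 8B Instruct fp8": {"input": 0.15, "output": 0.29},
    "GPT OSS 20B": {"input": 0.20, "output": 0.30},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    # Hypothetical workload (an assumption for illustration only).
    input_tokens, output_tokens = 50_000_000, 10_000_000
    for model in PRICES:
        cost = estimate_cost(model, input_tokens, output_tokens)
        print(f"{model}: ${cost:.2f}")
```

For this assumed workload the sketch gives about $10.40 for Llama 3.1 8B Instruct fp8 and $13.00 for GPT OSS 20B. Most of the gap comes from the input price, so the difference grows with prompt-heavy traffic.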



Pricing is indicative; confirm with the provider before production use.