Is Llama 3.1 8B Instruct FP8 or GPT-5.2 Codex cheaper?

Llama 3.1 8B Instruct FP8 is cheaper on output tokens ($0.29 vs $14.00 per 1M).

Which has the larger context window, Llama 3.1 8B Instruct FP8 or GPT-5.2 Codex?

GPT-5.2 Codex has the larger context window (400K tokens).

Llama 3.1 8B Instruct FP8 vs GPT-5.2 Codex

Llama 3.1 8B Instruct FP8 is cheaper on output tokens, while GPT-5.2 Codex offers a larger context window. Choose Llama 3.1 8B Instruct FP8 or GPT-5.2 Codex based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 3.1 8B Instruct FP8	GPT-5.2 Codex
Provider	Cloudflare AI Gateway	Cloudflare AI Gateway
Input / 1M tokens	$0.15	$1.75
Output / 1M tokens	$0.29	$14.00
Context window	128K	400K
Parameters	—	—
Open weights	No	No
Released	Apr 2025	Dec 2025

Llama 3.1 8B Instruct FP8 details →GPT-5.2 Codex details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.