Llama 3.3 70B Instruct FP8 Fast vs GPT OSS 20B

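GPT OSS 20B is cheaper on both input and output tokens, with the larger gap on output ($0.30 vs. $2.25 per 1M tokens). Both models list the same 128K context window, so choose between Llama 3.3 70B Instruct FP8 Fast and GPT OSS 20B based on cost and the benchmarks that matter for your use case.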

Spec                 Llama 3.3 70B Instruct FP8 Fast   GPT OSS 20B
Provider             Cloudflare AI Gateway             Cloudflare AI Gateway
Input / 1M tokens    $0.29                             $0.20
Output / 1M tokens   $2.25                             $0.30
Context window       128K                              128K
Parameters           -                                 21B
Open weights         No                                No
Released             Apr 2025                          Aug 2025
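
To see how the per-token prices translate into a bill, here is a minimal Python sketch that estimates cost for a given token volume. The prices are copied from the table above; the function name `estimate_cost` and the workload of 10M input and 2M output tokens are hypothetical, chosen only for illustration.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "Llama 3.3 70B Instruct FP8 Fast": {"input": 0.29, "output": 2.25},
    "GPT OSS 20B": {"input": 0.20, "output": 0.30},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10M input tokens and 2M output tokens.
workload = {"input_tokens": 10_000_000, "output_tokens": 2_000_000}

for model in PRICES:
    print(f"{model}: ${estimate_cost(model, **workload):.2f}")
```

Under these assumed volumes the estimate comes to about $7.40 for Llama 3.3 70B Instruct FP8 Fast and $2.60 for GPT OSS 20B. Most of the gap comes from output pricing, so the difference widens for workloads that generate more output.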

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.