Llama 3.2 3B Instruct vs QwQ 32B

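Llama 3.2 3B Instruct is cheaper on both input tokens ($0.05 vs $0.66 per 1M) and output tokens ($0.34 vs $1.00 per 1M). Both models list the same 128K context window, so the choice between Llama 3.2 3B Instruct and QwQ 32B comes down to the trade-off between cost and the benchmarks that matter for your use case.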

Spec                 Llama 3.2 3B Instruct    QwQ 32B
Provider             Cloudflare AI Gateway    Cloudflare AI Gateway
Input / 1M tokens    $0.05                    $0.66
Output / 1M tokens   $0.34                    $1.00
Context window       128K                     128K
Parameters           3B                       32B
Open weights         No                       No
Released             Apr 2025                 Apr 2025
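
To make the price gap concrete, the per-token rates in the table can be turned into a rough workload estimate. The sketch below is illustrative only: the `PRICES` dictionary, the `estimate_cost` helper, and the monthly token volumes are assumptions introduced here, and the rates are the indicative figures from the table rather than confirmed provider pricing.

```python
# Indicative prices in USD per 1M tokens, copied from the comparison table.
PRICES = {
    "Llama 3.2 3B Instruct": {"input": 0.05, "output": 0.34},
    "QwQ 32B": {"input": 0.66, "output": 1.00},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a given token volume.

    Hypothetical helper: it only applies the flat per-1M-token rates above
    and ignores any discounts, minimums, or gateway fees.
    """
    rates = PRICES[model]
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 10M input tokens and 2M output tokens per month.
    for name in PRICES:
        cost = estimate_cost(name, 10_000_000, 2_000_000)
        print(f"{name}: ${cost:.2f}")
```

For that assumed workload the estimate is about $1.18 for Llama 3.2 3B Instruct and $8.60 for QwQ 32B. Actual token counts can differ between models for the same task, so a real comparison should use measured usage.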

Pricing is indicative; confirm with the provider before production use. Highlighted values indicate the better figure for that row.