Is Llama 4 Maverick 17B FP8 or Qwen 3.5 397B A17B cheaper?

Llama 4 Maverick 17B FP8 is cheaper on output tokens ($0.60 vs $3.00 per 1M).

Which has the larger context window, Llama 4 Maverick 17B FP8 or Qwen 3.5 397B A17B?

Llama 4 Maverick 17B FP8 has the larger context window (1M tokens).

Llama 4 Maverick 17B FP8 vs Qwen 3.5 397B A17B

Llama 4 Maverick 17B FP8 is cheaper on output tokens, while Llama 4 Maverick 17B FP8 offers a larger context window. Choose Llama 4 Maverick 17B FP8 or Qwen 3.5 397B A17B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 4 Maverick 17B FP8	Qwen 3.5 397B A17B
Provider	Deep Infra	Deep Infra
Input / 1M tokens	$0.15	$0.45
Output / 1M tokens	$0.60	$3.00
Context window	1M	262K
Parameters	—	397B
Open weights	Yes	Yes
Released	Apr 2025	Feb 2026

Llama 4 Maverick 17B FP8 details →Qwen 3.5 397B A17B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.