Is Qwen3 235B A22B Instruct 2507 FP8 or Llama 3.3 70B cheaper?

Qwen3 235B A22B Instruct 2507 FP8 is cheaper on both input tokens ($0.20 vs $0.88 per 1M) and output tokens ($0.60 vs $0.88 per 1M).

Which has the larger context window, Qwen3 235B A22B Instruct 2507 FP8 or Llama 3.3 70B?

Qwen3 235B A22B Instruct 2507 FP8 has the larger context window (262K tokens vs 131K).

Qwen3 235B A22B Instruct 2507 FP8 vs Llama 3.3 70B

Qwen3 235B A22B Instruct 2507 FP8 is cheaper on both input and output tokens and also offers the larger context window. Choose between Qwen3 235B A22B Instruct 2507 FP8 and Llama 3.3 70B based on cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3 235B A22B Instruct 2507 FP8	Llama 3.3 70B
Provider	Together AI	Together AI
Input / 1M tokens	$0.20	$0.88
Output / 1M tokens	$0.60	$0.88
Context window	262K	131K
Parameters	235B total (22B active)	70B
Open weights	Yes	Yes
Released	Jul 2025	Dec 2024
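
To see how the per-token prices in the table translate into a bill, here is a minimal sketch of the arithmetic. The prices are copied from the table; the `PRICES` dictionary, the `estimate_cost` helper, and the workload of 10M input and 2M output tokens are illustrative assumptions, not part of any provider API.

```python
# USD per 1M tokens, copied from the spec table (indicative pricing only).
PRICES = {
    "Qwen3 235B A22B Instruct 2507 FP8": {"input": 0.20, "output": 0.60},
    "Llama 3.3 70B": {"input": 0.88, "output": 0.88},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload for the given model."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


# Hypothetical workload: 10M input tokens and 2M output tokens.
for name in PRICES:
    print(f"{name}: ${estimate_cost(name, 10_000_000, 2_000_000):.2f}")
```

For this assumed workload the sketch gives about $3.20 for Qwen3 235B A22B Instruct 2507 FP8 and $10.56 for Llama 3.3 70B. The gap widens as the share of input tokens grows, because the input price differs more than the output price.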



Pricing is indicative; confirm with the provider before production use.