Skip to content

Qwen3 235B A22B Instruct 2507 FP8 vs Llama 3.3 70B

Qwen3 235B A22B Instruct 2507 FP8 is cheaper on output tokens, while Qwen3 235B A22B Instruct 2507 FP8 offers a larger context window. Choose Qwen3 235B A22B Instruct 2507 FP8 or Llama 3.3 70B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

SpecQwen3 235B A22B Instruct 2507 FP8Llama 3.3 70B
ProviderTogether AITogether AI
Input / 1M tokens$0.20$0.88
Output / 1M tokens$0.60$0.88
Context window262K131K
Parameters70B
Open weightsYesYes
ReleasedJul 2025Dec 2024

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.