Is Llama 3.3 70B Instruct or Mistral Medium 3.5 128B cheaper?

Llama 3.3 70B Instruct is cheaper on output tokens ($0.99 vs $5.50 per 1M).

Which has the larger context window, Llama 3.3 70B Instruct or Mistral Medium 3.5 128B?

Mistral Medium 3.5 128B has the larger context window (262K tokens).

Llama 3.3 70B Instruct vs Mistral Medium 3.5 128B

Llama 3.3 70B Instruct is cheaper on output tokens, while Mistral Medium 3.5 128B offers a larger context window. Choose Llama 3.3 70B Instruct or Mistral Medium 3.5 128B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 3.3 70B Instruct	Mistral Medium 3.5 128B
Provider	Berget.AI	Berget.AI
Input / 1M tokens	$0.99	$1.65
Output / 1M tokens	$0.99	$5.50
Context window	128K	262K
Parameters	—	—
Open weights	Yes	Yes
Released	Apr 2025	Apr 2026

Llama 3.3 70B Instruct details →Mistral Medium 3.5 128B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.