Is Gemma 3N E4B Instruct or Llama 3.3 70B cheaper?

Gemma 3N E4B Instruct is cheaper on output tokens ($0.12 vs $0.88 per 1M).

Which has the larger context window, Gemma 3N E4B Instruct or Llama 3.3 70B?

Llama 3.3 70B has the larger context window (131K tokens).

Gemma 3N E4B Instruct vs Llama 3.3 70B

Gemma 3N E4B Instruct is cheaper on output tokens, while Llama 3.3 70B offers a larger context window. Choose Gemma 3N E4B Instruct or Llama 3.3 70B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Gemma 3N E4B Instruct	Llama 3.3 70B
Provider	Together AI	Together AI
Input / 1M tokens	$0.06	$0.88
Output / 1M tokens	$0.12	$0.88
Context window	33K	131K
Parameters	—	70B
Open weights	Yes	Yes
Released	May 2025	Dec 2024

Gemma 3N E4B Instruct details →Llama 3.3 70B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.