Is Llama 3.3 70B Cu Mai or Auto model (Standard) cheaper?

Llama 3.3 70B Cu Mai is cheaper on output tokens ($0.49 vs $19.99 per 1M).

Which has the larger context window, Llama 3.3 70B Cu Mai or Auto model (Standard)?

Auto model (Standard) has the larger context window (1M tokens).

Llama 3.3 70B Cu Mai vs Auto model (Standard)

Llama 3.3 70B Cu Mai is cheaper on output tokens, while Auto model (Standard) offers a larger context window. Choose Llama 3.3 70B Cu Mai or Auto model (Standard) based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Llama 3.3 70B Cu Mai	Auto model (Standard)
Provider	NanoGPT	NanoGPT
Input / 1M tokens	$0.49	$10.00
Output / 1M tokens	$0.49	$19.99
Context window	16K	1M
Parameters	—	—
Open weights	No	No
Released	Dec 2024	Jun 2024

Llama 3.3 70B Cu Mai details →Auto model (Standard) details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.