Llama 3.2 3B Instruct vs Llama 3.1 8B Instruct
Llama 3.2 3B Instruct is cheaper on output tokens. Choose Llama 3.2 3B Instruct or Llama 3.1 8B Instruct based on the trade-off between cost, context, and the benchmarks that matter for your use case.
| Spec | Llama 3.2 3B Instruct | Llama 3.1 8B Instruct |
|---|---|---|
| Provider | Inference | Inference |
| Input / 1M tokens | $0.02 | $0.03 |
| Output / 1M tokens | $0.02 | $0.03 |
| Context window | 16K | 16K |
| Parameters | — | — |
| Open weights | Yes | Yes |
| Released | Jan 2025 | Jan 2025 |
FAQ
Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.