Llama 3.2 3B Instruct
InferenceOpen weights
Llama 3.2 3B Instruct by Inference costs $0.02 per 1M input tokens and $0.02 per 1M output tokens, with a 16K-token context window.
Pricing
Input (per 1M tokens)
$0.02
Output (per 1M tokens)
$0.02
Cached input (per 1M)
—
Specifications
- Provider
- Inference
- Context window
- 16K tokens
- Parameters
- —
- Released
- Jan 2025
- Open weights
- Yes
- Frontier model
- No
Compare Llama 3.2 3B Instruct with…
Llama 3.2 3B Instruct vs Mistral Nemo 12B Instruct$0.10/1MLlama 3.2 3B Instruct vs Google Gemma 3$0.30/1MLlama 3.2 3B Instruct vs Osmosis Structure 0.6B$0.50/1MLlama 3.2 3B Instruct vs Qwen 3 Embedding 4B$0.00/1MLlama 3.2 3B Instruct vs Qwen 2.5 7B Vision Instruct$0.20/1MLlama 3.2 3B Instruct vs Llama 3.1 8B Instruct$0.03/1M
FAQ
Pricing is per 1M tokens (USD); confirm with the provider before production use.