Llama 3.1 8B Instruct
InferenceOpen weights
Llama 3.1 8B Instruct by Inference costs $0.03 per 1M input tokens and $0.03 per 1M output tokens, with a 16K-token context window.
Pricing
Input (per 1M tokens)
$0.03
Output (per 1M tokens)
$0.03
Cached input (per 1M)
—
Specifications
- Provider
- Inference
- Context window
- 16K tokens
- Parameters
- —
- Released
- Jan 2025
- Open weights
- Yes
- Frontier model
- No
Compare Llama 3.1 8B Instruct with…
Llama 3.1 8B Instruct vs Mistral Nemo 12B Instruct$0.10/1MLlama 3.1 8B Instruct vs Google Gemma 3$0.30/1MLlama 3.1 8B Instruct vs Osmosis Structure 0.6B$0.50/1MLlama 3.1 8B Instruct vs Qwen 3 Embedding 4B$0.00/1MLlama 3.1 8B Instruct vs Qwen 2.5 7B Vision Instruct$0.20/1MLlama 3.1 8B Instruct vs Llama 3.2 1B Instruct$0.01/1M
FAQ
Pricing is per 1M tokens (USD); confirm with the provider before production use.