Llama 3.1 8B Instruct

Inference | Open weights

Llama 3.1 8B Instruct by Inference costs $0.03 per 1M input tokens and $0.03 per 1M output tokens, with a 16K-token context window.
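
As a rough illustration of how the per-1M-token rates translate into a per-request cost, the arithmetic can be sketched as below. The rates are copied from this page; the function name `estimate_cost_usd` and the example token counts are hypothetical, and the sketch ignores any cached-input discount because no cached rate is shown here.

```python
# Rates as listed on this page (USD per 1M tokens); confirm with the provider.
INPUT_USD_PER_MILLION = 0.03
OUTPUT_USD_PER_MILLION = 0.03


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token completion.
    cost = estimate_cost_usd(2_000, 500)
    print(f"Estimated cost: ${cost:.6f}")
```

Because the input and output rates are equal here, the cost depends only on the total token count; keeping the two rates separate makes the sketch easier to adapt if they diverge.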

Pricing

Input (per 1M tokens): $0.03
Output (per 1M tokens): $0.03
Cached input (per 1M tokens):

Specifications

Provider: Inference
Context window: 16K tokens
Parameters:
Released: Jan 2025
Open weights: Yes
Frontier model: No

Pricing is per 1M tokens (USD); confirm with the provider before production use.