Llama 4 Maverick 17B 128E Instruct FP8
Azure Cognitive ServicesOpen weights
Llama 4 Maverick 17B 128E Instruct FP8 by Azure Cognitive Services costs $0.25 per 1M input tokens and $1.00 per 1M output tokens, with a 128K-token context window.
Pricing
Input (per 1M tokens)
$0.25
Output (per 1M tokens)
$1.00
Cached input (per 1M)
—
Specifications
- Provider
- Azure Cognitive Services
- Context window
- 128K tokens
- Parameters
- —
- Released
- Apr 2025
- Open weights
- Yes
- Frontier model
- No
Compare Llama 4 Maverick 17B 128E Instruct FP8 with…
Llama 4 Maverick 17B 128E Instruct FP8 vs GPT-5.3 Codex$14.00/1MLlama 4 Maverick 17B 128E Instruct FP8 vs GPT-5$10.00/1MLlama 4 Maverick 17B 128E Instruct FP8 vs GPT-5 Mini$2.00/1MLlama 4 Maverick 17B 128E Instruct FP8 vs GPT-5.2$14.00/1MLlama 4 Maverick 17B 128E Instruct FP8 vs GPT-4o mini$0.60/1MLlama 4 Maverick 17B 128E Instruct FP8 vs GPT-4o$10.00/1M
FAQ
Pricing is per 1M tokens (USD); confirm with the provider before production use.