Skip to content

NousResearch: Hermes 2 Pro - Llama-3 8B vs Qwen: Qwen3.7 Max

NousResearch: Hermes 2 Pro - Llama-3 8B is cheaper on output tokens, while Qwen: Qwen3.7 Max offers a larger context window. Choose NousResearch: Hermes 2 Pro - Llama-3 8B or Qwen: Qwen3.7 Max based on the trade-off between cost, context, and the benchmarks that matter for your use case.

SpecNousResearch: Hermes 2 Pro - Llama-3 8BQwen: Qwen3.7 Max
ProviderKilo GatewayKilo Gateway
Input / 1M tokens$0.14$1.63
Output / 1M tokens$0.14$4.88
Context window8K1M
Parameters
Open weightsYesNo
ReleasedMay 2024Aug 2025

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.