Skip to content

GPT OSS 120B High Throughput vs Qwen3 Coder 30B A3B Instruct

GPT OSS 120B High Throughput is cheaper on output tokens, while Qwen3 Coder 30B A3B Instruct offers a larger context window. Choose GPT OSS 120B High Throughput or Qwen3 Coder 30B A3B Instruct based on the trade-off between cost, context, and the benchmarks that matter for your use case.

SpecGPT OSS 120B High ThroughputQwen3 Coder 30B A3B Instruct
ProviderClarifaiClarifai
Input / 1M tokens$0.09$0.11
Output / 1M tokens$0.36$0.75
Context window131K262K
Parameters
Open weightsYesYes
ReleasedAug 2025Jul 2025

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.