Qwen3 30B A3B Instruct 2507 vs GPT OSS 120B High Throughput

GPT OSS 120B High Throughput is cheaper on both input and output tokens, while Qwen3 30B A3B Instruct 2507 offers a context window roughly twice as large (262K vs 131K). Choose between them based on the trade-off between cost, context, and the benchmarks that matter for your use case.

| Spec | Qwen3 30B A3B Instruct 2507 | GPT OSS 120B High Throughput |
| --- | --- | --- |
| Provider | Clarifai | Clarifai |
| Input / 1M tokens | $0.30 | $0.09 |
| Output / 1M tokens | $0.50 | $0.36 |
| Context window | 262K | 131K |
| Parameters | | |
| Open weights | Yes | Yes |
| Released | Jul 2025 | Aug 2025 |
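
To make the cost trade-off concrete, the listed per-million-token prices can be applied to an expected workload. The following is a minimal sketch, not an official calculator: the `workload_cost` helper and the example token volumes are hypothetical, and the prices are simply copied from the table above, which the page itself describes as indicative.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
# They are indicative only; confirm current pricing with the provider.
PRICES = {
    "Qwen3 30B A3B Instruct 2507": {"input": 0.30, "output": 0.50},
    "GPT OSS 120B High Throughput": {"input": 0.09, "output": 0.36},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a given token volume.

    Hypothetical helper: it assumes simple linear per-token pricing with
    no volume discounts, caching, or minimum charges.
    """
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed example workload: 50M input tokens and 10M output tokens.
    for name in PRICES:
        print(f"{name}: ${workload_cost(name, 50_000_000, 10_000_000):.2f}")
```

Under these assumptions, the example workload comes to $20.00 on Qwen3 30B A3B Instruct 2507 and $8.10 on GPT OSS 120B High Throughput. The estimate covers price only; it does not account for whether a request fits within each model's context window.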

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.