Is Qwen3 VL Flash or GPT OSS 120B cheaper?

GPT OSS 120B is cheaper on output tokens ($0.25 vs $0.40 per 1M).

Which has the larger context window, Qwen3 VL Flash or GPT OSS 120B?

Qwen3 VL Flash has the larger context window (262K tokens).

Qwen3 VL Flash vs GPT OSS 120B

GPT OSS 120B is cheaper on output tokens, while Qwen3 VL Flash offers a larger context window. Choose Qwen3 VL Flash or GPT OSS 120B based on the trade-off between cost, context, and the benchmarks that matter for your use case.

Spec	Qwen3 VL Flash	GPT OSS 120B
Provider	LLM Gateway	LLM Gateway
Input / 1M tokens	$0.05	$0.05
Output / 1M tokens	$0.40	$0.25
Context window	262K	131K
Parameters	—	117B
Open weights	No	No
Released	Oct 2025	Aug 2025

Qwen3 VL Flash details →GPT OSS 120B details →

FAQ

Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.