Is GLM 4.1V Thinking FlashX or Auto model (Standard) cheaper?

GLM 4.1V Thinking FlashX is cheaper on both input tokens ($0.30 vs $10.00 per 1M) and output tokens ($0.30 vs $19.99 per 1M).

Which has the larger context window, GLM 4.1V Thinking FlashX or Auto model (Standard)?

Auto model (Standard) has the larger context window (1M tokens vs 64K tokens).

GLM 4.1V Thinking FlashX vs Auto model (Standard)

GLM 4.1V Thinking FlashX is cheaper on both input and output tokens, while Auto model (Standard) offers a much larger context window (1M vs 64K tokens). Choose between them based on the trade-off between cost and context, and on the benchmarks that matter for your use case.

Spec	GLM 4.1V Thinking FlashX	Auto model (Standard)
Provider	NanoGPT	NanoGPT
Input / 1M tokens	$0.30	$10.00
Output / 1M tokens	$0.30	$19.99
Context window	64K	1M
Parameters	—	—
Open weights	No	No
Released	Jul 2025	Jun 2024
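
Because both models are priced per 1M tokens, the figures in the table translate directly into a per-workload estimate. The sketch below is a minimal illustration in Python, not part of any provider API: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, the prices are copied from the table, and the workload size is made up for the example.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
# Indicative only; confirm with the provider before relying on them.
PRICES = {
    "GLM 4.1V Thinking FlashX": {"input": 0.30, "output": 0.30},
    "Auto model (Standard)": {"input": 10.00, "output": 19.99},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one workload for the given model."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per 1M tokens


if __name__ == "__main__":
    # Hypothetical workload: 1M input tokens and 1M output tokens.
    for name in PRICES:
        cost = estimate_cost(name, 1_000_000, 1_000_000)
        print(f"{name}: ${cost:.2f}")
```

With equal input and output volumes the estimate is simply the sum of the two per-1M prices, so the gap between the models scales linearly with traffic. The estimate ignores context limits: a prompt that exceeds the 64K window cannot be sent to GLM 4.1V Thinking FlashX regardless of price.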


Pricing is indicative — confirm with the provider before production use. Highlighted values indicate the better figure for that row.