Skip to content

llama-3_2-nemoretriever-300m-embed-v1

NvidiaOpen weights

llama-3_2-nemoretriever-300m-embed-v1 by Nvidia costs $0.00 per 1M input tokens and $0.00 per 1M output tokens, with a 33K-token context window.

Pricing

Input (per 1M tokens)
$0.00
Output (per 1M tokens)
$0.00
Cached input (per 1M)

Specifications

Provider
Nvidia
Context window
33K tokens
Parameters
Released
Jul 2025
Open weights
Yes
Frontier model
No

Compare llama-3_2-nemoretriever-300m-embed-v1 with…

FAQ

Pricing is per 1M tokens (USD); confirm with the provider before production use.