Calculator
AI Embeddings Cost Calculator
Estimate one-time and recurring embedding cost across 9+ providers. Plug in document corpus size, chunk strategy, and refresh frequency.
Pricing data refreshed:
The AITOT Embeddings Cost calculator estimates one-time corpus embedding plus recurring re-embed cost across 9 providers — OpenAI text-embedding-3-small/large, Cohere Embed v4, Voyage 3 (Lite + standard), Jina v3, BGE-M3 (self-host), Mistral Embed, Google text-embedding-005, and Azure OpenAI Embed.
For a 1M-document corpus at 500 tokens average per doc = 500M tokens. OpenAI text-embedding-3-small: $10. OpenAI text-embedding-3-large: $65. Cohere Embed v4: $50. Voyage 3 Lite: $10. Most one-time embed bills are small; recurring re-embedding from doc updates is what scales.
Toggle refresh frequency (0 = never, 0.25 = every 4 months, 1 = monthly, 4 = weekly). Above 5B tokens/month, self-hosted BGE-M3 on a single H100 beats OpenAI. The calculator surfaces self-host break-even alongside managed provider costs.
Cheapest · year 1
Together · BGE-M3
1024 dim · 8,192 max tokens
$2
| Provider | Model | $ / 1M tokens | One-time embed cost | Monthly cost | Year 1 |
|---|---|---|---|---|---|
| Together | BGE-M3 1024 dim · Self-host open weights for $0 | $0.008 | $0.40 | $0.14 | $2 |
| Together | bge-large-en-v1.5 1024 dim | $0.008 | $0.40 | $0.14 | $2 |
| Fireworks | nomic-embed-text-v1.5 768 dim | $0.008 | $0.40 | $0.14 | $2 |
| Jina AI | jina-embeddings-v3 1024 dim · configurable | $0.012 | $0.60 | $0.21 | $3 |
| Jina AI | jina-embeddings-v4 2048 dim · configurable | $0.018 | $0.90 | $0.31 | $5 |
| OpenAI | text-embedding-3-small 1536 dim · configurable | $0.02 | $1.00 | $0.35 | $5 |
| Voyage AI | voyage-4-lite 512 dim · 200M tokens free | $0.02 | $1.00 | $0.35 | $5 |
| Voyage AI | voyage-3-lite 512 dim | $0.02 | $1.00 | $0.35 | $5 |
| Amazon Bedrock | Titan Embed v2 1024 dim · configurable | $0.02 | $1.00 | $0.35 | $5 |
| Voyage AI | voyage-4 1024 dim · configurable · 200M tokens free | $0.06 | $3.00 | $1.05 | $16 |
| Voyage AI | voyage-3 1024 dim | $0.06 | $3.00 | $1.05 | $16 |
| Cohere | embed-english-v3.0 1024 dim | $0.10 | $5.00 | $1.75 | $26 |
| Cohere | embed-multilingual-v3.0 1024 dim | $0.10 | $5.00 | $1.75 | $26 |
| Cohere | embed-english-light-v3.0 384 dim · Smaller, cheaper at inference | $0.10 | $5.00 | $1.75 | $26 |
| Mistral | mistral-embed 1024 dim | $0.10 | $5.00 | $1.75 | $26 |
| Voyage AI | voyage-4-large 1024 dim · configurable · Top MTEB 2026; 200M tokens free | $0.12 | $6.00 | $2.10 | $31 |
| OpenAI | text-embedding-3-large 3072 dim · configurable · Matryoshka — truncate to 256/512/1024 without retrain | $0.13 | $6.50 | $2.28 | $34 |
| Gemini Embedding 3072 dim · configurable · Text-only | $0.15 | $7.50 | $2.63 | $39 | |
| Voyage AI | voyage-3-large 1024 dim · configurable · Legacy v3; consider voyage-4-large | $0.18 | $9.00 | $3.15 | $47 |
| Voyage AI | voyage-code-3 1024 dim · Optimized for code retrieval | $0.18 | $9.00 | $3.15 | $47 |
| Gemini Embedding 2 3072 dim · configurable · Multimodal: text $0.20, image $0.45, audio $6.50, video $12 per 1M tokens | $0.20 | $10.00 | $3.50 | $52 |
Refresh frequency of 0.25 means re-embed the corpus once every 4 months. Models marked "configurable" support Matryoshka truncation — you can downsize dimensions post-hoc without re-embedding.
What this calculator does
9 providers compared
OpenAI 3-small/large, Cohere v4, Voyage 3, Jina, Mistral, Google, Azure, BGE-M3 self-host.
One-time + recurring
Initial corpus embed cost + monthly re-embed cost shown separately.
Refresh frequency slider
Model how often you re-embed (never, quarterly, monthly, weekly).
Self-host break-even
Compares managed APIs to BGE-M3 on rented H100. Break-even ~2B tokens/month.
Dimension truncation
Matryoshka models (OpenAI 3-large) let you truncate dimensions for storage savings.
Query token modeling
Embeddings cost is symmetrical — query tokens count too. Often overlooked.
Quick comparison
Cost to embed a 500M-token corpus + 50M monthly query tokens
| Provider | One-time | Monthly | $ / 1M tokens |
|---|---|---|---|
| Jina v3 | $9 | $0.90 | $0.018 |
| Voyage 3 Lite | $10 | $1 | $0.02 |
| OpenAI text-embed-3-small | $10 | $1 | $0.02 |
| Cohere Embed v4 Light | $50 | $5 | $0.10 |
| Voyage 3 Large | $65 | $6.50 | $0.13 |
| OpenAI text-embed-3-large | $65 | $6.50 | $0.13 |
| Self-host BGE-M3 (H100) | ~$45 | ~$1,300 | flat /mo |
Self-host wins above ~2B tokens/month total throughput.
How to use this calculator
Calculate one-time corpus embedding + recurring re-embed cost across 9 providers.
- 1
Enter corpus size
Tokens in your full corpus. Documents × avg tokens/doc. Typical: 1 doc = 500 tokens.
- 2
Set refresh frequency
0 = never, 1 = monthly, 4 = weekly. Most production corpora re-embed quarterly.
- 3
Add query volume
Monthly query tokens (queries × tokens/query). Often the biggest line item over time.
- 4
Compare and pick
Sort by monthly cost. Self-host BGE-M3 wins above ~2B tokens/month.
Why use this calculator
- ✓9 providers refreshed monthly
- ✓One-time + recurring split
- ✓Self-host break-even modeled
- ✓Matryoshka dimension truncation
- ✓Query tokens included
- ✓No login required