Question 1

What is the cheapest embeddings provider in 2026?

Accepted Answer

For one-time corpus embedding: Voyage 3 Lite at $0.02/M tokens. OpenAI text-embedding-3-small at $0.02/M. Cohere Embed v4 Light at $0.10/M. Jina v3 at $0.018/M. BGE M3 self-hosted is effectively free at scale. For quality + price balance, OpenAI text-embedding-3-large at $0.13/M wins.

Question 2

How much does it cost to embed a 1M-document corpus?

Accepted Answer

At 500 tokens/doc average × 1M docs = 500M tokens. OpenAI text-embedding-3-small: $10. OpenAI text-embedding-3-large: $65. Cohere Embed v4: $50. For most one-time corpus embeds, the bill is small — recurring re-embedding from doc updates is what scales.

Question 3

How often should I re-embed my corpus?

Accepted Answer

Static reference data (legal, scientific): annually or on schema change. Frequently-updated docs (product catalog, docs site): weekly delta re-embed of changed chunks only. Don't batch-re-embed unchanged data — use change-detection on file hash or last-modified timestamps.

Question 4

Should I use 1536 or 3072 dimension embeddings?

Accepted Answer

1536 (OpenAI default) is sufficient for 90% of use cases. 3072 wins on long-context retrieval (legal, scientific). 1536 stores 2× cheaper in your vector DB and queries faster. Use Matryoshka truncation to test 512 → 1024 → 1536 — often gains plateau at 1024.

Question 5

Is self-hosting BGE-M3 cheaper than OpenAI embeddings?

Accepted Answer

Above ~5B embedded tokens/month, yes. BGE-M3 on a single H100 ($1.85–$2.50/hr) runs ~2M tokens/sec — that's 5T tokens/month at $1.3k/month flat. OpenAI text-embedding-3-large at $0.13/M = $650 per billion tokens, so self-host beats above ~2B tokens/month.

Question 6

How are embeddings priced — by tokens or by documents?

Accepted Answer

Always by input tokens. The calculator converts your doc count × avg tokens/doc into the billable token count. OpenAI, Cohere, Voyage, and Jina all charge per million input tokens regardless of dimension. Storage is separate (paid to your vector DB).

Provider	Model	$ / 1M tokens	One-time embed cost	Monthly cost	Year 1
Together	BGE-M3 1024 dim · Self-host open weights for $0	$0.008	$0.40	$0.14	$2
Together	bge-large-en-v1.5 1024 dim	$0.008	$0.40	$0.14	$2
Fireworks	nomic-embed-text-v1.5 768 dim	$0.008	$0.40	$0.14	$2
Jina AI	jina-embeddings-v3 1024 dim · configurable	$0.012	$0.60	$0.21	$3
Jina AI	jina-embeddings-v4 2048 dim · configurable	$0.018	$0.90	$0.31	$5
OpenAI	text-embedding-3-small 1536 dim · configurable	$0.02	$1.00	$0.35	$5
Voyage AI	voyage-4-lite 512 dim · 200M tokens free	$0.02	$1.00	$0.35	$5
Voyage AI	voyage-3-lite 512 dim	$0.02	$1.00	$0.35	$5
Amazon Bedrock	Titan Embed v2 1024 dim · configurable	$0.02	$1.00	$0.35	$5
Voyage AI	voyage-4 1024 dim · configurable · 200M tokens free	$0.06	$3.00	$1.05	$16
Voyage AI	voyage-3 1024 dim	$0.06	$3.00	$1.05	$16
Cohere	embed-english-v3.0 1024 dim	$0.10	$5.00	$1.75	$26
Cohere	embed-multilingual-v3.0 1024 dim	$0.10	$5.00	$1.75	$26
Cohere	embed-english-light-v3.0 384 dim · Smaller, cheaper at inference	$0.10	$5.00	$1.75	$26
Mistral	mistral-embed 1024 dim	$0.10	$5.00	$1.75	$26
Voyage AI	voyage-4-large 1024 dim · configurable · Top MTEB 2026; 200M tokens free	$0.12	$6.00	$2.10	$31
OpenAI	text-embedding-3-large 3072 dim · configurable · Matryoshka — truncate to 256/512/1024 without retrain	$0.13	$6.50	$2.28	$34
Google	Gemini Embedding 3072 dim · configurable · Text-only	$0.15	$7.50	$2.63	$39
Voyage AI	voyage-3-large 1024 dim · configurable · Legacy v3; consider voyage-4-large	$0.18	$9.00	$3.15	$47
Voyage AI	voyage-code-3 1024 dim · Optimized for code retrieval	$0.18	$9.00	$3.15	$47
Google	Gemini Embedding 2 3072 dim · configurable · Multimodal: text $0.20, image $0.45, audio $6.50, video $12 per 1M tokens	$0.20	$10.00	$3.50	$52

AI Embeddings Cost Calculator

What this calculator does

9 providers compared

One-time + recurring

Refresh frequency slider

Self-host break-even

Dimension truncation

Query token modeling

Quick comparison

How to use this calculator

Why use this calculator

Frequently Asked Questions

Provider	One-time	Monthly	$ / 1M tokens
Jina v3	$9	$0.90	$0.018
Voyage 3 Lite	$10	$1	$0.02
OpenAI text-embed-3-small	$10	$1	$0.02
Cohere Embed v4 Light	$50	$5	$0.10
Voyage 3 Large	$65	$6.50	$0.13
OpenAI text-embed-3-large	$65	$6.50	$0.13
Self-host BGE-M3 (H100)	~$45	~$1,300	flat /mo