Performance

Benchmarks

Real benchmarks comparing Memvid to leading vector DBs, tested on 40k Wikipedia articles and 2,500 queries.

Last updated: Nov 15, 2025 · New benchmarks coming soon

State-of-the-Art Performance in Retrieval

Memvid vs others (avg, baseline): +13.3% (SOTA)
vs Chroma: +18.5%
vs LanceDB: +10.1%
vs Qdrant: +10.1%
vs Weaviate: +14.9%

Accuracy@1 improvement on Wikipedia benchmark with 40k articles and 2,500 queries.
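The percentages above are consistent with relative (not percentage-point) gains in Accuracy@1, computed from the values in the Full Comparison table. A minimal sketch of that arithmetic follows; the function and variable names are illustrative and not part of the Memvid benchmark suite.

```python
# Accuracy@1 values (percent) copied from the Full Comparison table.
ACCURACY_AT_1 = {
    "Memvid": 92.7,
    "Chroma": 78.2,
    "LanceDB": 84.2,
    "Qdrant": 84.2,
    "Weaviate": 80.7,
}


def relative_gain(ours: float, theirs: float) -> float:
    """Relative improvement of `ours` over `theirs`, in percent."""
    return (ours / theirs - 1.0) * 100.0


memvid = ACCURACY_AT_1["Memvid"]
others = {name: acc for name, acc in ACCURACY_AT_1.items() if name != "Memvid"}

# Gain over each competitor individually.
gains = {name: relative_gain(memvid, acc) for name, acc in others.items()}

# Gain over the unweighted mean of the four competitors (the "baseline").
baseline = sum(others.values()) / len(others)
avg_gain = relative_gain(memvid, baseline)

for name, gain in gains.items():
    print(f"vs {name}: {gain:+.1f}%")
print(f"vs average of others: {avg_gain:+.1f}%")
```

Rounded to one decimal place, this gives +18.5%, +10.1%, +10.1%, +14.9%, and +13.3%, matching the chart.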

Memvid Highlights

Accuracy@1: 92.7%
Best-in-class retrieval accuracy on the Wikipedia benchmark.

Latency P50: 16.0 ms
Median query latency across 2,500 queries, with sub-20 ms responses.

Cold Start: 0.5 ms
Time to first query; effectively instant startup.

MRR Score: 0.949
Mean Reciprocal Rank, a ranking-quality metric; the best overall ranking of the five systems.

Queries/sec: 61.1 QPS
Throughput on the 39K-document corpus, on par with LanceDB (61.3 QPS).

NDCG@10
Normalized Discounted Cumulative Gain over the top 10 results. Highest search quality.
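For readers unfamiliar with the metrics named above, here is a minimal sketch of Accuracy@k, MRR, and NDCG@k using their standard textbook definitions with binary relevance. The page does not show the benchmark's own scoring code, so these definitions and all names below are assumptions, and the toy data is made up purely for illustration.

```python
import math
from typing import Sequence, Set


def accuracy_at_k(ranked: Sequence[str], relevant: Set[str], k: int) -> float:
    """1.0 if any relevant document appears in the top k results, else 0.0."""
    return 1.0 if any(doc in relevant for doc in ranked[:k]) else 0.0


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant result, or 0.0 if none was retrieved."""
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            return 1.0 / rank
    return 0.0


def ndcg_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """NDCG@k with binary relevance (gain 1 for relevant, 0 otherwise)."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, doc in enumerate(ranked[:k], start=1)
        if doc in relevant
    )
    # Ideal ranking: every relevant document placed at the top.
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0


def mean(values: Sequence[float]) -> float:
    return sum(values) / len(values)


# Toy data: (ranked result ids, set of relevant ids) per query.
runs = [
    (["a", "b", "c"], {"a"}),  # hit at rank 1
    (["x", "y", "z"], {"y"}),  # hit at rank 2
    (["p", "q", "r"], {"s"}),  # miss
]

acc1 = mean([accuracy_at_k(r, rel, 1) for r, rel in runs])
mrr = mean([reciprocal_rank(r, rel) for r, rel in runs])
ndcg10 = mean([ndcg_at_k(r, rel, 10) for r, rel in runs])
print(f"Accuracy@1={acc1:.3f}  MRR={mrr:.3f}  NDCG@10={ndcg10:.3f}")
```

Each metric is computed per query and then averaged over the query set, which is how a single figure such as an MRR of 0.949 would summarize 2,500 queries.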

Full Comparison

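| Metric      | Memvid  | Chroma  | LanceDB | Qdrant  | Weaviate |
|-------------|---------|---------|---------|---------|----------|
| Accuracy@1  | 92.7%   | 78.2%   | 84.2%   | 84.2%   | 80.7%    |
| Accuracy@10 | 98.1%   | 88.7%   | 96.2%   | 96.2%   | 91.5%    |
| Latency P50 | 16.0 ms | 55.6 ms | 16.0 ms | 28.0 ms | 5.3 ms   |
| Latency P99 | 19.7 ms | 65.2 ms | 19.3 ms | 31.4 ms | 7.9 ms   |
| Cold Start  | 0.5 ms  | 66.3 ms | 72.4 ms | 71.8 ms | 147.7 ms |
| MRR         | 0.949   | 0.823   | 0.888   | 0.888   | 0.849    |
| QPS         | 61.1    | 17.8    | 61.3    | 35.7    | 180.4    |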
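The latency and throughput rows can be reproduced in spirit with a simple timing loop. The sketch below assumes queries are issued one at a time from a single client, which is consistent with the QPS figures being close to 1000 divided by the P50 latency in milliseconds (for example, 1000 / 16.0 ≈ 62.5 against a reported 61.1). The `search` callable is a hypothetical stand-in for whichever client is under test; it is not a Memvid API.

```python
import math
import time
from typing import Callable, Dict, Sequence


def percentile(samples: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile: smallest sample with at least pct% at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct / 100.0 * len(ordered)))
    return ordered[rank - 1]


def measure(search: Callable[[str], object], queries: Sequence[str]) -> Dict[str, float]:
    """Run queries sequentially and report P50/P99 latency (ms) and QPS."""
    latencies_ms = []
    start = time.perf_counter()
    for query in queries:
        t0 = time.perf_counter()
        search(query)  # hypothetical: any callable that executes one query
        latencies_ms.append((time.perf_counter() - t0) * 1000.0)
    elapsed_s = time.perf_counter() - start
    return {
        "p50_ms": percentile(latencies_ms, 50),
        "p99_ms": percentile(latencies_ms, 99),
        "qps": len(queries) / elapsed_s,
    }


def fake_search(query: str) -> list:
    """Placeholder workload so the example runs without any database."""
    return sorted(query * 100)


stats = measure(fake_search, [f"query {i}" for i in range(200)])
print(stats)
```

Cold start is a separate measurement: the time from opening the store to the first completed query, taken in a fresh process so that nothing is already cached.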

Storage Efficiency

Memvid: 507.7 MB
Chroma: 1025.1 MB
LanceDB: 213.5 MB
Qdrant: 212.2 MB
Weaviate: 1008.5 MB

Storage size for 40k Wikipedia articles with embeddings

Memvid stores everything in a single portable file — no database server required

Test Environment

Dataset: Wikipedia (40k articles)
Queries: 2,500 test queries
Original Size: 151.8 MB
Embedding Model: BGE-small
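Reading "Original Size" as the raw size of the corpus before indexing (an assumption; the page does not define it), the Storage Efficiency figures can be expressed as an expansion factor over the source data. A minimal sketch, with illustrative names:

```python
# Sizes in MB, copied from the Storage Efficiency and Test Environment sections.
ORIGINAL_MB = 151.8
STORAGE_MB = {
    "Memvid": 507.7,
    "Chroma": 1025.1,
    "LanceDB": 213.5,
    "Qdrant": 212.2,
    "Weaviate": 1008.5,
}


def expansion(stored_mb: float, original_mb: float = ORIGINAL_MB) -> float:
    """How many times larger the stored index is than the original data."""
    return stored_mb / original_mb


# Print systems from smallest to largest footprint.
for name, size in sorted(STORAGE_MB.items(), key=lambda item: item[1]):
    print(f"{name:<9}{size:>8.1f} MB  {expansion(size):.2f}x")
```

On these numbers Memvid stores roughly 3.3x the original size, more than Qdrant and LanceDB (about 1.4x each) and about half of Weaviate and Chroma (about 6.6x and 6.8x).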

Run your own benchmarks: cd benchmarks/python && python run_benchmark.py

Memvid leads this comparison on retrieval quality (Accuracy@1, Accuracy@10, and MRR) and on cold start, where it is ready in 0.5 ms against 66 ms or more for the other systems. Its query latency and throughput are on par with LanceDB and ahead of Chroma and Qdrant, while Weaviate is faster on both. Memvid does this without a server, without pipelines, and inside a single portable file. These benchmarks reflect what developers see in real-world use: fast answers, better recall, and a simpler memory stack.