Benchmark

ANN Search Benchmark

codelibs/search-ann-benchmark evaluates the performance of various Approximate Nearest Neighbor (ANN) algorithms, comparing both response time and accuracy metrics. This provides a comprehensive comparison of ANN-enabled systems for handling high-dimensional data spaces.

Overview

The tests focus on two main metrics:

  • QTime(msec): The time taken to respond to a search query, calculated as the average time for 10,000 queries.
  • Precision@K: The accuracy of the search results for K=10 and K=100.

The tables also include columns labeled "Top 10" and "Top 100," which indicate the number of results retrieved:

  • Top 10: Retrieves the top 10 results.
  • Top 100: Retrieves the top 100 results.

Results

The tests involve searching through a dataset of 100,000 vectors with 768 dimensions, yielding the following results:

ANN Search (Vector Only)

Product Top 10 Top 100 Test Date
QTime Precision@10 QTime Precision@100
chroma 1.5.2 3.3171 0.99256 5.6211 0.95789 2026-03-05
elasticsearch 9.3.1 0.9554 0.99611 1.0969 0.98264 2026-03-08
elasticsearch 9.3.1 (int8) 1.0264 0.95767 1.0479 0.96041 2026-03-08
elasticsearch 9.3.1 (int4) 3.3574 0.82021 3.7216 0.84701 2026-03-08
elasticsearch 9.3.1 (bbq) 0.5633 0.92890 1.2369 0.96666 2026-03-08
elasticsearch 9.3.1 (bbq_disk) 0.0010 0.00000 0.0027 0.00000 2026-03-08
milvus 2.6.11 3.2824 0.93457 3.7507 0.96842 2026-03-05
opensearch 3.5.0 2.1209 0.99727 10.7711 0.99038 2026-03-05
opensearch 3.5.0 (faiss) 3.9982 0.99861 13.0569 0.99617 2026-03-05
pgvector 0.8.2-pg17 4.6037 0.99613 6.0092 0.97691 2026-03-08
qdrant 1.17.0 1.5681 0.99540 1.9238 0.97182 2026-03-05
qdrant 1.17.0 (int8) 0.7555 0.93162 1.1845 0.93658 2026-03-05
vespa 8.650.16 1.7777 0.99008 2.0701 0.95117 2026-03-05
weaviate 1.36.2 5.1775 0.99183 6.4456 0.95664 2026-03-05
redisstack 7.4.0-v8 0.8986 0.99185 2.3229 0.95648 2026-03-06
clickhouse 26.2 6.1217 0.94935 7.2634 0.91581 2026-03-06
lancedb 0.29.2 44.0889 0.99907 74.6486 0.99918 2026-03-06
vald v1.7.17 0.8758 0.84806 3.8194 0.94194 2026-01-08

ANN Search with Keyword Filtering

Product Top 10 Top 100 Test Date
QTime Precision@10 QTime Precision@100
chroma 1.5.2 48.4880 0.99874 50.9714 0.99155 2026-03-05
elasticsearch 9.3.1 0.5589 0.99178 1.2675 0.98347 2026-03-08
elasticsearch 9.3.1 (int8) 0.5240 0.95102 1.3381 0.96192 2026-03-08
elasticsearch 9.3.1 (int4) 1.7735 0.80977 2.0953 0.85440 2026-03-08
elasticsearch 9.3.1 (bbq) 0.4171 0.93250 1.2404 0.98191 2026-03-08
elasticsearch 9.3.1 (bbq_disk) 0.0004 0.00000 0.0027 0.00000 2026-03-08
milvus 2.6.11 3.7221 0.93284 4.4341 0.93516 2026-03-05
opensearch 3.5.0 2.3646 0.99899 11.1649 0.99925 2026-03-05
opensearch 3.5.0 (faiss) 1.7531 0.99687 10.8605 0.99408 2026-03-05
pgvector 0.8.2-pg17 39.0170 1.00000 40.1950 1.00000 2026-03-08
qdrant 1.17.0 0.8428 0.99360 1.2491 0.98313 2026-03-05
qdrant 1.17.0 (int8) 0.7753 0.93449 0.9006 0.95319 2026-03-05
vespa 8.650.16 5.2989 0.99288 5.7896 0.99224 2026-03-05
weaviate 1.36.2 6.3947 0.99891 7.7263 0.99919 2026-03-05
redisstack 7.4.0-v8 1.8205 0.99502 3.8486 0.99929 2026-03-06
clickhouse 26.2 8.8238 0.33576 13.0974 0.40473 2026-03-06
lancedb 0.29.2 41.4673 0.99899 72.0425 0.99929 2026-03-06
vald v1.7.17 3.8065 0.33386 25.8800 0.40058 2026-01-08

The tests are run on GitHub Actions, and the results are collected and summarized in tables. These benchmarks provide basic reference values, allowing users to evaluate and select an appropriate system based on their specific requirements. Be sure to test and verify the chosen solution's performance in your particular context before deployment.