Introduce an int4 off-heap vector scorer #129824

iverase · 2025-06-23T05:47:03Z

In our IVF implementation, we are currently scoring centroid quantized using int4 on heap. We know that scoring vectors directly from the Index can yield good speedups. Therefore this commit introduces a new class that scorers vectors quantized using int4 off-heap data structures.

this indeed yields a nice speed up:

Benchmark                                             (dims)   Mode  Cnt   Score   Error   Units
Int4ScorerBenchmark.scoreFromArray                       384  thrpt    5  15.384 ± 0.445  ops/ms
Int4ScorerBenchmark.scoreFromArray                       702  thrpt    5   8.908 ± 0.697  ops/ms
Int4ScorerBenchmark.scoreFromArray                      1024  thrpt    5   7.605 ± 0.149  ops/ms
Int4ScorerBenchmark.scoreFromMemorySegmentOnlyVector     384  thrpt    5  16.463 ± 0.456  ops/ms
Int4ScorerBenchmark.scoreFromMemorySegmentOnlyVector     702  thrpt    5   9.854 ± 0.276  ops/ms
Int4ScorerBenchmark.scoreFromMemorySegmentOnlyVector    1024  thrpt    5   8.701 ± 0.865  ops/ms

elasticsearchmachine · 2025-06-23T05:47:27Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

server/src/main/java/org/elasticsearch/index/codec/vectors/DefaultIVFVectorsReader.java

benwtrent

This is a good step one towards bulk scoring!

I wonder if we can be even faster if we did "two pass" approximations e.g. dot-product against the centroids with their higher bits, and then only refined with lower bits if within some acceptable error threshold. This would complicate the centroid query logic as we would need to do multiple passes over the centroids. But since we score all of them right now, it seems pretty simple to do.

…aultIVFVectorsReader.java Co-authored-by: Benjamin Trent <[email protected]>

* Introduce an int4 off-heap vector scorer * iter * Update server/src/main/java/org/elasticsearch/index/codec/vectors/DefaultIVFVectorsReader.java Co-authored-by: Benjamin Trent <[email protected]> --------- Co-authored-by: Benjamin Trent <[email protected]>

Introduce an int4 off-heap vector scorer

9a341c9

iverase requested review from benwtrent and john-wagster June 23, 2025 05:47

iverase added >non-issue :Search Relevance/Vectors Vector search v9.1.0 labels Jun 23, 2025

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Jun 23, 2025

iverase added 2 commits June 23, 2025 07:54

iter

86c3bf9

Merge branch 'main' into ES91Int4VectorsScorer

b594528

benwtrent reviewed Jun 23, 2025

View reviewed changes

server/src/main/java/org/elasticsearch/index/codec/vectors/DefaultIVFVectorsReader.java Outdated Show resolved Hide resolved

benwtrent approved these changes Jun 23, 2025

View reviewed changes

iverase and others added 2 commits June 23, 2025 15:36

Update server/src/main/java/org/elasticsearch/index/codec/vectors/Def…

7c4a2ed

…aultIVFVectorsReader.java Co-authored-by: Benjamin Trent <[email protected]>

Merge branch 'main' into ES91Int4VectorsScorer

c6bc662

iverase added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jun 23, 2025

iverase enabled auto-merge (squash) June 23, 2025 16:43

iverase disabled auto-merge June 23, 2025 16:44

iverase merged commit ffea6ca into elastic:main Jun 23, 2025
31 of 32 checks passed

iverase deleted the ES91Int4VectorsScorer branch June 23, 2025 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Introduce an int4 off-heap vector scorer #129824

Introduce an int4 off-heap vector scorer #129824

Uh oh!

iverase commented Jun 23, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

Uh oh!

benwtrent left a comment

Uh oh!

Uh oh!

Uh oh!

Introduce an int4 off-heap vector scorer #129824

Introduce an int4 off-heap vector scorer #129824

Uh oh!

Conversation

iverase commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Jun 23, 2025

Uh oh!

Uh oh!

benwtrent left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

iverase commented Jun 23, 2025 •

edited

Loading