
Introduce batched query execution and data-node side reduce (#121885) #126563


Merged: 4 commits merged into elastic:8.x from 121885-8.x on Apr 10, 2025

Conversation

original-brownbear
Member

This change moves the query phase to a single roundtrip per node, just like can_match and field_caps already work. As a result of executing multiple shard queries from a single request, we can also partially reduce each node's query results on the data node side before responding to the coordinating node.

This significantly reduces the impact of network latency on end-to-end query performance, and it cuts the work done on the coordinating node (memory and CPU) as well as the network traffic by factors of up to the number of shards per data node!

Benchmarks show improvements of up to orders of magnitude in heap usage and network traffic when querying across a larger number of shards.

backport of #121885
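
To make the idea concrete, here is a minimal, self-contained sketch of per-node batching with a data-node-side partial reduce. This is plain Java, not the actual Elasticsearch classes; names such as `ShardTarget`, `executeAndReduce`, and `finalReduce` are purely illustrative.

```java
// Sketch only: the coordinator sends one request per data node, each node runs
// its local shard queries and partially reduces them, and the coordinator's
// final reduce sees one partial result per node instead of one per shard.
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class BatchedQuerySketch {

    record ShardTarget(String nodeId, int shardId) {}
    record ShardHit(float score, String docId) {}
    record NodeResult(String nodeId, List<ShardHit> topHits) {}

    /** Coordinator side: group shards by node so each node receives one request. */
    static Map<String, List<ShardTarget>> groupByNode(List<ShardTarget> shards) {
        return shards.stream().collect(Collectors.groupingBy(ShardTarget::nodeId));
    }

    /** Data-node side: run every local shard query, then partially reduce
     *  (merge and trim to topN) before sending a single response back. */
    static NodeResult executeAndReduce(String nodeId, List<ShardTarget> localShards, int topN) {
        List<ShardHit> merged = new ArrayList<>();
        for (ShardTarget shard : localShards) {
            merged.addAll(queryShard(shard)); // per-shard query, executed locally
        }
        merged.sort(Comparator.comparingDouble(ShardHit::score).reversed());
        return new NodeResult(nodeId, merged.subList(0, Math.min(topN, merged.size())));
    }

    /** Coordinator side: final reduce over per-node partial results. */
    static List<ShardHit> finalReduce(List<NodeResult> nodeResults, int topN) {
        List<ShardHit> all = new ArrayList<>();
        nodeResults.forEach(r -> all.addAll(r.topHits()));
        all.sort(Comparator.comparingDouble(ShardHit::score).reversed());
        return all.subList(0, Math.min(topN, all.size()));
    }

    /** Stand-in for the actual per-shard query execution. */
    static List<ShardHit> queryShard(ShardTarget shard) {
        return List.of(new ShardHit(1.0f / (shard.shardId() + 1), "doc-" + shard.shardId()));
    }

    public static void main(String[] args) {
        List<ShardTarget> shards = List.of(
            new ShardTarget("node-1", 0), new ShardTarget("node-1", 1), new ShardTarget("node-2", 2));
        List<NodeResult> perNode = groupByNode(shards).entrySet().stream()
            .map(e -> executeAndReduce(e.getKey(), e.getValue(), 10))
            .toList();
        System.out.println(finalReduce(perNode, 10));
    }
}
```

The point of the sketch is that the coordinating node's final reduce operates on one partial result per data node instead of one result per shard, which is where the memory, CPU, and network-traffic savings described above come from.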

@javanna (Member) left a comment


LGTM. Shall we include #126385 in it directly so we don't get test failures due to that bug?

`Lucene.EMPTY_TOP_DOCS` is now used to identify empty top docs results. These were previously
null results, but they did not need to be sent over transport, as incremental reduction
was performed only on the data node.

Now it can happen that the coordinating node receives a merge result with empty top docs,
which has nothing interesting for merging, but that can lead to an exception because
the type of the empty array does not match the type of the other shards' results, for
instance if the query was sorted by field. To resolve this, we filter out empty
top docs results before merging.

Closes elastic#126118
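
For context, the type mismatch the fix avoids can be sketched with plain Lucene types. This is an illustrative reproduction of the failure mode and the filtering idea, not the actual Elasticsearch change; the `filter` step below only mirrors the "drop empty top docs before merging" approach described above.

```java
// Sketch: a field-sorted shard result is a TopFieldDocs, while an empty shard
// result may only be a plain, empty TopDocs (the shared empty constant), which
// cannot be merged as a TopFieldDocs. Filtering out empty results before the
// merge avoids the type mismatch.
import org.apache.lucene.search.FieldDoc;
import org.apache.lucene.search.ScoreDoc;
import org.apache.lucene.search.Sort;
import org.apache.lucene.search.SortField;
import org.apache.lucene.search.TopDocs;
import org.apache.lucene.search.TopFieldDocs;
import org.apache.lucene.search.TotalHits;

import java.util.Arrays;

public class EmptyTopDocsMergeSketch {

    public static void main(String[] args) {
        Sort sort = new Sort(new SortField("timestamp", SortField.Type.LONG));

        // A shard with field-sorted hits produces a TopFieldDocs...
        TopFieldDocs withHits = new TopFieldDocs(
            new TotalHits(1, TotalHits.Relation.EQUAL_TO),
            new ScoreDoc[] { new FieldDoc(0, Float.NaN, new Object[] { 42L }) },
            sort.getSort());

        // ...while an empty shard result may be a plain, empty TopDocs.
        TopDocs empty = new TopDocs(new TotalHits(0, TotalHits.Relation.EQUAL_TO), new ScoreDoc[0]);

        // The fix in spirit: drop empty top docs results, so only results of the
        // expected type reach the field-sorted merge.
        TopFieldDocs[] nonEmpty = Arrays.stream(new TopDocs[] { withHits, empty })
            .filter(td -> td.scoreDocs.length > 0)
            .map(td -> (TopFieldDocs) td)
            .toArray(TopFieldDocs[]::new);

        TopFieldDocs merged = TopDocs.merge(sort, 10, nonEmpty);
        System.out.println("merged hits: " + merged.scoreDocs.length);
    }
}
```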
@original-brownbear
Member Author

Makes sense, pulled it in here :)

@original-brownbear original-brownbear merged commit e9c179a into elastic:8.x Apr 10, 2025
15 checks passed
@original-brownbear original-brownbear deleted the 121885-8.x branch April 10, 2025 15:14