Force all FieldCaps response handling onto a single thread per request #120863

original-brownbear · 2025-01-25T19:44:44Z

The current implementation uses a task-per-response model. This is unnecessarily costly when all results are merged into a single synchronized map. The concurrency across requests allows for concurrent deserialization of responses (an advantage of running on multiple threads that would effectively disappear with #120010), but it becomes incredibly costly when responses collide, because of the hot lock acquisitions.
Also, deserializing more than a single response at a time comes with higher-than-necessary heap overhead (especially for the remote cluster use case), because we hold multiple deserialized responses on heap even though merging is sequential in the end anyway.
Also, removing all synchronization from the hot loops outright reduces their cost, even in the uncontended case.
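
To make the idea concrete, here is a minimal sketch of the "single thread per request" pattern described above. It is not the code from this PR: the class name `SerialResponseMerger`, the use of `Map<String, Integer>` as a stand-in for a field-caps response, and the counter-based hand-off are all assumptions chosen for illustration. Responses are enqueued from any thread, but only one drain task is ever active, so the merge target can be a plain, unsynchronized `HashMap`.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.Executor;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical illustration only; the names and types here are not from Elasticsearch.
final class SerialResponseMerger {
    // Responses waiting to be merged; safe to add to from any thread.
    private final Queue<Map<String, Integer>> pending = new ConcurrentLinkedQueue<>();
    // Number of responses enqueued but not yet merged.
    private final AtomicInteger queued = new AtomicInteger();
    // Plain map: only ever touched by the single active drain task.
    private final Map<String, Integer> merged = new HashMap<>();
    private final Executor executor;

    SerialResponseMerger(Executor executor) {
        this.executor = executor;
    }

    // May be called concurrently, e.g. from transport threads.
    void onResponse(Map<String, Integer> response) {
        pending.add(response);
        // Only the caller that moves the counter from 0 to 1 schedules a drain,
        // so at most one drain task runs at any time.
        if (queued.getAndIncrement() == 0) {
            executor.execute(this::drain);
        }
    }

    // Merges queued responses one at a time, without taking any lock.
    private void drain() {
        do {
            // Non-null: every increment of the counter is preceded by an add to the queue.
            Map<String, Integer> next = pending.poll();
            next.forEach((field, count) -> merged.merge(field, count, Integer::sum));
        } while (queued.decrementAndGet() > 0);
    }

    // Only safe to call once all responses have been handled.
    Map<String, Integer> result() {
        return merged;
    }
}
```

In this sketch a response that arrives while a drain is in progress is simply queued and picked up by the already-running task, so responses are handled one at a time and the hot merge loop contains no lock acquisitions.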

elasticsearchmachine · 2025-01-25T19:45:16Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

ChrisHegarty

LGTM.

This change will queue handling of the responses, and process them one at a time (rather than concurrently). I guess this could slow things a little in some cases, but that is the desired outcome to avoid unnecessary concurrency and resource utilisation in many cases.

original-brownbear · 2025-04-23T15:53:16Z

Thanks Chris! In my benchmarking via the many_shards run I couldn't really make out any slowdown.
You're right that in theory we lose the concurrent deserialization of the responses, but the actual merging also gets visibly cheaper by avoiding synchronization. (As an alternative I could have started doing the synchronization more cleverly, but the memory issues this solves, and the lack of benchmarking showing a possible win from doing that, made me not look into it.) I couldn't actually reproduce any slowdown, but I bet one can be crafted depending on the hardware + mappings, though probably not a huge one.

That said, my hope/assumption is that by doing something like #120010 we could actually get a serious speedup out of single-threadedness and a smaller working set :) 🤞

original-brownbear · 2025-04-24T00:05:51Z

Thanks Chris!

Same reasoning as for field_caps in elastic#120863, no need to have multiple threads contending the same mutex(s) when the heavy lifting step in handling the results is sequential anyway.

original-brownbear added 2 commits January 25, 2025 20:34

cleanup

e3d5b02

original-brownbear added the >non-issue, :Search Foundations/Search, v9.0.0, and v8.18.0 labels Jan 25, 2025

elasticsearchmachine added the Team:Search Foundations label Jan 25, 2025

elasticsearchmachine added v8.19.0 v9.1.0 and removed v8.18.0 v9.0.0 labels Jan 30, 2025

Merge branch 'main' into faster-field-caps

6483868

ChrisHegarty approved these changes Apr 23, 2025

Merge branch 'main' into faster-field-caps

609e4f1

original-brownbear added 3 commits April 23, 2025 23:17

Merge remote-tracking branch 'elastic/main' into faster-field-caps

7201dda

fix test

151c3a9

Merge branch 'main' into faster-field-caps

cbf8e6a

original-brownbear added the backport pending label Apr 24, 2025

original-brownbear merged commit f4a6d18 into elastic:main Apr 24, 2025
17 checks passed

original-brownbear deleted the faster-field-caps branch April 24, 2025 00:06

original-brownbear mentioned this pull request Apr 24, 2025

Force all per-node query response handling onto a single thread #127317

Open
