Avoid walking the complete list of search contexts on shard creation #123855


Merged

Conversation

original-brownbear
Member

@original-brownbear original-brownbear commented Mar 3, 2025

This I found in the many-shards benchmark during some manual testing. Creating indices slows down measurably when there are concurrent searches going on. Interestingly enough, the bulk of the cost is coming from this hook. This makes sense to some extent, because the map can quickly grow to a massive size: it scales as O(shards_searched_on_average * concurrent_searches), and a CHM is generally anything but cheap to iterate over.

=> no need to do this iteration if we're creating a new shard.

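The pattern described above can be sketched as follows. This is a minimal, self-contained illustration with hypothetical names (the real Elasticsearch code in `SearchService` and `IndexShard` looks different): all open search contexts live in one node-wide ConcurrentHashMap keyed by context id, so freeing the contexts of a single shard forces a walk over every entry, and the proposed fix is simply to skip that walk when the shard was just created and cannot have any contexts yet.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of the pattern the PR describes, not the actual
// Elasticsearch implementation.
public class SearchContextsSketch {
    record ShardId(String index, int id) {}
    record ReaderContext(long contextId, ShardId shardId) {}

    // One node-wide map of all open search contexts, keyed by context id.
    private final Map<Long, ReaderContext> activeReaders = new ConcurrentHashMap<>();
    private final AtomicLong idGen = new AtomicLong();

    long openContext(ShardId shardId) {
        long id = idGen.incrementAndGet();
        activeReaders.put(id, new ReaderContext(id, shardId));
        return id;
    }

    // O(total contexts on the node), not O(contexts on this shard):
    // the map is keyed by context id, so every entry must be visited.
    int freeAllContextsForShard(ShardId shardId) {
        int freed = 0;
        for (var it = activeReaders.values().iterator(); it.hasNext(); ) {
            if (it.next().shardId().equals(shardId)) {
                it.remove();
                freed++;
            }
        }
        return freed;
    }

    // The fix the PR sketches: a shard that was just created cannot have
    // any open search contexts, so skip the full-map walk entirely.
    void onShardLifecycleEvent(ShardId shardId, boolean newlyCreated) {
        if (newlyCreated) {
            return; // nothing to free, avoid iterating the whole map
        }
        freeAllContextsForShard(shardId);
    }

    public static void main(String[] args) {
        var node = new SearchContextsSketch();
        var shardA = new ShardId("logs", 0);
        node.openContext(shardA);
        node.openContext(shardA);
        node.openContext(new ShardId("logs", 1));
        node.onShardLifecycleEvent(new ShardId("metrics", 0), true); // skipped
        System.out.println("after create: " + node.activeReaders.size());
        node.onShardLifecycleEvent(shardA, false); // frees shardA's contexts
        System.out.println("after remove: " + node.activeReaders.size());
    }
}
```

The skip is safe precisely because the hook is a no-op for a brand-new shard; it only removes wasted iteration (and the contention it causes) from the index-creation path.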

@elasticsearchmachine
Collaborator

Pinging @elastic/es-search-foundations (Team:Search Foundations)

@elasticsearchmachine elasticsearchmachine added the Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch label Mar 3, 2025
@benchaplin
Contributor

The comment in that method: "we prefer to stop searches to restore full availability as fast as possible" makes me think that allowing searches to continue also slows down index creation - did your change actually speed up the index creation as a whole? Maybe I'm misunderstanding the comment... but it reads like: 'freeing the contexts is supposed to speed things up,' and you're saying 'trying to speed things up is slow, let's skip it to go faster.'

Another thought on:

O(shards_searched_on_average * concurrent_searches)

I wonder, does it make sense to maintain a different data structure of active readers per shard? That would speed up freeAllContextsForShard.
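The suggestion above could look roughly like the following sketch (hypothetical names and structure, not the actual Elasticsearch code): keep a secondary index of context ids per shard, so that freeing a shard's contexts touches only that shard's entries instead of walking the node-wide map.

```java
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of a per-shard secondary index for search contexts.
public class PerShardContexts {
    record ShardId(String index, int id) {}

    // Node-wide view, keyed by context id (as before).
    private final Map<Long, ShardId> allContexts = new ConcurrentHashMap<>();
    // Secondary index: which context ids belong to which shard.
    private final Map<ShardId, Set<Long>> contextsByShard = new ConcurrentHashMap<>();

    void open(long contextId, ShardId shardId) {
        allContexts.put(contextId, shardId);
        contextsByShard
            .computeIfAbsent(shardId, k -> ConcurrentHashMap.newKeySet())
            .add(contextId);
    }

    // Now O(contexts on this shard): no full-map iteration.
    int freeAllContextsForShard(ShardId shardId) {
        Set<Long> ids = contextsByShard.remove(shardId);
        if (ids == null) {
            return 0;
        }
        ids.forEach(allContexts::remove);
        return ids.size();
    }

    public static void main(String[] args) {
        var node = new PerShardContexts();
        var s0 = new ShardId("logs", 0);
        node.open(1, s0);
        node.open(2, s0);
        node.open(3, new ShardId("logs", 1));
        System.out.println("freed: " + node.freeAllContextsForShard(s0));
        System.out.println("remaining: " + node.allContexts.size());
    }
}
```

Note this sketch glosses over keeping the two maps consistent under concurrent open/free; as the author notes below, the cleaner version of this idea is to make the contexts live on the shard object itself (IndexShard) rather than maintaining a parallel map.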

@original-brownbear
Member Author

I wonder, does it make sense to maintain a different data structure of active readers per shard? That would speed up freeAllContextsForShard.

Yeah, this definitely should just live on IndexShard, well spotted :) A colleague and I had the same thought recently. And this also holds the answer to your other question, I think.

but it reads like: 'freeing the contexts is supposed to speed things up,' and you're saying 'trying to speed things up is slow, let's skip it to go faster.'

I think it's more like: "freeing contexts is sometimes slow and more importantly slows down under contention, here we can skip it because it does not do anything anyway to remove some contention introduced by index creation".
Hope that helps :)

You're 100% right, the current approach is not referencing the contexts from the correct place, and dealing with that would be a much stronger fix. I just figured that change would have a harder time getting a review in the short term, and this one made my benchmarking easier to interpret when I opened it :D (and it also still removes some contention/noise from heavily loaded production environments)

Contributor

@benchaplin benchaplin left a comment


Thanks! I didn't fully understand your change. I believe I now see why you said freeing contexts "does not do anything anyway".

Contributor

@benchaplin benchaplin left a comment


Thanks for the explanation!

@original-brownbear original-brownbear added the auto-backport Automatically create backport pull requests when merged label Apr 14, 2025
@original-brownbear
Member Author

Thanks Ben!

@original-brownbear original-brownbear merged commit 235867c into elastic:main Apr 14, 2025
17 checks passed
@original-brownbear original-brownbear deleted the avoid-walking-all-contexts branch April 14, 2025 19:19
@elasticsearchmachine
Collaborator

💚 Backport successful

Branch: 8.x (successful)

original-brownbear added a commit to original-brownbear/elasticsearch that referenced this pull request Apr 14, 2025
…lastic#123855)

elasticsearchmachine pushed a commit that referenced this pull request Apr 14, 2025
…123855) (#126798)

Labels
auto-backport Automatically create backport pull requests when merged >non-issue :Search Foundations/Search Catch all for Search Foundations Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch v8.19.0 v9.1.0
3 participants