Run `newShardSnapshotTask` tasks concurrently #126452

DaveCTurner · 2025-04-08T10:20:49Z

In #88707 we changed the behaviour here to run the shard-snapshot
initialization tasks all in sequence. Yet these tasks do nontrivial work
since they may flush to acquire the relevant index commit, so with this
commit we go back to distributing them across the SNAPSHOT pool again.

In elastic#88707 we changed the behaviour here to run the shard-snapshot initialization tasks all in sequence. Yet these tasks do nontrivial work since they may flush to acquire the relevant index commit, so with this commit we go back to distributing them across the `SNAPSHOT` pool again.

elasticsearchmachine · 2025-04-08T10:21:13Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

elasticsearchmachine · 2025-04-08T10:21:13Z

Hi @DaveCTurner, I've created a changelog YAML for you.

…ntly

pxsalehi · 2025-04-08T13:27:16Z

server/src/main/java/org/elasticsearch/snapshots/SnapshotShardsService.java

+        // apply some backpressure by reserving one SNAPSHOT thread for the startup work
+        startShardSnapshotTaskRunner.runSyncTasksEagerly(threadPool.executor(ThreadPool.Names.SNAPSHOT));


Is this part really necessary?

I see this is a pattern used else where too. Adds basically one more level on top of throttled task runner which is one more level on top of threadpool. :) Was there a case where the throttled task runner was causing an issue that we had to add this extra piece to it? I'm wondering then if we're just running too many stuff in the same threadpool which lead to this.

The unbounded queue in the ThrottledTaskRunner is always a worry, yes.

pxsalehi

One generic question for my understanding. Otherwise, LGTM.

…ntly

elasticsearchmachine · 2025-04-08T15:11:27Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 126452

In elastic#88707 we changed the behaviour here to run the shard-snapshot initialization tasks all in sequence. Yet these tasks do nontrivial work since they may flush to acquire the relevant index commit, so with this commit we go back to distributing them across the `SNAPSHOT` pool again. Backport of elastic#126452 to `8.x`

DaveCTurner · 2025-04-08T15:25:13Z

Backport is #126478

In #88707 we changed the behaviour here to run the shard-snapshot initialization tasks all in sequence. Yet these tasks do nontrivial work since they may flush to acquire the relevant index commit, so with this commit we go back to distributing them across the `SNAPSHOT` pool again. Backport of #126452 to `8.x`

DaveCTurner added >bug :Distributed Coordination/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs auto-backport Automatically create backport pull requests when merged v8.19.0 v9.1.0 labels Apr 8, 2025

DaveCTurner requested review from ywangd and pxsalehi April 8, 2025 10:20

elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Apr 8, 2025

Update docs/changelog/126452.yaml

7499c36

Merge branch 'main' into 2025/04/08/newShardSnapshotTask-run-concurre…

75b7114

…ntly

pxsalehi reviewed Apr 8, 2025

View reviewed changes

pxsalehi approved these changes Apr 8, 2025

View reviewed changes

DaveCTurner added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Apr 8, 2025

Merge branch 'main' into 2025/04/08/newShardSnapshotTask-run-concurre…

ad7ba88

…ntly

elasticsearchmachine merged commit 94c385d into elastic:main Apr 8, 2025
17 checks passed

DaveCTurner deleted the 2025/04/08/newShardSnapshotTask-run-concurrently branch April 8, 2025 15:10

elasticsearchmachine added the backport pending label Apr 8, 2025

DaveCTurner mentioned this pull request Apr 8, 2025

Run newShardSnapshotTask tasks concurrently #126478

Merged

DaveCTurner removed the backport pending label Apr 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Run `newShardSnapshotTask` tasks concurrently #126452

Run `newShardSnapshotTask` tasks concurrently #126452

Uh oh!

DaveCTurner commented Apr 8, 2025

Uh oh!

elasticsearchmachine commented Apr 8, 2025

Uh oh!

elasticsearchmachine commented Apr 8, 2025

Uh oh!

pxsalehi Apr 8, 2025

Uh oh!

pxsalehi Apr 8, 2025

Uh oh!

DaveCTurner Apr 8, 2025

Uh oh!

pxsalehi left a comment

Uh oh!

Uh oh!

elasticsearchmachine commented Apr 8, 2025

Uh oh!

DaveCTurner commented Apr 8, 2025

Uh oh!

Uh oh!

		// apply some backpressure by reserving one SNAPSHOT thread for the startup work
		startShardSnapshotTaskRunner.runSyncTasksEagerly(threadPool.executor(ThreadPool.Names.SNAPSHOT));

Run newShardSnapshotTask tasks concurrently #126452

Run newShardSnapshotTask tasks concurrently #126452

Uh oh!

Conversation

DaveCTurner commented Apr 8, 2025

Uh oh!

elasticsearchmachine commented Apr 8, 2025

Uh oh!

elasticsearchmachine commented Apr 8, 2025

Uh oh!

pxsalehi Apr 8, 2025

Choose a reason for hiding this comment

Uh oh!

pxsalehi Apr 8, 2025

Choose a reason for hiding this comment

Uh oh!

DaveCTurner Apr 8, 2025

Choose a reason for hiding this comment

Uh oh!

pxsalehi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

elasticsearchmachine commented Apr 8, 2025

💔 Backport failed

Uh oh!

DaveCTurner commented Apr 8, 2025

Uh oh!

Uh oh!

Run `newShardSnapshotTask` tasks concurrently #126452

Run `newShardSnapshotTask` tasks concurrently #126452