scx_rustland: Introduce a congestion threshold #1894

Merged
merged 1 commit into main from rustland-congestion-threshold on May 16, 2025

Conversation

Contributor

@arighi arighi commented May 16, 2025

If too many tasks are piling up in the user-space scheduler, we risk hitting stall conditions.

To prevent this, introduce a congestion threshold: when the number of waiting tasks exceeds this threshold, the scheduler will proactively flush the queue to bring the task count back below the critical level.

This helps handle heavy stress tests that might flood the system with a high volume of tasks.
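As a rough, hypothetical sketch of that flush logic (the struct, field, and method names below are illustrative, not the actual scx_rustland code):

```rust
// Illustrative sketch only: names and structure are made up, not taken
// from the actual scx_rustland sources.

/// Congestion threshold: maximum number of tasks allowed to wait in the
/// user-space scheduler before a proactive flush is triggered.
const NR_WAITING_MAX: u64 = 128;

struct Scheduler {
    /// Tasks queued in user space, waiting to be dispatched (PIDs used
    /// here as a stand-in for real task descriptors).
    task_pool: Vec<i32>,
}

impl Scheduler {
    fn nr_waiting(&self) -> u64 {
        self.task_pool.len() as u64
    }

    /// Dispatch a single task back to the BPF side (stubbed out here).
    fn dispatch_one(&mut self) {
        let _pid = self.task_pool.pop();
    }

    fn schedule(&mut self) {
        // Normal path: dispatch whatever the current cycle asks for.
        self.dispatch_one();

        // Congestion path: if the backlog exceeds the threshold, keep
        // flushing until the count drops back below the critical level,
        // so the queue cannot grow unboundedly and trigger stalls.
        while self.nr_waiting() > NR_WAITING_MAX {
            self.dispatch_one();
        }
    }
}

fn main() {
    let mut sched = Scheduler { task_pool: (0..1000).collect() };
    sched.schedule();
    assert!(sched.nr_waiting() <= NR_WAITING_MAX);
}
```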

@arighi arighi requested review from htejun, multics69 and hodgesds May 16, 2025 08:35
@@ -117,6 +117,9 @@ struct Opts {
// Time constants.
const NSEC_PER_USEC: u64 = 1_000;

// Congestion threshold.
const NR_WAITING_MAX: u64 = 128;
Contributor

Could be interesting to try scaling this by the number of cores.

Contributor Author

Could be interesting to try scaling this by the number of cores.

I was thinking about that initially, but it's not trivial to model this effectively. On really big systems we could end up allowing thousands of tasks to queue up before triggering any flush, leading to bursty and stuttering behavior.

If tasks keep queuing up and the queue length keeps growing beyond a certain threshold, it doesn't matter much whether we have 1 CPU or 1000 CPUs: the system simply doesn't have enough capacity to consume the incoming requests. In that case we may want to operate in a more synchronous way, flushing tasks to prevent wait times from getting too long (which may lead to stalls).
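To make that concern concrete, here is a tiny, purely illustrative comparison (the per-CPU value is made up, not something proposed in this PR):

```rust
// Purely illustrative: compares a fixed congestion threshold with a
// hypothetical per-CPU one. None of these numbers come from the PR.
const NR_WAITING_MAX: u64 = 128;    // fixed threshold used by this PR
const NR_WAITING_PER_CPU: u64 = 16; // made-up per-CPU budget

fn per_cpu_threshold(nr_cpus: u64) -> u64 {
    NR_WAITING_PER_CPU * nr_cpus
}

fn main() {
    // On a 512-CPU machine a per-CPU threshold would let thousands of
    // tasks accumulate before any flush, which is exactly the
    // burstiness/stuttering concern described above.
    println!("fixed threshold:   {}", NR_WAITING_MAX);         // 128
    println!("per-CPU threshold: {}", per_cpu_threshold(512)); // 8192
}
```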

BTW, I may also update this PR; I'm not really happy with how I've implemented the flush. I'm currently running more tests with a slightly different approach. :)

Contributor

Long term it could be interesting to see if an arena-based approach could work as well; it would require a bit of thinking though.

Contributor Author

Oh yes, replacing the ring buffers used to bounce tasks to/from BPF with arenas would also be interesting. It's something I'm planning to do once arenas become a bit more stable.

@arighi arighi force-pushed the rustland-congestion-threshold branch from cd0ef85 to 32ce51a on May 16, 2025 13:49
If too many tasks are piling up in the user-space scheduler we may
risk hitting stall conditions.

To prevent this, introduce a congestion threshold: when the number of
waiting tasks exceeds this threshold, the scheduler will proactively
flush the queue to bring the task count back below the critical level.

Moreover, introduce the new option --nr-waiting-max to make this
threshold configurable from the command line.

This helps handle heavy stress tests that might flood the system with a
high volume of tasks.

Signed-off-by: Andrea Righi <[email protected]>
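The --nr-waiting-max option could be wired into the existing clap-based Opts struct roughly as follows (a minimal, hypothetical sketch; the exact attributes and default in the actual patch may differ):

```rust
// Hypothetical sketch of exposing the congestion threshold on the
// command line with clap's derive API; not the actual patch.
use clap::Parser;

#[derive(Debug, Parser)]
struct Opts {
    /// Maximum number of tasks that can be waiting in the user-space
    /// scheduler before a proactive flush is triggered.
    #[clap(long, default_value = "128")]
    nr_waiting_max: u64,
}

fn main() {
    let opts = Opts::parse();
    println!("congestion threshold: {}", opts.nr_waiting_max);
}
```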
@arighi arighi force-pushed the rustland-congestion-threshold branch from 32ce51a to f3f24fe on May 16, 2025 14:04
@arighi arighi added this pull request to the merge queue May 16, 2025
Merged via the queue into main with commit 2a5a1d2 May 16, 2025
32 checks passed
@arighi arighi deleted the rustland-congestion-threshold branch May 16, 2025 15:19