`TaskDeps` improvements #147508

nnethercote · 2025-10-09T05:19:58Z

Some cleanups and minor perf improvements relating to TaskDeps.

There are only two places that create a `TaskDeps`. One constructs it manually, the other uses `default`. It's weird that `default()` uses a capacity of 128. This commit just gets rid of `default` and introduces `new` so that both construction sites can be equivalent.

`INLINE_CAPACITY` has two different uses: - It dictates the inline capacity of `EdgesVec::edges`, which is a `SmallVec`. - It dictates when `TaskDeps` switches from a linear scan lookup to a hashset lookup to determine if an edge has been seen before. These two uses are in the same part of the code, but they're fundamentally separate and don't need to use the same constant. This commit separates the two uses, and adds some helpful comments, making the code clearer. It also changes the value used for the linear/hashset threshold from 8 to 16, which gives slightly better perf.

nnethercote · 2025-10-09T05:20:26Z

@bors try @rust-timer queue

`TaskDeps` improvements

rust-bors · 2025-10-09T11:22:41Z

💥 Test timed out after 21600s

nnethercote · 2025-10-09T20:34:07Z

@bors retry

nnethercote · 2025-10-09T22:03:20Z

@bors try @rust-timer queue

`TaskDeps` improvements

rust-bors · 2025-10-10T00:15:08Z

☀️ Try build successful (CI)
Build commit: e4658cf (e4658cffaf309c44def5c182c71fc3b1e4f810d8, parent: b925a865e2c9a0aefe5a2877863cb4df796f2eaf)

rust-timer · 2025-10-10T03:07:10Z

Finished benchmarking commit (e4658cf): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request means it may be perf-sensitive – we'll automatically label it not fit for rolling up. You can override this, but we strongly advise not to, due to possible changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please do so in sufficient writing along with @rustbot label: +perf-regression-triaged. If not, please fix the regressions and do another perf run. If its results are neutral or positive, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

Our most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.

	mean	range	count
Regressions ❌ (primary)	0.7%	[0.6%, 0.8%]	3
Regressions ❌ (secondary)	0.8%	[0.2%, 1.0%]	22
Improvements ✅ (primary)	-0.4%	[-1.2%, -0.1%]	29
Improvements ✅ (secondary)	-0.5%	[-1.1%, -0.1%]	23
All ❌✅ (primary)	-0.3%	[-1.2%, 0.8%]	32

Max RSS (memory usage)

Results (primary -0.8%, secondary 0.4%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	1.9%	[0.8%, 2.5%]	3
Regressions ❌ (secondary)	1.8%	[1.2%, 2.3%]	2
Improvements ✅ (primary)	-4.8%	[-8.8%, -0.9%]	2
Improvements ✅ (secondary)	-1.1%	[-1.1%, -1.0%]	2
All ❌✅ (primary)	-0.8%	[-8.8%, 2.5%]	5

Cycles

Results (secondary -1.8%)

A less reliable metric. May be of interest, but not used to determine the overall result above.

	mean	range	count
Regressions ❌ (primary)	-	-	0
Regressions ❌ (secondary)	3.2%	[2.0%, 4.3%]	2
Improvements ✅ (primary)	-	-	0
Improvements ✅ (secondary)	-3.3%	[-4.0%, -2.2%]	7
All ❌✅ (primary)	-	-	0

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 472.277s -> 472.516s (0.05%)
Artifact size: 388.06 MiB -> 388.09 MiB (0.01%)

nnethercote · 2025-10-13T10:42:30Z

I'm a bit surprised by the mix of perf improvements and regressions. Locally I saw almost universal improvements with a small number of miniscule regressions, e.g. some icount numbers:

BEFORE (Check-IncrFull)
bitmaps-3.2.1                3,999,278,379 (100.0%)  PROGRAM TOTALS
typenum-1.18.0               5,364,985,952 (100.0%)  PROGRAM TOTALS
externs                      1,184,979,961 (100.0%)  PROGRAM TOTALS
many-assoc-items             5,117,422,096 (100.0%)  PROGRAM TOTALS
wg-grammar                   6,753,582,514 (100.0%)  PROGRAM TOTALS
tuple-stress                12,562,234,370 (100.0%)  PROGRAM TOTALS
unicode-normalization-0.1.24 2,729,862,515 (100.0%)  PROGRAM TOTALS
ucd                         24,642,112,477 (100.0%)  PROGRAM TOTALS
coercions                    2,098,125,489 (100.0%)  PROGRAM TOTALS

AFTER (Check-IncrFull)
bitmaps-3.2.1                3,976,850,944 (100.0%)  PROGRAM TOTALS
typenum-1.18.0               5,339,821,854 (100.0%)  PROGRAM TOTALS
externs                      1,176,914,821 (100.0%)  PROGRAM TOTALS
many-assoc-items             5,098,681,486 (100.0%)  PROGRAM TOTALS
wg-grammar                   6,546,434,699 (100.0%)  PROGRAM TOTALS
tuple-stress                12,431,543,435 (100.0%)  PROGRAM TOTALS
unicode-normalization-0.1.24 2,725,721,823 (100.0%)  PROGRAM TOTALS
ucd                         24,665,476,690 (100.0%)  PROGRAM TOTALS
coercions                    2,096,889,857 (100.0%)  PROGRAM TOTALS

Anyway, overall it's still a perf win (esp. if you focus on primary benchmarks) and it's also as much a cleanup PR as a perf PR, so I think it's good enough.

nnethercote added 4 commits October 9, 2025 11:18

Use contains with task_deps.reads.

cc69b42

Rename a badly-named variable.

38d7e8b

nnethercote closed this Oct 9, 2025

rustbot removed the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Oct 9, 2025

nnethercote reopened this Oct 9, 2025

rustbot added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Oct 9, 2025

This comment has been minimized.

Sign in to view

rust-bors bot added a commit that referenced this pull request Oct 9, 2025

Auto merge of #147508 - nnethercote:TaskDeps-improvements, r=<try>

ec47f3e

`TaskDeps` improvements

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Oct 9, 2025

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Oct 9, 2025

nnethercote closed this Oct 9, 2025

nnethercote reopened this Oct 9, 2025

rustbot added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Oct 9, 2025

This comment has been minimized.

Sign in to view

rust-bors bot added a commit that referenced this pull request Oct 9, 2025

Auto merge of #147508 - nnethercote:TaskDeps-improvements, r=<try>

e4658cf

`TaskDeps` improvements

This comment has been minimized.

Sign in to view

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Oct 10, 2025

rustbot assigned saethlin Oct 13, 2025

nnethercote marked this pull request as ready for review October 13, 2025 10:58

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Oct 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`TaskDeps` improvements #147508

`TaskDeps` improvements #147508

nnethercote commented Oct 9, 2025 •

edited

Loading

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Oct 9, 2025

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Oct 10, 2025

Uh oh!

This comment has been minimized.

rust-timer commented Oct 10, 2025

Uh oh!

nnethercote commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

TaskDeps improvements #147508

Are you sure you want to change the base?

TaskDeps improvements #147508

Conversation

nnethercote commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Oct 9, 2025

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

nnethercote commented Oct 9, 2025

Uh oh!

This comment has been minimized.

This comment has been minimized.

rust-bors bot commented Oct 10, 2025

Uh oh!

This comment has been minimized.

rust-timer commented Oct 10, 2025

Overall result: ❌✅ regressions and improvements - please read the text below

Instruction count

Max RSS (memory usage)

Cycles

Binary size

Uh oh!

nnethercote commented Oct 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

`TaskDeps` improvements #147508

`TaskDeps` improvements #147508

nnethercote commented Oct 9, 2025 •

edited

Loading