reuse VertexRef instance on update #3444

arnetheduck · 2025-07-03T11:03:15Z

When updates to the MPT happen, a new VertexRef is allocated every time - this keeps the code simple but has the significant downside that updates cause unnecessary allocations.

Instead of allocating a new VertexRef on every update, we can update the existing one provided that it is not shared. We can prevent it from being shared by copying it eagerly when it's added to the layer. A downside of this approach is that we also have to make a copy when invalidating hash keys, which affects branch nodes mainly.

The tradeoff seems well worth it though, specially for imports that clock a nice perf boost, like in this little test:

(21005462, 21008193]  14.46  15.50  2,479.35  2,656.98  9m26s  8m48s   7.16%   7.16%   -6.69%
(21013654, 21016385]  15.28  16.14  2,523.74  2,665.83  8m56s  8m27s   5.63%   5.63%   -5.33%
(21021846, 21024577]  15.52  17.66  2,539.25  2,889.61  8m47s  7m43s  13.80%  13.80%  -12.12%

blocks: 16384, baseline: 27m10s, contender: 24m59s
Time (total): -2m10s, -8.00%

When updates to the MPT happen, a new VertexRef is allocated every time - this keeps the code simple but has the significant downside that updates cause unnecessary allocations. Instead of allocating a new `VertexRef` on every update, we can update the existing one provided that it is not shared. We can prevent it from being shared by copying it eagerly when it's added to the layer. A downside of this approach is that we also have to make a copy when invalidating hash keys, which affects branch and account nodes mainly. The tradeoff seems well worth it though, specially for imports that clock a nice perf boost, like in this little test: ``` (21005462, 21008193] 14.46 15.50 2,479.35 2,656.98 9m26s 8m48s 7.16% 7.16% -6.69% (21013654, 21016385] 15.28 16.14 2,523.74 2,665.83 8m56s 8m27s 5.63% 5.63% -5.33% (21021846, 21024577] 15.52 17.66 2,539.25 2,889.61 8m47s 7m43s 13.80% 13.80% -12.12% blocks: 16384, baseline: 27m10s, contender: 24m59s Time (total): -2m10s, -8.00% ```

Deferred GC seemed like a good idea to reduce the amount of work done during block processing, but a side effect of this is that more memory ends up being allocated in certain workloads which in turn causes an overall slowdown, with a long test showing a net performance effect that hovers around 0% and more memory usage. In particular, the troublesome range around 2M sees a 10-15% slowdown and an ugly memory usage spike. Reverting for now - it might be worth revisiting in the future under different memory allocation patters, but as usual, it's better to not do work at all (like in #3444) than to do work faster. This reverts commit 3a00915.

arnetheduck mentioned this pull request Jul 3, 2025

Revert "defer gc during block processing (#3384)" #3445

Merged

arnetheduck added 2 commits July 4, 2025 09:42

simplify key updates

3455983

merge noise

f381899

arnetheduck merged commit 00d2ad4 into master Jul 4, 2025
23 checks passed

arnetheduck deleted the reuse-vtx branch July 4, 2025 10:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

reuse VertexRef instance on update #3444

reuse VertexRef instance on update #3444

Uh oh!

arnetheduck commented Jul 3, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

reuse VertexRef instance on update #3444

reuse VertexRef instance on update #3444

Uh oh!

Conversation

arnetheduck commented Jul 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

arnetheduck commented Jul 3, 2025 •

edited

Loading