triedb/pathdb, eth: use double-buffer mechanism in pathdb #30464

rjl493456442 · 2024-09-19T06:18:10Z

Previously, PathDB used a single buffer to aggregate database writes, which needed to be flushed atomically. However, flushing large amounts of data (e.g., 256MB) caused significant overhead, often blocking the system for around 3 seconds during the flush.

To mitigate this overhead and reduce performance spikes, a double-buffer mechanism is introduced. When the active buffer fills up, it is marked as frozen and a background flushing process is triggered. Meanwhile, a new buffer is allocated for incoming writes, allowing operations to continue uninterrupted.

This approach reduces system blocking times and provides flexibility in adjusting buffer parameters for improved performance.

TODO:

release the content in the frozen buffer after flushing

holiman

All in all, this looks promising, I suspect this could help quite a bit

triedb/pathdb/nodebuffer.go

holiman · 2024-09-19T06:58:09Z

triedb/pathdb/nodebuffer.go

+		nodes := writeNodes(batch, b.nodes, clean)
+		rawdb.WritePersistentStateID(batch, id)
+
+		// Flush all mutations in a single batch


Note: at this point, mutations were already applied on the clean, i.e, dl.cleans cache. That happened during writeNodes. I've tried to figure out if that is a problem, but come to the conclusion that it's fine, but just wanted to raise it so you can also give it a think.

Regarding "flush all mutations in a single batch" -- is that important only because of crash-safety, or some other more subtle reason?

How about this
in disklayer.go, function node(), we lookup a node. Order:

buffer

frozen

cleans

database

And if found, write to cleans

if dl.cleans != nil && len(blob) > 0 { dl.cleans.Set(key, blob) cleanWriteMeter.Mark(int64(len(blob))) }

I'm trying to think of a case where this write-to-cleans conflicts with the write-to-cleans in the background committer writeNodes method.

if it's found in buffer/frozen => return and no interaction with cache

if it's found in cache => return

if it's found in disk (it implicitly means the item is not in these places above, even the item is marked as deleted, it will still be caught in buffer/frozen/cache), load it from db and add it into the cache

so, no conflict should happen

But i have to say it's a really good point, i haven't thought about it

Regarding "flush all mutations in a single batch" -- is that important only because of crash-safety, or some other more subtle reason?

Only because of crash-safety

holiman

LGTM, would be interesting to see some performance-charts. This PR needs some runtime before merging, IMO

rjl493456442 · 2024-09-23T07:13:34Z

For sure, it’s not a please-merge-it pull request, it will be twisted a bit and have a full performance impact inspection Thanks and Best regards Gary rong Martin HS ***@***.***>于2024年9月23日周一下午2:49写道：

…

***@***.**** approved this pull request. LGTM, would be interesting to see some performance-charts. This PR needs some runtime before merging, IMO — Reply to this email directly, view it on GitHub <#30464 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABNO6OOSVOX3WRFPVBR6AJ3ZX62XNAVCNFSM6AAAAABOPFBYCCVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDGMRRGI2DKNRVGY> . You are receiving this because you authored the thread.Message ID: ***@***.***>

joey0612 · 2024-10-10T03:05:47Z

Referenced #28471 ？

rjl493456442 · 2024-12-10T02:26:18Z

The triedb commit is improved as expected, but the EVM execution is constantly slower, with unknown reasons.

waynercheung · 2025-04-27T09:35:23Z

Referenced #28471 ？

I have the same question, is this PR referenced #28471 ？

rjl493456442 · 2025-04-27T12:55:17Z

@joey0612 @waynercheung

No, double-buffering has been part of my design since day one when I first implemented the path database.
I even had a prototype a few years ago. However, it hasn’t been merged yet for several reasons:
(a) the overall performance improvement was not significant; and (b) after flushing the write buffer into the database, subsequent database reads became 2–3× slower, which in turn slowed down the entire chain progression.

However, with the fix to block prefetcher, the state read has been significantly improved. And I want to pile
up this change on top one more time.

cockroachdb/pebble#4109

triedb/pathdb/disklayer.go

MariusVanDerWijden

LGTM

MariusVanDerWijden · 2025-06-20T07:07:43Z

triedb/pathdb/buffer.go

-		return fmt.Errorf("buffer layers (%d) cannot be applied on top of persisted state id (%d) to reach requested state id (%d)", b.layers, head, id)
+func (b *buffer) flush(root common.Hash, db ethdb.KeyValueStore, freezer ethdb.AncientWriter, progress []byte, nodesCache, statesCache *fastcache.Cache, id uint64, postFlush func()) {
+	if b.done != nil {
+		panic("duplicated flush operation")


This can never happen, even if the flushing takes a long time to do, right? Because we are always rotating out the buffers

Exactly. The buffer is supposed to be flushed for one time.

rjl493456442 requested review from karalabe and holiman as code owners September 19, 2024 06:18

rjl493456442 force-pushed the multibuffer branch 2 times, most recently from b48c0c9 to 20b4ffd Compare September 19, 2024 07:08

holiman reviewed Sep 19, 2024

View reviewed changes

rjl493456442 force-pushed the multibuffer branch from 432633f to fc0cd1e Compare September 23, 2024 05:01

holiman previously approved these changes Sep 23, 2024

View reviewed changes

holiman mentioned this pull request Oct 14, 2024

all: unify the trie database and snapshot in path mode #30159

Closed

rjl493456442 mentioned this pull request Oct 15, 2024

core, trie, triedb: port changes from the snapshot integration #30599

Merged

rjl493456442 force-pushed the multibuffer branch from fc0cd1e to 05036ff Compare December 3, 2024 02:16

holiman changed the title ~~triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB~~ triedb/pathdb, eth: use double-buffer mechanism in pathdb Dec 5, 2024

rjl493456442 force-pushed the multibuffer branch 2 times, most recently from 2383977 to 28ee3bc Compare December 6, 2024 06:17

rjl493456442 dismissed holiman’s stale review via 28ee3bc December 10, 2024 13:00

rjl493456442 added the status:on-hold label Dec 30, 2024

fjl removed the status:on-hold label Jan 7, 2025

rjl493456442 commented May 5, 2025

View reviewed changes

triedb/pathdb/disklayer.go Show resolved Hide resolved

rjl493456442 force-pushed the multibuffer branch from 28ee3bc to 1ceed0b Compare June 12, 2025 08:50

rjl493456442 added the status:triage label Jun 19, 2025

rjl493456442 force-pushed the multibuffer branch from 94c427b to 84aa25d Compare June 19, 2025 10:43

rjl493456442 added 4 commits June 19, 2025 18:51

triedb/pathdb, eth: use double-buffer mechanism in pathdb

761ac5d

triedb/pathdb: polish

dbf6d04

core/state: improve test

5b80a7a

triedb/pathdb: fix comments

41f8520

rjl493456442 force-pushed the multibuffer branch from 84aa25d to 41f8520 Compare June 19, 2025 10:52

rjl493456442 added 2 commits June 20, 2025 14:20

core: fix broken tests

24f6b1f

core: fix broken test

e4f7a37

MariusVanDerWijden approved these changes Jun 20, 2025

View reviewed changes

MariusVanDerWijden reviewed Jun 20, 2025

View reviewed changes

rjl493456442 added this to the 1.15.12 milestone Jun 22, 2025

rjl493456442 merged commit 2192020 into ethereum:master Jun 22, 2025
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

triedb/pathdb, eth: use double-buffer mechanism in pathdb #30464

triedb/pathdb, eth: use double-buffer mechanism in pathdb #30464

rjl493456442 commented Sep 19, 2024 •

edited

Loading

Uh oh!

holiman left a comment

Uh oh!

Uh oh!

holiman Sep 19, 2024

Uh oh!

holiman Sep 19, 2024

Uh oh!

rjl493456442 Sep 19, 2024 •

edited

Loading

Uh oh!

rjl493456442 Sep 19, 2024

Uh oh!

holiman left a comment

Uh oh!

rjl493456442 commented Sep 23, 2024 via email

Uh oh!

joey0612 commented Oct 10, 2024 •

edited

Loading

Uh oh!

rjl493456442 commented Dec 10, 2024

Uh oh!

waynercheung commented Apr 27, 2025

Uh oh!

rjl493456442 commented Apr 27, 2025

Uh oh!

Uh oh!

MariusVanDerWijden left a comment

Uh oh!

MariusVanDerWijden Jun 20, 2025

Uh oh!

rjl493456442 Jun 20, 2025

Uh oh!

Uh oh!

Uh oh!

triedb/pathdb, eth: use double-buffer mechanism in pathdb #30464

triedb/pathdb, eth: use double-buffer mechanism in pathdb #30464

Conversation

rjl493456442 commented Sep 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

holiman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

holiman Sep 19, 2024

Choose a reason for hiding this comment

Uh oh!

holiman Sep 19, 2024

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Sep 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Sep 19, 2024

Choose a reason for hiding this comment

Uh oh!

holiman left a comment

Choose a reason for hiding this comment

Uh oh!

rjl493456442 commented Sep 23, 2024 via email

Uh oh!

joey0612 commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rjl493456442 commented Dec 10, 2024

Uh oh!

waynercheung commented Apr 27, 2025

Uh oh!

rjl493456442 commented Apr 27, 2025

Uh oh!

Uh oh!

MariusVanDerWijden left a comment

Choose a reason for hiding this comment

Uh oh!

MariusVanDerWijden Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

rjl493456442 Jun 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

rjl493456442 commented Sep 19, 2024 •

edited

Loading

rjl493456442 Sep 19, 2024 •

edited

Loading

joey0612 commented Oct 10, 2024 •

edited

Loading