Add support for global size-based retention policy #628

kolesnikovae · 2023-04-14T13:04:56Z

Resolves #612.

One of the drawbacks is that mount points and symlinks are not honoured: if one uses a separate file system for a specific tenant (or a group), this fact is completely ignored. An ultimate solution to this would be tracking each file system independently. As a workaround, we could make it so that a user can exclude tenants from this global policy.

This change also allows implementing per-tenant retention policies in an easy way.

pkg/ingester/retention_test.go

pkg/phlaredb/phlaredb.go

kolesnikovae · 2023-04-14T13:22:00Z

pkg/phlaredb/phlaredb.go

+		case b := <-f.evictCh:
+			b.evicted, b.err = f.blockQuerier.evict(b.blockID)
+			close(b.done)


Not related to the PR, but: to me it's not entirely clear whether the head block is visible to queries in-between Flush and runBlockQuerierSync that should pick the new block:

case <-f.Head().flushCh: if err := f.Flush(ctx); err != nil { level.Error(f.logger).Log("msg", "flushing head block failed", "err", err) continue } f.runBlockQuerierSync(ctx)

pkg/ingester/ingester.go

pkg/phlaredb/phlaredb.go

…tion-policy-enforcement

kolesnikovae · 2023-05-09T16:49:11Z

pkg/ingester/retention.go

+	defaultMinFreeDisk                        = 10 * 1024 * 1024 * 1024 // 10Gi
+	defaultMinDiskAvailablePercentage         = 0.05
+	defaultRetentionPolicyEnforcementInterval = 5 * time.Minute


I think it's worth making these parameters configurable

I tend to agree, it will always be a balance between flexibility and making Pyroscope complex to operate, but being able to change that or disable deletion all together is more than a nice to have. That said I wouldn't mind if you do this in here or in a follow up PR.

simonswine

LGTM. Great work! I have tested this extensively locally and it did behave exactly as I expected it to. Also good effort on covering it all with tests.

simonswine · 2023-05-10T10:06:48Z

pkg/ingester/retention.go

+		// cleanup there to avoid deleting all blocks when disk usage reporting
+		// is delayed.
+		if volumeStatsPrev != nil && volumeStatsPrev.BytesAvailable >= volumeStatsCurrent.BytesAvailable {
+			level.Warn(e.logger).Log("msg", "disk utilization is not lowered by deletion of a block, pausing until next cycle")


defaultRetentionPolicyEnforcementInterval = 5 * time.Minute

Just a note here, for my own Laptop, the filesystem btrfs is fairly slow to reflect the free disk change (or doesn't at all because of snapshots). So that basically means we would delete a block every 5 minutes. Which I think is ok and better than deleting all blocks because the free disk space doesn't go down.

Oh, that's a great point, which worth mentioning in docs. As a small optimization, I'll implement batch removal that relies on the actual block size on disk.

kolesnikovae · 2023-05-10T10:43:59Z

Thank you so much for the review and help with testing @simonswine!

* feat: support for global size-based retention policy * feat: use manager for ingester subservices * add querier block eviction test * decouple retention enforcer and ingester

feat: support for global size-based retention policy

4523e43

kolesnikovae commented Apr 14, 2023

View reviewed changes

pkg/ingester/retention_test.go Outdated Show resolved Hide resolved

feat: use manager for ingester subservices

796228c

kolesnikovae commented Apr 14, 2023

View reviewed changes

pkg/phlaredb/phlaredb.go Outdated Show resolved Hide resolved

kolesnikovae added 3 commits May 9, 2023 12:00

Merge remote-tracking branch 'phlare/main' into feat/size-based-reten…

b3b52b1

…tion-policy-enforcement

add querier block eviction test

798805a

decouple retention enforcer and ingester

0d8b636

kolesnikovae force-pushed the feat/size-based-retention-policy-enforcement branch from d0dc191 to 0d8b636 Compare May 9, 2023 16:10

kolesnikovae marked this pull request as ready for review May 9, 2023 16:26

kolesnikovae requested a review from simonswine May 9, 2023 16:26

kolesnikovae commented May 9, 2023

View reviewed changes

simonswine approved these changes May 10, 2023

View reviewed changes

kolesnikovae changed the title ~~feat: support for global size-based retention policy~~ Add support for global size-based retention policy May 11, 2023

kolesnikovae merged commit 86b115f into grafana:main May 12, 2023

kolesnikovae mentioned this pull request May 18, 2023

Fix queries and storage block operations synchronisation #699

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for global size-based retention policy #628

Add support for global size-based retention policy #628

Uh oh!

kolesnikovae commented Apr 14, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

kolesnikovae Apr 14, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

kolesnikovae May 9, 2023 •

edited

Loading

Uh oh!

simonswine May 10, 2023

Uh oh!

simonswine left a comment •

edited

Loading

Uh oh!

simonswine May 10, 2023 •

edited

Loading

Uh oh!

kolesnikovae May 10, 2023

Uh oh!

kolesnikovae commented May 10, 2023

Uh oh!

Uh oh!

Add support for global size-based retention policy #628

Add support for global size-based retention policy #628

Uh oh!

Conversation

kolesnikovae commented Apr 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kolesnikovae Apr 14, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kolesnikovae May 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

simonswine May 10, 2023

Choose a reason for hiding this comment

Uh oh!

simonswine left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

simonswine May 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kolesnikovae May 10, 2023

Choose a reason for hiding this comment

Uh oh!

kolesnikovae commented May 10, 2023

Uh oh!

Uh oh!

kolesnikovae commented Apr 14, 2023 •

edited

Loading

kolesnikovae Apr 14, 2023 •

edited

Loading

kolesnikovae May 9, 2023 •

edited

Loading

simonswine left a comment •

edited

Loading

simonswine May 10, 2023 •

edited

Loading