Prune by either int or interval for all retention policies #8775

Goddesen · 2025-04-19T19:50:52Z

This PR adds optional interval handling for all retention filter flags of the prune command, previously only available on --keep-within. E.g. prune --keep-hourly=7d will keep hourly archives for the last 7 days regardless of count. Adds --keep which acts as a combination of --keep-last and --keep-within.

Implements #8637.

I opted to make the existing flags handle both ints and intervals instead of adding new flags as there are already so many flags for this command. Simplified some prune filtering: With the default filter function now also handling intervals, prune_within is no longer needed as a special case.

I added a library to freeze time in testing, let me know if that's not wanted and I'll figure out something else. It's such a hassle to deal with timestamps relative to now() in test. The tests should be fairly comprehensive in checking both their timely filter (hourly/yearly, etc.) and the new inclusive timestamp check. I did not add new helper tests for prune_split as this function is not used anywhere other than prune_cmd and isn't really a helper.

TODOs:

New complicated example using intervals, showing how they overlap and interact with simple ints. Test that confirms this example (currently no test at all for overlapping intervals).
Docs once exact implementation is decided.

TODOs from comments:

Figure out default values and their interaction with the must pass at least one flag check.

Notes:

New retention flag --keep: Merges functionality of --keep-within and --keep-last with new int-or-interval handling.
--keep-last is no longer an alias of --keep-secondly and thus keeps archives made on the same second. It now fits together with --keep-within and new flag --keep.
Intervals now support 0 seconds. While working with int_or_interval and creating timedelta objects it seemed weird to restrict input like this. Not extremely useful unless you want to prune on archives in the same second as they were created, but it seemed logical when setting up tests to verify new --keep-last behavior.. Technically breaking, but will likely not meaningfully affect any real scenarios.
Timestamp comparisons for retention intervals are now inclusive on seconds. Matches (my personal) human intuition: If it is currently xx:xx:10 and there's an archive at xx:xx:05 then --keep-within 5S should cover that archive. Easy change to make when already altering the filtering logic. Technically breaking, but will likely not meaningfully affect any real scenarios.

This has been a pet peeve of mine in the pruning command for a long time. In my mind the most clean backup regime keeps all backups for a short time (allows catching small errors quickly), then hourly backups for a reasonable time (say, 7 days), then daily backups for a little longer and then finally weekly/monthly as storage permits. This was not previously possible, requiring for example --keep-within 7d --keep-hourly 168 --keep-daily 14 --keep-weekly -1 for an approximation. But keeping around 168 archives for a machine that's only running a few hours a day seems mighty excessive. So here we are :)
With this implemented my ideal retention for my primary laptop with archives every 15m looks something like --keep 3d --keep-hourly 7d --keep-daily 30d --keep-weekly -1.

Page not found · GitHub · GitHub

404 “This is not the web page you are looking for”

Contact Support — GitHub Status — @githubstatus

Accepts either int or interval, first tries parsing int then tries parsing as interval if that fails. Returns a timedelta for easy date math later. Now allows intervals of length 0 as a 0-length timedelta is perfectly fine to work with.

Support is added for setting prune retention with either an int (keep n archives) or an interval (keep within). This works much like --keep-within currently does, but extends support to all retention filters. Additionally adds a generic --keep flag to take over (or live alongside) both --keep-last and --keep-within. --keep-last is no longer an alias of --keep-secondly, now keeps archives made on the same second. Timestamp comparison is made inclusive, 'within 5 seconds' now means 'current second and the preceding 5 seconds' to match intuitive understanding of intervals in the past. Comparisons against archive timestamp are made to use local timezone instead of UTC. Should be equal result in practice, but allows for easier testing with frozen local time.

Default with tzinfo=None is local timezone anyway, no need to set it manually.

No more fetching new now() timestamp for every check. Hard to test for this, but could have once in a blue moon caused a missed archive due to the timestamp changing during iteration.

PhrozenByte · 2025-04-19T20:38:26Z

This is loosely related to #8715 as well.

I like the idea very much (I didn't and can't review the code, but I'm happy to help with the docs if desired). Borg's current retention policy IMHO is rather hard to understand for beginners (but very safe, because it usually keeps more than what users expect) and an interval-based approach feels more intuitive to me. However, special care is necessary to not cause unexpected behaviour, especially with frequent backups (e.g. daily), combined with overlapping rules (e.g. including all of within, daily, weekly, monthly, and yearly), and some missing backups (and thus the need to sometimes keep the "next best" to later fulfil less frequent rules). As far as I remember there were some issues with this in the early days of Borg that at least caused multiple major docs overhauls (not sure whether the behaviour actually changed substantially, but the docs definitely did because many users understood things wrong). Maybe Thomas can give some insights about the challenges back then?

src/borg/archiver/prune_cmd.py

Additionally simplifies the break-out logic and moves it up front for readability.

Additionally changes name of default func to be more descriptive.

src/borg/archiver/prune_cmd.py

ThomasWaldmann

Thanks for the PR!

Some minor stuff I found...

src/borg/archiver/prune_cmd.py

Puts imports back on just two lines.

Uses itertools counter for more descriptive code.

Since timestamp is not used when given just int to retain, can pass None for simpler code.

Sets UTC timezone on MockArchives in helpers_test Is more true to how real archive timestamps are instantiated, as per `parse_timestamp` in time.py, which defaults to overriding tzinfo with timezone.utc.

Can simply replace tzinfo with timezone.utc to create zoned datetime.min without underflow issues.

Goddesen · 2025-04-21T13:18:49Z

It seems the GHA test runner did not like the previous tzinfo=None change in helpers_test.py. Replaced with an implementation in MockArchive more alike to the one for real archives, let's see if that plays nicely.

PhrozenByte · 2025-04-21T14:00:28Z

New retention flag --keep: Merges functionality of --keep-within and --keep-last with new int-or-interval handling.

I'd vote for removing --keep-within and --keep-last in favour of the new --keep option with Borg 2.0 for consistency with other --keep-* options. If this gets backported to Borg 1.x, they should be deprecated. WDYT?

Timestamp comparisons for retention intervals are now inclusive on seconds. Matches (my personal) human intuition: If it is currently xx:xx:10 and there's an archive at xx:xx:05 then --keep-within 5S should cover that archive. Easy change to make when already altering the filtering logic.

IMHO this must be consistent with #8776. Question is what we agree on. I honestly believe differently: If it is xx:xx:10 now and what to cover the last five seconds, that's exclusive the xx:xx:05 second, because I also count archives that are created at xx:xx:10 (i.e. seconds 10, 9, 8, 7, and 6). WDYT?

src/borg/archiver/prune_cmd.py

ThomasWaldmann · 2025-04-21T15:33:29Z

If you want to be able to backport this to 1.4-maint, you must not break compatibility in this PR, but deprecating some options that can be replaced by a better new option can be done here.

codecov · 2025-04-21T15:41:31Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 81.94%. Comparing base (df2bb8c) to head (a87a0d9).
Report is 27 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #8775      +/-   ##
==========================================
+ Coverage   81.89%   81.94%   +0.04%     
==========================================
  Files          74       74              
  Lines       13324    13351      +27     
  Branches     1968     1971       +3     
==========================================
+ Hits        10912    10940      +28     
+ Misses       1750     1746       -4     
- Partials      662      665       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Goddesen · 2025-04-21T21:28:03Z

If you want to be able to backport this to 1.4-maint, you must not break compatibility in this PR, but deprecating some options that can be replaced by a better new option can be done here.

How do you do separation of functionality like this when a breaking 2.0 change is to be backported with deprecations instead? The code might turn out very different, especially with changes introduced on master.

Fixes bug where changed is-atleast-one-flag-given check didn't work. Adds a test to verify.

Goddesen · 2025-04-21T22:13:31Z

IMHO this must be consistent with #8776. Question is what we agree on. I honestly believe differently: If it is xx:xx:10 now and what to cover the last five seconds, that's exclusive the xx:xx:05 second, because I also count archives that are created at xx:xx:10 (i.e. seconds 10, 9, 8, 7, and 6). WDYT?

I don't think #8776 is relevant, as it matches absolute timestamps based on a pattern (like how the prune period grouping functions work) and does not deal with relative intervals where this consideration is needed. I am not familiar with the code being touched in that PR but I don't think there is overlap with the work here.

I had a whole comment here written out defending the inclusive check based on comparing timestamps only using seconds-granularity. In that scenario this makes sense. Having taken a closer look I see that archive.save explicitly saves timestamps with microseconds and the second-precision timestamps I have been seeing have all been test values. Good thing I checked. In that case the change from > to >= only makes for one microsecond of difference. I'm comfortable reverting this.

Adds optional interval support for all prune retention flags Support is added for setting prune retention with either an int (keep n archives) or an interval (keep within). This works much like --keep-within currently does, but extends support to all retention filters. Additionally adds a generic --keep flag to take over (or live alongside) both --keep-last and --keep-within. --keep-last is no longer an alias of --keep-secondly, now keeps archives made on the same second. Comparisons against archive timestamp are made to use local timezone instead of UTC. Should be equal result in practice, but allows for easier testing with frozen local time.

Goddesen · 2025-04-24T19:34:13Z

Comments fixed, inclusive timestamp change reverted with tests made slightly more robust. If that's it for implementation comments I'll extend the documentation with an involved example like the retention policy I have described in the PR description and write a test for it.

I am still not sure how to go about doing breaking change on 2.0 and backporting change with deprecations to 1.*. @ThomasWaldmann ?

ThomasWaldmann · 2025-04-24T19:40:39Z

Easiest is to backport a non-breaking change and then do the breaking change in a separate PR afterwards (and not backport that).

There are some code structure differences between master and 1.4-maint though, so even backporting the first PR might not be trivial. As 1.4.x is a stable release, we need to be very careful to not break anything there. That sometimes can mean "no backport" if the change is too big or too risky.

src/borg/archiver/prune_cmd.py

Goddesen · 2025-04-24T20:03:32Z

Last explicit default removed.

There are some code structure differences between master and 1.4-maint though, so even backporting the first PR might not be trivial. As 1.4.x is a stable release, we need to be very careful to not break anything there. That sometimes can mean "no backport" if the change is too big or too risky.

The changiest change introduced here is allowing intervals to be of length 0. Since this is an extension of functionality I think it's a fairly safe backport.

Am I correct in assuming I just put entries for --keep-within and --keep-last in deprecations in preprocess_args in archiver/__init__.py?

PhrozenByte · 2025-04-24T20:06:38Z

In regards to backporting this to 1.4-maint and breaking changes:

I kinda feel like I accidentally caused some confusion, so: Do we even have breaking changes right now?

The only "breaking" change is that --keep-within 0 works now. Since that is undocumented anyway, I personally wouldn't consider that a breaking change. The only breaking change would be to drop --keep-within and --keep-last in favour of the new --keep. I still strongly vote for doing that. However, just for 2.0: We can safely keep them for 1.4 and mark them as deprecated. They're just aliases for --keep with this PR, but don't really change functionality, do they?

Goddesen · 2025-04-24T20:15:42Z

The only "breaking" change is that --keep-within 0 works now.

Ah I forgot there is one more thing: --keep-last is no longer an alias for --keep-secondly. A little more breaky than the interval change I think, but I struggle to believe this behavior is heavily relied upon in the wild? That is, unless you read the docs about secondly pruning and then decide to just use --keep-last for some reason.

But if you want, it's pretty easy to just not introduce that change now, and do it in the breaking PR instead.

PhrozenByte · 2025-04-25T11:18:54Z

Ah I forgot there is one more thing: --keep-last is no longer an alias for --keep-secondly.

There's only a difference if someone actually managed to create multiple archives within the same second, right? I'd say that's similar to 0 intervals: Sure, it's a breaking change in theory, but in practice?

Anyway, I don't think we really have a problem here. If we decide on dropping --keep-last (and --keep-within for this matter) with Borg 2.x anyway, we can simply let --keep-last stay an alias for --keep-secondly (whose behaviour hasn't changed, right?) with Borg 1.4. Users that want to try the "true" and new "last" with Borg 1.x can use the new --keep. Same for 0 intervals: Adding a check and explicitly rejecting 0 intervals with just Borg 1.4 isn't that much of a big deal, is it?

Not for me to decide, but I suggest dealing with that stuff in the backport PR.

I don't think #8776 is relevant, as it matches absolute timestamps based on a pattern (like how the prune period grouping functions work) and does not deal with relative intervals where this consideration is needed.

It does: It can also match all archives within e.g. a full month relative to "now" (and added just recently, also relative to arbitrary other timestamps). However, I agree that there's no practical difference due to the microseconds scale. I didn't consider that. Yet I'd still vote for consistency. I just skimmed through @c-herz's work in #8776 and noticed that matching exclusively causes some problems there as well (minor, yet there are some). IMHO the consistency question still somewhat matters, so how about we simply decide on always matching inclusively? I'm thus also tagging @c-herz, WDYT?

Goddesen · 2025-04-28T07:46:48Z

After some consideration I think this new retention interval should also account for the oldest rule that was implemented for the retention count flags. I'll get back to this.

ThomasWaldmann · 2025-05-09T20:11:33Z

before doing further changes, please rebase on current master branch to get the cython workaround. otherwise, builds will fail.

ThomasWaldmann · 2025-05-22T17:26:12Z

Guess I'ld like to merge this for next beta, can we finish this until then?

Goddesen · 2025-05-22T17:47:55Z

I'll endeavor to get something done soon then. Factorio has been a distraction 😄

ThomasWaldmann · 2025-05-23T05:17:19Z

btw, i dissected the borg.testsuite.helpers_test monster module into a borg.testsuite.helpers package with modules corresponding to modules in borg.helpers.

Adds int_or_interval format parser

455e876

Accepts either int or interval, first tries parsing int then tries parsing as interval if that fails. Returns a timedelta for easy date math later. Now allows intervals of length 0 as a 0-length timedelta is perfectly fine to work with.

Goddesen mentioned this pull request Apr 19, 2025

Added --keep-all, as an alias for '--keep-last <inf>', as an option… #7462

Open

Goddesen added 3 commits April 19, 2025 22:05

Removes unnecessarily complicated local timezone in test

746f09b

Default with tzinfo=None is local timezone anyway, no need to set it manually.

Sets timestap for interval comparision just once

9b828f5

No more fetching new now() timestamp for every check. Hard to test for this, but could have once in a blue moon caused a missed archive due to the timestamp changing during iteration.

Goddesen force-pushed the prune-timely-by-interval branch from 9beb1b7 to 9b828f5 Compare April 19, 2025 20:05

jdchristensen reviewed Apr 19, 2025

View reviewed changes

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

Goddesen added 2 commits April 20, 2025 10:52

Breaks out of pruning loop early if interval is passed

3e4570e

Additionally simplifies the break-out logic and moves it up front for readability.

Comments on functionality of period grouping functions

da77550

Additionally changes name of default func to be more descriptive.

Goddesen force-pushed the prune-timely-by-interval branch from 82c9e4e to da77550 Compare April 20, 2025 08:56

jdchristensen reviewed Apr 20, 2025

View reviewed changes

ThomasWaldmann requested changes Apr 20, 2025

View reviewed changes

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

Goddesen added 7 commits April 21, 2025 14:21

fixup! Adds optional interval support for all prune retention flags

f62acd1

Puts imports back on just two lines.

fixup! Adds optional interval support for all prune retention flags

e0b13c0

Uses itertools counter for more descriptive code.

fixup! Comments on functionality of period grouping functions

17fd857

fixup! Breaks out of pruning loop early if interval is passed

71d52d2

fixup! Sets timestap for interval comparision just once

73bdebc

Since timestamp is not used when given just int to retain, can pass None for simpler code.

amend! Removes unnecessarily complicated local timezone in test

ab3d260

Sets UTC timezone on MockArchives in helpers_test Is more true to how real archive timestamps are instantiated, as per `parse_timestamp` in time.py, which defaults to overriding tzinfo with timezone.utc.

fixup! Breaks out of pruning loop early if interval is passed

38fbd43

Can simply replace tzinfo with timezone.utc to create zoned datetime.min without underflow issues.

Goddesen force-pushed the prune-timely-by-interval branch from 64c48ed to 38fbd43 Compare April 21, 2025 13:15

ThomasWaldmann reviewed Apr 21, 2025

View reviewed changes

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

ThomasWaldmann reviewed Apr 21, 2025

View reviewed changes

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

Defaults all prune flags to nothing instead of 0

2e71d32

Fixes bug where changed is-atleast-one-flag-given check didn't work. Adds a test to verify.

Goddesen added 2 commits April 24, 2025 19:42

fixup! Comments on functionality of period grouping functions

42ddb80

ThomasWaldmann reviewed Apr 24, 2025

View reviewed changes

src/borg/archiver/prune_cmd.py Outdated Show resolved Hide resolved

fixup! Defaults all prune flags to nothing instead of 0

a87a0d9

Prune by either int or interval for all retention policies #8775

Are you sure you want to change the base?

Prune by either int or interval for all retention policies #8775

Conversation

Goddesen commented Apr 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PhrozenByte commented Apr 19, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ThomasWaldmann left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Goddesen commented Apr 21, 2025

Uh oh!

PhrozenByte commented Apr 21, 2025

Uh oh!

Uh oh!

Uh oh!

ThomasWaldmann commented Apr 21, 2025

Uh oh!

codecov bot commented Apr 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Goddesen commented Apr 21, 2025

Uh oh!

Goddesen commented Apr 21, 2025

Uh oh!

Goddesen commented Apr 24, 2025

Uh oh!

ThomasWaldmann commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Goddesen commented Apr 24, 2025

Uh oh!

PhrozenByte commented Apr 24, 2025

Uh oh!

Goddesen commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PhrozenByte commented Apr 25, 2025

Uh oh!

Goddesen commented Apr 28, 2025

Uh oh!

ThomasWaldmann commented May 9, 2025

Uh oh!

ThomasWaldmann commented May 22, 2025

Uh oh!

Goddesen commented May 22, 2025

Uh oh!

ThomasWaldmann commented May 23, 2025

Uh oh!

Goddesen commented Apr 19, 2025 •

edited

Loading

codecov bot commented Apr 21, 2025 •

edited

Loading

ThomasWaldmann commented Apr 24, 2025 •

edited

Loading

Goddesen commented Apr 24, 2025 •

edited

Loading