Conversation

@Borgvall commented Nov 22, 2025

Description

I really like the Bottles application for managing wine prefixes. However, it bothers me that I cannot create bottles as btrfs subvolumes, which I want for my backup solution so that I can back up and restore bottles independently. I also tried creating the subvolume in place before creating a bottle, but Bottles appends a random number to the path if the bottle directory already exists.

I went a bit over the top and kept implementing until I could create and restore bottle snapshots from the Bottles GUI. With all updates added to this PR, I think this is ready to be merged.

This is a rework of #3420, which cannot be reopened for GitHub reasons. Compared to that PR, the commits have been rebased on bottlesdev main, the flatpak module btrfs-progs has been reworked and updated to the current version, and one small bug fix has been added.

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

  • btrfsutil calls also work inside the Flatpak sandbox
  • Bottles are created as btrfs subvolumes if the filesystem is btrfs.
  • Creating snapshots and resetting to them using btrfs snapshots.
  • All snapshots have meaningful timestamps
  • The "active" state is marked
  • Duplicated bottles are created as subvolumes. This can be used to "import" existing bottles into btrfs snapshot-based versioning
  • A duplicated bottle whose source bottle is a subvolume is created as a lightweight snapshot bottle
  • Deleting a bottle deletes the associated btrfs snapshots
  • If the bottle is not a subvolume or has pre-created FVS states, it falls back to the FVS system. (I encountered a critical bug, [Bug]: Snapshot newer as the active state can not be restored #3416, which affects the release version, too)
  • TODO: regression testing on non-btrfs filesystems. (Partially done; currently blocked by a UI issue, see Versioning based on btrfs snapshots #4221 (comment))

@orowith2os (Contributor)

I believe the plan is to use something similar to ostree or git for snapshots, not anything tied to a specific filesystem.

@Borgvall (Author)

Neither proposal would work:

OSTree is designed to provide read-only filesystem trees. This is unsuitable for wine prefixes.

Git is designed for text file repositories and doesn't scale well with large or numerous binary files.

@evertonstz (Contributor) commented Nov 27, 2025

I believe the plan is to use something similar to ostree or git for snapshots, not anything tied to a specific filesystem.

Honestly, this is a pretty sophisticated solution: btrfs is the default on a number of modern distros, the snapshot feature is mature and stable, it uses copy-on-write to keep snapshots lightweight, and it is very safe thanks to guaranteed atomicity. This implementation could even be extended to support prefix deduplication. I also think we should have a fallback for other filesystems (maybe rsync could be a good fit; I have seen other projects use it as a fallback for filesystem snapshots, though it isn't atomic).

Like the OP said, Git would not be a good fit for this. I am not sure about OSTree, though; do you have anything about it I can take a look at? Maybe we could design the feature around interfaces that these multiple backends could implement.

@Borgvall (Author)

Hi all,

This PR already lays the foundation for supporting multiple versioning backends. Currently, it implements btrfs and the existing FVS versioning, but the architecture could easily extend to other systems like XFS, ZFS, or potentially even OSTree (?) in the future.

One particularly useful enhancement this enables is the ability to delete specific snapshots - something that, to my knowledge, isn't possible with the current FVS system, but works seamlessly with btrfs snapshots. At the moment it isn't provided via the GUI. It would be a follow-up.

As it stands, this PR delivers significant benefits for btrfs users: reliable, lightweight snapshots and efficient bottle duplication through copy-on-write.

I'd be particularly interested to hear from @mirkobrombin, who showed interest in the original PR implementation.

@evertonstz (Contributor)

Hi all,

This PR already lays the foundation for supporting multiple versioning backends. Currently, it implements btrfs and the existing FVS versioning, but the architecture could easily extend to other systems like XFS, ZFS, or potentially even OSTree (?) in the future.

One particularly useful enhancement this enables is the ability to delete specific snapshots - something that, to my knowledge, isn't possible with the current FVS system, but works seamlessly with btrfs snapshots. At the moment it isn't provided via the GUI. It would be a follow-up.

As it stands, this PR delivers significant benefits for btrfs users: reliable, lightweight snapshots and efficient bottle duplication through copy-on-write.

I'd be particularly interested to hear from @mirkobrombin, who showed interest in the original PR implementation.

Yes, I think the PR is solid for BTRFS and the current FVS versioning, but I think we could actually improve the architecture itself (could be in a future PR ofc).

I have been thinking about this for a couple of days and taking notes as I reach conclusions (PS: I take my notes via voice and summarize everything via AI, so ignore it if this sounds too formal or something like that; I am a Portuguese speaker, so the robot is not very good with tone). Here's a practical way to make snapshots in Bottles easier to extend beyond Btrfs without scattering filesystem logic all over the app. The idea is simple: one clean interface for versioning, small backend implementations for each filesystem, and a registry that picks the right one at runtime. This keeps the UI predictable, reduces risk, and makes future backends (ZFS, LVM, etc.) straightforward.

What we’re aiming for

  • Put all filesystem-specific code behind a single interface.
  • Have each backend tell us what it can do (capabilities), so the UI never guesses.
  • Use consistent snapshot metadata and error types across backends.
  • Always have a safe fallback (FVS), and make migrations (e.g., directory → subvolume) a first-class thing.
  • Keep bottle lifecycle code simple: resolve a backend, call methods, done.

The building blocks

  • VersioningBackend (the interface every backend implements):

    • backend_type() → short name like "btrfs", "zfs", "fvs"
    • capabilities() → what features are supported
    • ensure_bottle_initialized() → prep the bottle for this backend
    • create_snapshot(label?, read_only?) → returns SnapshotMetadata
    • list_snapshots() → returns normalized snapshot list
    • restore_snapshot(snapshot_id)
    • delete_snapshot(snapshot_id)
    • mark_active(snapshot_id) → optional; raise NotSupportedError if not supported
    • duplicate_bottle(target_path, mode=FULL_COPY|SNAPSHOT_CLONE)
    • cleanup_on_bottle_delete()
  • BackendCapabilities (tiny feature matrix):

    • supports_managed_container
    • supports_read_only_snapshots
    • supports_writable_snapshots
    • supports_active_marker
    • supports_streaming
    • supports_inplace_conversion
  • SnapshotMetadata (one shape for all backends):

    • id, label?, created_at, read_only, is_active, parent_id?, backend_type
  • Errors (shared and simple):

    • VersioningError → operation failed
    • NotSupportedError → backend doesn’t support that feature
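
A rough Python sketch of what this base module could look like (the names follow the list above; the exact signatures and fields are just my assumptions, not actual Bottles code):

```python
# Hypothetical sketch of the proposed base module; not code from the PR.
from abc import ABC, abstractmethod
from dataclasses import dataclass
from datetime import datetime
from enum import Enum, auto


class DuplicateMode(Enum):
    FULL_COPY = auto()
    SNAPSHOT_CLONE = auto()


class VersioningError(Exception):
    """An operation failed."""


class NotSupportedError(VersioningError):
    """The backend does not support the requested feature."""


@dataclass(frozen=True)
class BackendCapabilities:
    supports_managed_container: bool = False
    supports_read_only_snapshots: bool = False
    supports_writable_snapshots: bool = False
    supports_active_marker: bool = False
    supports_streaming: bool = False
    supports_inplace_conversion: bool = False


@dataclass(frozen=True)
class SnapshotMetadata:
    id: str
    created_at: datetime
    read_only: bool
    is_active: bool
    backend_type: str
    label: str | None = None
    parent_id: str | None = None


class VersioningBackend(ABC):
    """One clean interface; filesystem specifics live in the subclasses."""

    def __init__(self, bottle_path: str):
        self.bottle_path = bottle_path

    @abstractmethod
    def backend_type(self) -> str: ...

    @abstractmethod
    def capabilities(self) -> BackendCapabilities: ...

    @abstractmethod
    def ensure_bottle_initialized(self) -> None: ...

    @abstractmethod
    def create_snapshot(self, label: str | None = None,
                        read_only: bool = True) -> SnapshotMetadata: ...

    @abstractmethod
    def list_snapshots(self) -> list[SnapshotMetadata]: ...

    @abstractmethod
    def restore_snapshot(self, snapshot_id: str) -> None: ...

    @abstractmethod
    def delete_snapshot(self, snapshot_id: str) -> None: ...

    def mark_active(self, snapshot_id: str) -> None:
        # Optional feature; backends that cannot mark an active snapshot
        # simply keep this default.
        raise NotSupportedError("active marker not supported by this backend")

    @abstractmethod
    def duplicate_bottle(self, target_path: str,
                         mode: DuplicateMode = DuplicateMode.FULL_COPY) -> None: ...

    @abstractmethod
    def cleanup_on_bottle_delete(self) -> None: ...
```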

Choosing the backend

  • BackendRegistry.resolve(bottle_path, prefer_native=True) returns the best backend:
    • If we’re on Btrfs and can manage/convert → use BtrfsBackend
    • Otherwise → use FVSBackend (guaranteed to work everywhere)

This keeps logic out of the UI and bottle lifecycle. The app just asks the registry and delegates.
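
A minimal sketch of the registry, assuming python-btrfsutil is used for detection and that BtrfsBackend/FVSBackend are hypothetical implementations of the interface sketched above:

```python
# Hypothetical registry; BtrfsBackend and FVSBackend are assumed to exist.
class BackendRegistry:
    @staticmethod
    def resolve(bottle_path: str, prefer_native: bool = True) -> "VersioningBackend":
        if prefer_native:
            try:
                import btrfsutil  # may be unavailable, e.g. outside the Flatpak build
                # subvolume_id() succeeds for any path on a btrfs filesystem,
                # so this also covers plain directories we could convert later.
                btrfsutil.subvolume_id(bottle_path)
                return BtrfsBackend(bottle_path)
            except (ImportError, OSError):
                pass  # not on btrfs, or btrfsutil missing: fall back below
        return FVSBackend(bottle_path)  # works on every filesystem
```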

Where it plugs into Bottles

  • On bottle creation/import:
    • backend.ensure_bottle_initialized()
  • For snapshots:
    • create/list/restore/delete
    • mark_active when supported
  • For duplication:
    • duplicate_bottle(target_path, mode), choosing lightweight clone when available
  • On delete:
    • cleanup_on_bottle_delete()

In the UI, call capabilities() to decide which buttons to show. For example, only show “Lightweight clone” when the backend supports writable snapshots and managed containers.
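
Sketched usage, with placeholder names for the lifecycle helpers and the widget (these are illustrations of the delegation idea, not Bottles' real functions):

```python
# Lifecycle code only talks to the registry and the interface.
def init_bottle_versioning(bottle_path: str) -> "VersioningBackend":
    backend = BackendRegistry.resolve(bottle_path)
    backend.ensure_bottle_initialized()
    return backend


def duplicate(backend: "VersioningBackend", target_path: str) -> None:
    caps = backend.capabilities()
    # Prefer a lightweight clone when the backend supports it.
    mode = (DuplicateMode.SNAPSHOT_CLONE
            if caps.supports_writable_snapshots and caps.supports_managed_container
            else DuplicateMode.FULL_COPY)
    backend.duplicate_bottle(target_path, mode=mode)


def update_versioning_ui(backend: "VersioningBackend", clone_button) -> None:
    caps = backend.capabilities()
    # Only show "Lightweight clone" when the backend can actually provide it.
    clone_button.set_visible(caps.supports_writable_snapshots
                             and caps.supports_managed_container)
```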

How to roll this out

  1. Add the base module:

    • VersioningBackend interface
    • BackendCapabilities, SnapshotMetadata
    • DuplicateMode enum and the shared errors
  2. Move Btrfs logic into BtrfsBackend:

    • Detect filesystem and subvolume robustly (prefer libbtrfsutil when available in Flatpak)
    • ensure_bottle_initialized(): convert directory → subvolume if allowed
    • Implement snapshot/restore/delete/duplicate using btrfs subvolume tools
    • Decide on an “active” marker (file/symlink) if the UI needs it
  3. Wrap the existing FVS code in FVSBackend:

    • Keep current behavior for snapshot/restore/delete/mark_active
    • duplicate_bottle: support FULL_COPY only; raise NotSupportedError for SNAPSHOT_CLONE
  4. Add BackendRegistry and update bottle lifecycle to use it.

  5. UI and settings:

    • Drive features from capabilities()
    • Offer migration options when supported (e.g., “Convert to Btrfs subvolume”)
  6. Tests:

    • Per-backend: capabilities, snapshot lifecycle, duplicate, cleanup
    • Fallback on non-Btrfs filesystems
    • Flatpak sandbox behavior (btrfsutil availability)
    • Regression coverage for existing FVS

Notes for the current Btrfs PR

  • Keep all Btrfs commands and logic inside BtrfsBackend; don’t leak them into the UI or lifecycle.
  • Use real subvolume metadata for timestamps rather than “now”.
  • Where operations need multiple steps, consider a small transactional helper to avoid partial changes.
  • Document how “active” is determined so the UI can reflect it consistently.
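
On the timestamp note: if libbtrfsutil is available, the snapshot's creation time could be read straight from subvolume metadata. A hedged sketch, assuming python-btrfsutil's subvolume_info() exposes the otime field (creation time in seconds since the epoch):

```python
# Hedged sketch: read creation time from btrfs metadata instead of dir mtime.
from datetime import datetime, timezone

import btrfsutil


def snapshot_created_at(snapshot_path: str) -> datetime:
    info = btrfsutil.subvolume_info(snapshot_path)
    return datetime.fromtimestamp(info.otime, tz=timezone.utc)
```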

Why this is worth it

  • Clean boundaries and cleaner code.
  • Easy to add new backends without touching the UI.
  • Fewer edge cases: the UI only exposes what’s supported.
  • Safer behavior and a consistent user experience across filesystems.

This approach respects the work in the Btrfs PR, gives us a solid foundation for more filesystems, and keeps Bottles maintainable as we grow.

@Borgvall (Author)

Hi,

I have tested the fallback on non-btrfs filesystems using an ext4 loop mount, except for snapshot restoring: the GUI does not show the created snapshots. This happens for me both with my development Flatpak and the official 60.1 Flatpak from Flathub. Can anyone please check whether it's a local problem or a general issue?

@evertonstz about your notes for the current PR:

Keep all Btrfs commands and logic inside BtrfsBackend; don’t leak them into the UI or lifecycle.

At the moment all work is delegated to the model.btrfssubvolume Python module

Use real subvolume metadata for timestamps rather than “now”.

This is already done, isn't it? The modification time of the snapshot directory is the creation time of the snapshot.

Where operations need multiple steps, consider a small transactional helper to avoid partial changes.

Is this a Python feature? Can you point me to some documentation?

Document how “active” is determined so the UI can reflect it consistently.

Should I add the documentation to the versioning manager?

@evertonstz (Contributor)

Where operations need multiple steps, consider a small transactional helper to avoid partial changes.

Is this a Python feature? Can you point me to some documentation?

Not a built-in magic feature; it's a pattern. The goal is to group multi-step filesystem changes so you can roll them back if something fails midway. You could do this in multiple ways, e.g. logging your steps in a journal and triggering a recovery that undoes those steps in case of a failure (or a full recovery in case of a crash). A lifecycle with SQLite would look something like this (this is backend agnostic, by the way; it would work for ZFS, Btrfs, etc.):

  1. Begin Transaction

    • Insert row into transactions:
      • state = 'PENDING'
      • started_at = now, updated_at = now
    • Insert each planned step into steps with:
      • status = 'PENDING'
      • step_order (0-based)
      • details_json (paths, IDs, etc.)
  2. Execute Steps (Forward Phase)
    For each step in step_order:

    • Update step status = 'IN_PROGRESS'
    • (Optional) set transaction state = 'APPLYING' if still PENDING
    • Perform the filesystem action (create subvolume, rsync, rename, etc.)
    • On success: update step status = 'DONE'
    • On failure:
      • Update step status = 'FAILED'
      • Update transaction state = 'ROLLING_BACK'
      • Record error message in transactions.error
      • Jump to Rollback Phase
  3. Rollback Phase (if any step failed)

    • Iterate previously DONE steps in reverse order.
    • For each, run its compensating (rollback) action.
      • Rollbacks should be idempotent (ignore missing targets).
    • After all possible rollbacks:
      • Update transaction state = 'ABORTED'
      • Optionally store a final error message / reason.
    • End (no commit).
  4. Commit Phase (only if all steps reached DONE)

    • Update transaction state = 'COMMITTING'
    • Perform any final atomic action (e.g., rename staging → live, delete backups).
    • Validate final state (paths exist, metadata intact).
    • Update transaction state = 'COMMITTED'
  5. Cleanup

    • Release any locks (file lock, advisory lock).
    • Optionally prune or archive old committed/aborted entries (housekeeping task).
  6. Crash Recovery (on next startup)

    • Query transactions where state IN ('PENDING','APPLYING','COMMITTING','ROLLING_BACK')
    • For each:
      • Load ordered steps and their statuses.
      • If state = 'ROLLING_BACK':
        • Ensure rollback finished (redo reverse rollback for any remaining DONE steps).
        • Set to ABORTED.
      • Else if all steps status = 'DONE':
        • Finalize commit actions (if not already done).
        • Set state = 'COMMITTED'.
      • Else:
        • Perform rollback of all DONE steps (reverse order).
        • Set state = 'ABORTED'.
    • Log outcomes for debugging/audit.
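
A compact sketch of the forward and rollback phases from the list above, trimmed down to an illustrative helper (the full schema, state machine, locking, and crash recovery are omitted; names are mine, not from the PR):

```python
# Hedged sketch of the journal pattern: run steps forward, undo on failure.
import sqlite3
from typing import Callable

Step = tuple[str, Callable[[], None], Callable[[], None]]  # name, do, undo


def run_transaction(db_path: str, steps: list[Step]) -> None:
    con = sqlite3.connect(db_path)
    con.execute("CREATE TABLE IF NOT EXISTS steps (name TEXT, status TEXT)")
    done: list[Step] = []
    try:
        for name, do, undo in steps:
            con.execute("INSERT INTO steps VALUES (?, 'IN_PROGRESS')", (name,))
            con.commit()
            do()  # the filesystem action (create subvolume, rsync, rename, ...)
            done.append((name, do, undo))
            con.execute("UPDATE steps SET status='DONE' WHERE name=?", (name,))
            con.commit()
    except Exception:
        # Rollback phase: undo completed steps in reverse order.
        for name, _do, undo in reversed(done):
            try:
                undo()  # compensating action; should be idempotent
            except OSError:
                pass    # ignore missing targets
        raise
    finally:
        con.close()
```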

# Try to delete the subvolume as a normal directory tree. This is
# in particular needed, if the btrfs filesystem is not mounted with
# 'user_subvol_rm_allowed' option.
shutil.rmtree(path)
Author

@evertonstz Please describe how you would roll back a shutil.rmtree() that fails in the middle, in your transaction proposal.

Contributor

The journaling system is not specifically about rolling back transactions; it's just there to keep tabs on what has already been performed on the filesystem. With that information you can decide what to do next, e.g. roll back, resume, etc.

# Internal subvolumes created at initialization time:
_internal_subvolumes = [
"cache",
]
Author

I'm considering getting rid of _internal_subvolumes. The idea is to save some disk space when creating snapshots, but it significantly increases the complexity of the code and makes it prone to partially performed operations, whose proper handling is complex. I'm not sure, but it might have inspired @evertonstz to propose a semi-transactional system, which seems like overkill to me. Filesystems are not ACID databases.

Contributor

Agreed—filesystems aren’t ACID databases, so we shouldn’t build heavyweight transactional machinery. Still, it’s worth adding a light safety net around the few risky operations.

My proposal for minimal crash safety: before a risky operation, write a tiny intent file (e.g. .versioning-intent.json containing { "op": "restore", "target": "" }), and delete it on success. On startup, if the file still exists, we can either auto-complete the operation, revert it, or at least surface a clear message to the user. This isn't a transaction system, just a breadcrumb.

Why this matters: an orphan/rogue subvolume left behind after a failed swap still consumes disk space (even with CoW it retains its extents) and clutters the layout. It’s trivial for us to delete, but a user with little filesystem knowledge may not even know it exists. A tiny intent marker prevents silent leftovers and improves trust without adding complexity. It's a win-win IMO.
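
A rough sketch of such a breadcrumb; the helper and file name here are only illustrative, not code from the PR:

```python
# Hedged sketch: leave an intent marker around a risky operation and remove
# it only on success, so a leftover file signals an interrupted operation.
import json
import os
from contextlib import contextmanager


@contextmanager
def versioning_intent(bottle_path: str, op: str, target: str):
    intent_path = os.path.join(bottle_path, ".versioning-intent.json")
    with open(intent_path, "w") as f:
        json.dump({"op": op, "target": target}, f)
        f.flush()
        os.fsync(f.fileno())  # make the breadcrumb survive a crash
    yield
    # Reached only when the wrapped operation succeeded.
    os.unlink(intent_path)


# Usage (illustrative):
# with versioning_intent(bottle_path, "restore", snapshot_id):
#     backend.restore_snapshot(snapshot_id)
```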

Also, keep in mind I'm just a contributor, nothing I proposed here is needed for your PR to be merged, you just need to convince the maintainer. I might be a little picky about security layers, but on scale corner cases will happen.

@Borgvall (Author), Dec 3, 2025

I definitely need some feedback from a maintainer:

  • Shall I remove the internal subvolumes in order to keep the code simple? I have done it in commit e4f90bc, but I can drop or revert it.
  • The idea of "intent files", to make broader filesystem changes more robust against crashes or power outages, seems very good. AFAIK nothing in this regard is implemented in Bottles. In my opinion it should be implemented as an extension of the Task system; that would be out of scope for this PR.

Member

Agreed—filesystems aren’t ACID databases, so we shouldn’t build heavyweight transactional machinery. Still, it’s worth adding a light safety net around the few risky operations.

My proposal for minimal crash safety: before a risky operation, write a tiny intent file (e.g. .versioning-intent.json containing { "op": "restore", "target": "" }), and delete it on success. On startup, if the file still exists, we can either auto-complete the operation, revert it, or at least surface a clear message to the user. This isn't a transaction system, just a breadcrumb.

Why this matters: an orphan/rogue subvolume left behind after a failed swap still consumes disk space (even with CoW it retains its extents) and clutters the layout. It’s trivial for us to delete, but a user with little filesystem knowledge may not even know it exists. A tiny intent marker prevents silent leftovers and improves trust without adding complexity. It's a win-win IMO.

Also, keep in mind I'm just a contributor, nothing I proposed here is needed for your PR to be merged, you just need to convince the maintainer. I might be a little picky about security layers, but on scale corner cases will happen.

I definitely agree with this

This simplifies the code significantly. The only downside is that the snapshots will consume a bit more disk space.