-
Apologies if this is in the docs or just obvious to most... I've checked the docs, but didn't find an explicit answer. I'm confused about the best practice with repositories and archives. If I have multiple sets of data on the same machine that I want to back up, is there any reason that I can't/shouldn't create uniquely named archives for each of the sets of data, and store them all in the same repository? Every example I've found shows the same data backed up in each archive in a repository. There's nothing stopping me from having everything in one archive, or separate repositories for each set of data (though I believe this means I'd lose dedup), just curious if this would work or not? |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 1 reply
-
Multiple different archives are fine. Do use different prefixes for their names so you can properly match them in operations. Only possible downside is that there is operations like In your case, which AIUI is disjoint sets of data, it'd depend on amount of sets and backup frequency - 3 sets of daily backups wouldn't be something to be worried about, 10 sets being backed up hourly and all to be kept for years would get you into a situation where a Sidenote: The recommended approach to work around maximum archive size limit (the thnig that shows 0-1% for almost everyone 😉) is to split data across two (or more) archives. |
Beta Was this translation helpful? Give feedback.
-
"Do use different prefixes for their names so you can properly match them in operations." Just re-emphasizing this: if you use multiple backup jobs, backing up different sets of input data, it is pretty much required that you use different archive names prefixes and |
Beta Was this translation helpful? Give feedback.
-
While it works to have different data sets in the same repo, unless you expect there to be a big savings due to deduplication, I would strongly recommend putting different data sets in different repos. This has various advantages. If a repo is corrupted, you don't lose everything. You can move one of the repos to another disk if space becomes an issue. Various operations on repos will be faster and use a lot less memory. Etc. That said, if there is a lot of overlap in the data sets, then putting them in the same repo can make sense. |
Beta Was this translation helpful? Give feedback.
Multiple different archives are fine. Do use different prefixes for their names so you can properly match them in operations.
The can overlap as well, due to the beaty of deduplication.
I e.g. have hostname-set-isodate style names and examples for set are sysconfig (/etc + select thisgs from /var), src (sourcecode from my
$HOME
), Maildir, which e.g. can run more frequently, and home and system (/, ie. everything) to run daily. Thus e.g. home includes Maildir and src and system includes everything at the cost of the few kB it takes to reference the existing chunks.While nowadays leaning towards never ever pruning, the idea to have e.g. both home and syntem was to at some point (e.g. 10+ y…