Backup strategy against loss of the complete pool

Yeah, it’s me again :wink:
I am still puzzling over this topic.
Snapshots are fine, but become useless the moment the originating pool gets lost or destroyed, whatever the cause. Replication tasks are based on snapshots.

Rsyncing the pool/dataset comes to mind, but - depending on the schedule - if I have to restore the complete pool, I will have to clear all snapshots and start over on the freshly restored pool and data.

Correct ?

No, and I’m unsure what you’re thinking.
Why would you have to “clear all snapshots”? Do you understand what a snapshot is?

If the primary is lost you can either:

  • Turn the backup into a primary by removing the read-only flag on the target dataset; or
  • Build a new primary, run a one-off replication task to restore from backup and, again, remove the read-only flag.
1 Like

Snapshots on ZFS are based on the changes made on the pool. So snapshots that are younger than the last complete rsync reference changes that haven’t happened… that’s why - imho - those snapshots won’t work anymore.
But I am willing to learn.

No. Nonsense.

A snapshot is a complete record of the content of a dataset at the instant the snapshot was taken. Snapshot creation is an atomic event: either the snapshot exists, and all the data it refers to exists in the pool, or it doesn’t exist; there’s no such thing as a partial snapshot. (And no “snapshot referencing changes that haven’t happened”.)
The snapshot itself is metadata—hence it uses little to no space on disk. But “snapshot replication” copies the snapshot (i.e. metadata) and the data it refers to; until all that data has been received by the destination, the snapshot does not yet exist on the destination. (And you cannot “rsync” a snapshot: you may rsync the data content of a snapshot, but only ZFS replication can copy the snapshot itself, i.e. the metadata.)
Data referred to by a snapshot is immutable. It cannot be removed from the pool as long as the snapshot exists. (The active dataset can be modified, but its previous content is retained in the pool as long as the snapshot exists.)

For your own good, leave rsync to non-ZFS systems, and use only ZFS replication when both source and destination are ZFS. Replication is always (way) faster than rsync.
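
To make the rules above concrete, here is a toy Python model of a dataset with snapshots — not real ZFS code, just a sketch of “a snapshot is metadata” and “snapshotted data cannot be freed” (all names are made up):

```python
# Toy model of ZFS snapshot semantics -- NOT real ZFS internals,
# only an illustration of the rules described above.

class ToyDataset:
    def __init__(self):
        self.blocks = {}       # block_id -> data (the "pool")
        self.active = {}       # filename -> list of block_ids (live view)
        self.snapshots = {}    # snapshot name -> frozen block-pointer table
        self._next_id = 0

    def write(self, name, chunks):
        ids = []
        for chunk in chunks:
            self.blocks[self._next_id] = chunk
            ids.append(self._next_id)
            self._next_id += 1
        self.active[name] = ids

    def snapshot(self, snapname):
        # A snapshot is only metadata: a frozen copy of the pointer table.
        # No data blocks are copied, so it is atomic and near-free.
        self.snapshots[snapname] = {f: list(ids) for f, ids in self.active.items()}

    def delete(self, name):
        # Deleting a file frees its blocks ONLY if no snapshot still
        # references them -- this is why snapshotted data is immutable.
        ids = self.active.pop(name)
        still_referenced = {i for table in self.snapshots.values()
                            for blk_ids in table.values() for i in blk_ids}
        for i in ids:
            if i not in still_referenced:
                del self.blocks[i]


ds = ToyDataset()
ds.write("myfile.dat", ["aaa", "bbb"])
ds.snapshot("tank@monday")
ds.delete("myfile.dat")
print("myfile.dat" in ds.active)    # False -- gone from the active view...
print(ds.snapshots["tank@monday"])  # {'myfile.dat': [0, 1]} -- still pointed at
print(len(ds.blocks))               # 2 -- the data itself is still in the pool
```

Delete the snapshot too and the blocks would finally become reclaimable — exactly the “little lock” behavior described further down.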

7 Likes

Moreover, snapshots are incredibly efficient for backup purposes because they encapsulate all the necessary changes. One of the reasons that rsync can take as long as it does is that it has to exhaustively walk every directory to see what’s different between dataset A and B.

Snapshots basically avoid all that traversal work because they have already captured every change. Thus, the only data transferred by snapshot replication is the changes to the respective datasets. All things being equal, snapshots are the way to go re: backup.
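
A tiny Python sketch of that difference (a made-up model, not ZFS internals): between two snapshots, ZFS already knows the set of new blocks, while rsync must visit every file:

```python
# Two frozen block-pointer tables, as two snapshots would record them.
# Only one block of b.txt was rewritten between Monday and Tuesday.
snap_monday  = {"a.txt": [0, 1], "b.txt": [2, 3], "c.txt": [4]}
snap_tuesday = {"a.txt": [0, 1], "b.txt": [2, 5], "c.txt": [4]}

old_blocks = {i for ids in snap_monday.values() for i in ids}
new_blocks = {i for ids in snap_tuesday.values() for i in ids}

# An incremental send ships only blocks born after the old snapshot:
to_send = new_blocks - old_blocks
print(to_send)  # {5} -- one block, no directory walk needed

# rsync, by contrast, must visit and compare every single file:
files_rsync_must_check = len(snap_tuesday)
print(files_rsync_must_check)  # 3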

The only reason to use rsync is if you want a backup in whatever native file system your home computer OS uses, if it doesn’t understand ZFS. For example, I have used rsync to back up my files to an HFS+-formatted volume in the past. That said, where an exhaustive rsync can take hours, replication will take minutes.

4 Likes

hm … so what you are telling me is: if I create a snapshot, replicate it to my TrueNAS backup server via VPN, my house burns down, all hard drives are destroyed… I take a new PC with blank disks, create a pool, and then I am able to restore all the data from a 170 kB file?!
Sounds a bit … impossible?!

Impossible indeed… because a snapshot is not a file.

A snapshot is a collection of metadata. You cannot “save” or “download” a snapshot as a 170 kB item: it only exists within a ZFS pool, together with all the 2 TB of data it refers to—you can’t have the first without the second.
By the way, today the snapshot may be 170 kB on disk, but tomorrow it might use 2.3 MB, because it referred to an earlier snapshot which has since been deleted, and so the newer snapshot has taken over the relevant metadata. The space used by a snapshot may change at any time; the content referred to by a snapshot, however, is immutable—the 2 TB do not change.

If you had “replicated the snapshot” to an external backup, you would have copied the full 2 TB of data alongside the metadata. The full data can then be restored to a new NAS by replicating back from the external backup. Likely time-consuming, but very possible.
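
The “170 kB today, 2.3 MB tomorrow” effect can be sketched in a few lines of Python (a made-up accounting model, not real `zfs` output): a snapshot is only charged for blocks that nothing else references, so destroying an older snapshot can make a newer one’s USED figure grow.

```python
# Toy model of snapshot space accounting.
active = {10, 11}                 # block ids referenced by the live dataset
snaps = {
    "tank@old": {0, 1, 2},        # old data, partly shared with @new
    "tank@new": {1, 2, 10},       # shares 1,2 with @old and 10 with active
}

def used(name):
    # Blocks charged to this snapshot: referenced by it and nothing else.
    others = set(active)
    for other, blocks in snaps.items():
        if other != name:
            others |= blocks
    return snaps[name] - others

print(len(used("tank@new")))  # 0 -- everything it holds is shared

del snaps["tank@old"]         # destroy the older snapshot
print(len(used("tank@new")))  # 2 -- blocks 1 and 2 are now charged to it alone
```

The data blocks never changed; only the bookkeeping of who is charged for them did.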

2 Likes

If you want a non-technical way to look at snapshots, check out these graphics.

Think of the “truck” as the available capacity (or pool).
Think of each “box” as a data block or file.
Think of any box with a “white sticker” as included in the active dataset/filesystem.
Think of a “color tag set” as a snapshot, which can also be viewed as a dataset/filesystem itself.
Think of a “faded” box as hidden/unavailable: it only exists in whatever snapshot(s) it is tagged with.

Read the intro and rules carefully.

2 Likes

yeah, I know how snapshots work and that they aren’t files (-: but they have a size; I used the wrong terms. Sorry, but…

THAT’S the solution for my problem XD as it’s now clear … I’ve never understood “Replication” facepalm …

Getting all the data back is always time-consuming :wink:

@winnielinnie … great link!

1 Like

I’ve only skim-read this thread as I’m incredibly busy atm, so apologies if this has already been stated, but zpool checkpoints can protect against accidental/malicious deletion of datasets. @winnielinnie has made a great post regarding this.

2 Likes

:tada:

Ouf! I’m glad we finally cracked it.

Even the right terminology may not be entirely helpful here.

  • “A snapshot”, in itself, is strictly metadata; BUT
  • “Mounting/browsing a snapshot” pertains to the data referred to by the snapshot, presented as a read-only file system;
  • “Replicating a snapshot” copies both data and metadata.

I hope you understand that I cannot understand what you understand or don’t understand when your language appears to suggest that you think there is, somewhere, a tank-YYMMDD-HHmm.snapshot file with a defined size, which could be copied on its own… :face_with_head_bandage:

3 Likes

Simply put, snapshots are a brilliant way to be able to almost treat the timeline for data on a NAS like the timeline inside a video editor. I have not had to use it often, but it’s super helpful when it’s needed.

1 Like

Snapshots are insane and mindblowing… loving it :smiley:

Completely right! And I am really thankful that you helped me through this disaster.

2 Likes

SNAPSHOTS ARE NOT BACKUP. If you replicate onto another system, then great… you are somewhat protected.

I back up one copy to a Synology, and another copy to an instance of Debian, because that is all I have to back up to. ZFS replication is not an option for me. I use rsync and it works flawlessly. I still take snapshots of selected datasets, but not really as a backup function.

So a snapshot is a reflection of the data in a pool in total? Snapshots aren’t related to one another, except that they represent the data as it was at the time of snapshot creation? If they only contain metadata, how do you recover data that no longer exists?

Snapshots are at the dataset level, not necessarily the pool—but they can cover the whole pool if you snapshot the root dataset recursively.
Snapshots of the same dataset are related to each other in that they know how to share metadata. Leave the details to ZFS.

Once again: There’s no such thing as a snapshot without the corresponding data.
As long as a block is referenced by at least one snapshot this block cannot be deleted or modified. (Remember: ZFS is Copy-on-Write.)

1 Like

So if I create a dataset, put data on it and take an initial snapshot, it would be a complete copy of the data. Then I modify the data and take more snapshots over time; each snapshot would only have the changed blocks from snapshot to snapshot. Am I right? Snapshots have always confused me.

The snapshot is metadata, not “a copy of the data”. Consider that the snapshot puts a little lock :lock: on each and every block of the data.

myfile.dat consists of blocks 1 to 10. You request to modify block 3. The block is locked by a snapshot :lock:. ZFS writes a new block 11 with the modified data, and records that the current version of myfile.dat is 1-2-11-4-5-6-7-8-9-10. The snapshot records that myfile.dat at snapshot time was and shall forever remain 1-2-3-4-5-6-7-8-9-10.
Copy-on-Write. Look this up.
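
The myfile.dat example above, modeled in a few lines of Python (a sketch, not ZFS code):

```python
# myfile.dat is blocks 1..10; a snapshot freezes that list; then block 3
# is modified. Copy-on-Write: the locked block is never overwritten.

pool = {i: f"data{i}" for i in range(1, 11)}  # blocks 1..10 live in the pool

myfile_current = list(range(1, 11))  # active view: 1-2-3-...-10
snapshot_view = list(myfile_current) # the snapshot freezes this list

# A new block 11 receives the modified data, and the active block list
# is repointed; the old block 3 stays in the pool for the snapshot.
pool[11] = "data3-modified"
myfile_current[myfile_current.index(3)] = 11

print(myfile_current)  # [1, 2, 11, 4, 5, 6, 7, 8, 9, 10]
print(snapshot_view)   # [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] -- forever unchanged
print(pool[3])         # 'data3' -- the old block is still in the pool
```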

2 Likes

Thanks, it’s getting a little clearer; I appreciate it. So let’s say I create a dataset and put myfile.dat on it, with data in blocks 1-2-3-4-5-6-7-8-9-10, then take a snapshot: it locks blocks 1-10. I then modify block 4, which makes a new block 11 with the modified data. When I take the next snapshot, it will lock block 11, etc.… Am I close to the operation?