Here is the tl;dr of this post now that I figured it out:
- Referenced will be smaller on the clone replicas than the original. That’s normal. All the clones will likely have the same size.
- You can always safely delete any snapshot (that the system will allow you since some have clones), subject to #4.
- Full replication will find the newest snapshot in common between the systems.
- However, if the newest snapshot in COMMON is not COMPLETE, it will let you know and STOP rather than trying to find an earlier complete snapshot. It’s just trying to get your attention that that shouldn’t have happened (and was likely because you did some manual snapshot deletion).
----- END OF SUMMARY-------
Here’s the original post.
My full replica backups to a second truenas system were going along just fine until I decided to do a little housecleaning.
So I decided to move minio dataset to a new location. So I did a clone and promote and then i deleted the old dataset
But when my daily replication job started, I gave me this error which is super confusing to me because that particular snapshot was replicated identically on the two backup systems I have.
Here’s the output. See my additional comments below.
Critical
Replication "main -> backup" failed: Last full ZFS replication failed to
transfer all the children of the snapshot main@auto-2024-09-24_21-10.
The snapshot main/apps/minio@auto-2024-09-24_21-10 was not transferred.
Please run `zfs destroy -r main@auto-2024-09-24_21-10`
on the target system and run replication again..
2024-09-25 21:10:06 (America/Los_Angeles)
Dismiss
And here’s the screen shot showing the original is there and both the replicas of the snap are there (with a slightly smaller Referenced size):
The top image is my running system. main is the original, SSD is the backup.
I have an offsite clone of main, that’s the second image.
The issue is the error message… It’s baffling because nothing is missing for minio… all three snapshots on 9/24 are there just as expected.
So why did it think something was missing and not tranferred? there is no evidence of that so I’m baffled.
OK, i see the problem. I think I wasn’t careful when I hand deleted some snapshots to try to roll things back when I turned off zero snapshots.
I started these full replicas on 9/21, but there isn’t a minio replica on 9/21 so clearly something is missing.
The error was right that things were missing it’s just the stuff before that was mising I belive due to user error.
I’m going to start from scratch and monitor.
For now, I’m good.