Hi All,
My system has been great for years but I’m currently suffering a very serious problem and I’m running out of ideas of what it could be.
I have a variety of systems reading from and writing back to TrueNAS via SMB causing file corruption on the destination files, confirmed 100% with a binary file compare.
(This is obviously, very very bad)
Current testing involves the following:
cp testfile.bin /mnt/tnas-smb/compare/compare1 && cp testfile.bin /mnt/tnas-smb/compare/compare2 && cp testfile.bin /mnt/tnas-smb/compare/compare3 && cp testfile.bin /mnt/tnas-smb/compare/compare4
Full genuine binary file compare confirms all files are NOT identical, on most tries.
If I run the same command and copy the files from TrueNAS to TrueNAS directly, it seems to consistently work.
I can confirm that the problem occurs to both of my pools, one being 6x16TB drives, the other 6x2TB SSDs Z2
The system has been dead stable for me.
Here’s where the problem gets significantly more frustrating and complicated.
(Seriously stop reading here, this is ridiculous stuff)
Machine1: Proxmox host, HP Mini 12th Gen machine running:
Proxmox (obviously)
DietPiVM
UbuntuLXC provided by proxmox template.
UbuntuVM 22.04
UbuntuVM 24.04
Windows10VM
Machine2:
Spare laptop, Ubuntu 22.04
The DietPi VM infrequently corrupts files
The Ubuntu 22.04 VM corrupts files very often and is very slow to copy
The UbuntuLXC infrequently corrupts files
Windows10 doesn’t seem to corrupt files.
Ubuntu 24.04 doesn’t seem to corrupt files
Even the proxmox host will corrupt files (!!!) if I SSH into it, mount TrueNAS and copy from and back to TrueNAS…
The spare laptop with Ubuntu 22.04 is not corrupting files.
I would be fine with all the VMs and LXCs corrupting files on proxmox, but I can’t seem to fault the 24.04 UbuntuVM. (built specifically to replace the 22.04 VM with the most issues)
None of this makes any sense to me at all.
What other testing should I be doing?
I’m going to loop in @pmh because I know he’s particularly skilled but at this point I’m quite lost and frustrated. I’m going to destroy the SSDs with writes at this rate just trying to isolate what systems do and do not cause this problem to occur.