One of my TrueNAS machines has an issue: if I update SCALE past 24.10.0.2 (to 24.10.2, for example), then about a day after the install, one of the two boot NVMe drives in the boot-pool gets removed.
TrueNAS then shows an error that it is failing to read NVMe SMART/Health Information, and a manual SMART test fails with an I/O error.
Specific error from the SMART test:
Read NVMe Identify Controller failed: NVME_IOCTL_ADMIN_CMD: Input/output error
I’ve had the company I bought the server from replace the NVMe drive, but the problem occurred again, even on a clean install of 24.10.2.
If I go back to 24.10.0.2, I have no issues with the NVMe drives, and all SMART tests run and pass as they should.
Machine it’s running on: SSG-110P-NTR10
NVMe drives: Samsung PM9A3 960GB M.2
Did something change with SMART between those two versions, and does a later version, like the new 25.04, fix it?