I have Scale 24.04 running on an older Supermicro board and booting from a cheap consumer SSD. Recently, the whole machine locked up with an error about the boot pool. Unfortunately, I can’t find the exact wording anywhere in the logs. I think it was something about an uncorrectable error.
I didn’t lose any data, and after rebooting everything came up normally. SMART shows no errors. A scrub completed cleanly as well.
The config doesn’t change much and I have it backed up, so if the drive actually fails I can just reinstall. But since this server provides the shared storage for my VMs, I’d rather not have it randomly lock up due to a boot pool error.
What causes this, and how can I avoid it in the future? Is my only option to reinstall with a mirrored boot pool?