I’ve replaced my da2 disk twice, it’s failed each time. I now suspect the cable, I’ve bought a new one, but is there anything I can look at to point me to the cable and help me differenciate between cable and disk. I hope it’s not the onboard controller. as I have no slots spare to plug in a new on, so it would be a new server, which I can’t afford.
It failed in the middle of a pool scrub, will that create any additional problems?
When it was powered off I opened up the server and reconnected ada2’s data connector, maybe that’s why the disk isthere now, even it it’s not in the pool
I know it’s a stripe0, and the data has potentially gone, but I have a good backup.
I can’t see anything blindingly wrong with the drive and it seems fairly young based on its power on hours. As this isn’t the first time it’s happened to you then you are right to potentially suspect cabling/port. The fact the pool is a stripe of 4 drives is a different conversation but it sounds like you are well aware of the dangers of that.
I’d be tempted to make a note of your drive serial numbers and which SATA connector and port they are connected to and monitor perhaps even occasionally moving drives around (when powered off) to see if the issue stays with a particular cable/port or follows a specific drive.
Perhaps look around the forums for some burn-in test tools to stress the drives/connectors to try and find the weak spot before you but any data on them again.
PS: Why not share the specs and setup of your system incase there is anything odd there. Chassis, mobo how the drives are connected etc.
Ideally create a signature with them in so that when you post forums members can understand your system setup almost as well as you do. Start with Chassis and work your way through. It doesn’t have to be exhaustive but if you could identify the key components that is always helpful.
It’s possible the reference to the drive in the alert has changed if you rebooted since the alert was generated. Device names like adaX or sdaX, etc, aren’t necessarily static. This is fine from a ZFS standpoint but can cause issues when troubleshooting, if you’re not aware of it.
If you aren’t sure, you can post the SMART report from all the drives, just in case the culprit is somewhere else.
Edit: I see I missed the last screenshot with the pool status clearly showing ada2 as REMOVED, that should be reliable enough so the above request isn’t really relevant. Still good to know of though so I will leave it.
Once I get the cable, later today, is there a way of adding the removed disk back into the pool to bring the pool online? Or is the pool effectivly dead and I’ll delete it, recreate it and start again?
It is all gone. A stripe with a single failed drive is a complete failure of the entire pool. You need to treat a Stripe as if it were a single drive, any part of that drive fails and the entire thing fails.
You are better off creating a RAIDZ1 or bette a RAIDZ2, but that depends on how important the data is to you.
You might be lucky and it may jump back into life but like @joeschmuck said you have to assume it’s lost as by design losing a drive in a stripe loses the pool.
I will digress a little bit. For the correct amount of money, and using your old failed drive (preferred), you may be able to recover some of your data. The correct amount of money is a lot of money as you are likely going to need a data recovery company.
The reason I had a stripe 0 was I didn’t have a spare port to plug in the extra drive. However thanks to Alister I have another look at the mother board and discovered a spare ODD sata port, after checking online Dell SATA ODD ports are no different to the other mini sas sata ports, so that gave me the extra port for the drive to give me raidz1, which I implemented,
I swapped the drives around, and the physical disk that was in ADA2 failed on a pool scrub again, although was perfectly ok before a throught check during the scrub. As I have Raidz1 all the data’s safe and I don’t have to wait 5 days for it to be restored.
GREAT news it’s the disk, not the controller, I’d swapped the cable. Weird in that fact it was in the same place as a previous one, but coincidental. It’s in warraty and waiting for UPS to pick it up.