Hi
My boot pool ran a scrub last night, and one of its two disks dropped out, leaving the pool degraded.
That disk is now sitting in a “removed” state.
The disks are a pair of Silicon Power 128GB SSDs, which I only purchased in August last year.
Here is a snippet of the logs:
Jul 17 00:14:44 S-STORE01 ahcich1: Timeout on slot 23 port 0
Jul 17 00:14:44 S-STORE01 ahcich1: is 00000000 cs 00000000 ss 00800000 rs 00800000 tfd 40 serr 00080000 cmd 0004d717
Jul 17 00:14:44 S-STORE01 (ada1:ahcich1:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 04 34 8a 7f 40 0c 00 00 00 00 00
Jul 17 00:14:44 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:14:44 S-STORE01 (ada1:ahcich1:0:0:0): Retrying command, 3 more tries remain
Jul 17 00:15:47 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:15:47 S-STORE01 ahcich1: Timeout on slot 24 port 0
Jul 17 00:15:47 S-STORE01 ahcich1: is 00000000 cs 01000000 ss 00000000 rs 01000000 tfd 80 serr 00000000 cmd 0004d817
Jul 17 00:15:47 S-STORE01 (aprobe0:ahcich1:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jul 17 00:15:47 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:15:47 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retry was blocked
Jul 17 00:16:50 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:16:50 S-STORE01 ahcich1: Timeout on slot 25 port 0
Jul 17 00:16:50 S-STORE01 ahcich1: is 00000000 cs 02000000 ss 00000000 rs 02000000 tfd 80 serr 00000000 cmd 0004d917
Jul 17 00:16:50 S-STORE01 (aprobe0:ahcich1:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jul 17 00:16:50 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:16:50 S-STORE01 (aprobe0:ahcich1:0:0:0): Retrying command, 0 more tries remain
Jul 17 00:17:53 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:17:53 S-STORE01 ahcich1: Timeout on slot 26 port 0
Jul 17 00:17:53 S-STORE01 ahcich1: is 00000000 cs 04000000 ss 00000000 rs 04000000 tfd 80 serr 00000000 cmd 0004da17
Jul 17 00:17:53 S-STORE01 (aprobe0:ahcich1:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jul 17 00:17:53 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:17:53 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
Jul 17 00:17:53 S-STORE01 ada1 at ahcich1 bus 0 scbus2 target 0 lun 0
Jul 17 00:17:53 S-STORE01 ada1: <SPCC Solid State Disk V0823A0> s/n AA230202S3128003036 detached
Jul 17 00:18:56 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:18:56 S-STORE01 ahcich1: Timeout on slot 27 port 0
Jul 17 00:18:56 S-STORE01 ahcich1: is 00000000 cs 08000000 ss 00000000 rs 08000000 tfd 80 serr 00000000 cmd 0004db17
Jul 17 00:18:56 S-STORE01 (aprobe0:ahcich1:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Jul 17 00:18:56 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:18:56 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retry was blocked
Jul 17 00:19:44 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:19:44 S-STORE01 ahcich1: Poll timeout on slot 29 port 0
Jul 17 00:19:44 S-STORE01 ahcich1: is 00000000 cs 20000000 ss 00000000 rs 20000000 tfd 80 serr 00000000 cmd 0004dd17
Jul 17 00:19:44 S-STORE01 (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 17 00:19:44 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:19:44 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
Jul 17 00:20:44 S-STORE01 ahcich1: Timeout on slot 30 port 0
Jul 17 00:20:44 S-STORE01 ahcich1: is 00000000 cs 40000000 ss 00000000 rs 40000000 tfd 80 serr 00000000 cmd 0004de17
Jul 17 00:20:44 S-STORE01 (ada1:ahcich1:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00
Jul 17 00:20:44 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Unconditionally Re-queue Request
Jul 17 00:20:44 S-STORE01 (ada1:ahcich1:0:0:0): Error 5, Periph was invalidated
Jul 17 00:21:32 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:21:32 S-STORE01 ahcich1: Poll timeout on slot 0 port 0
Jul 17 00:21:32 S-STORE01 ahcich1: is 00000000 cs 00000001 ss 00000000 rs 00000001 tfd 80 serr 00000000 cmd 0004c017
Jul 17 00:21:32 S-STORE01 (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 17 00:21:32 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:21:32 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
Jul 17 00:22:02 S-STORE01 ahcich1: Timeout on slot 1 port 0
Jul 17 00:22:02 S-STORE01 ahcich1: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr 00000000 cmd 0004c117
Jul 17 00:22:02 S-STORE01 (ada1:ahcich1:0:0:0): SETFEATURES ENABLE WCACHE. ACB: ef 02 00 00 00 40 00 00 00 00 00 00
Jul 17 00:22:02 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:22:02 S-STORE01 (ada1:ahcich1:0:0:0): Error 5, Periph was invalidated
Jul 17 00:22:50 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:22:50 S-STORE01 ahcich1: Poll timeout on slot 3 port 0
Jul 17 00:22:50 S-STORE01 ahcich1: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr 00000000 cmd 0004c317
Jul 17 00:22:50 S-STORE01 (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 17 00:22:50 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:22:50 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
Jul 17 00:23:20 S-STORE01 ahcich1: Timeout on slot 4 port 0
Jul 17 00:23:20 S-STORE01 ahcich1: is 00000000 cs 00000010 ss 00000010 rs 00000010 tfd 80 serr 00000000 cmd 0004c417
Jul 17 00:23:20 S-STORE01 (ada1:ahcich1:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 04 34 8a 7f 40 0c 00 00 00 00 00
Jul 17 00:23:20 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:23:20 S-STORE01 (ada1:ahcich1:0:0:0): Error 5, Periph was invalidated
Jul 17 00:23:20 S-STORE01 (ada1:ahcich1:0:0:0): Periph destroyed
Jul 17 00:24:08 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:24:08 S-STORE01 ahcich1: Poll timeout on slot 6 port 0
Jul 17 00:24:08 S-STORE01 ahcich1: is 00000000 cs 00000040 ss 00000000 rs 00000040 tfd 80 serr 00000000 cmd 0004c617
Jul 17 00:24:08 S-STORE01 (aprobe0:ahcich1:0:0:0): NOP FLUSHQUEUE. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
Jul 17 00:24:08 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:24:08 S-STORE01 (aprobe0:ahcich1:0:0:0): Error 5, Retries exhausted
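Since the snippet is long and repetitive, here is a rough Python sketch that tallies the CAM status messages and failed AHCI resets per channel. It is only an illustration: the function name `summarize` and the two regular expressions are my own and assume the log lines look exactly like the ones above. The `LOG` string holds just a handful of lines copied from the snippet, so the full log would need to be pasted in to get real counts.

```python
import re
from collections import Counter

# A few lines copied from the snippet above; replace with the full log text.
LOG = """\
Jul 17 00:14:44 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:15:47 S-STORE01 ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jul 17 00:15:47 S-STORE01 (aprobe0:ahcich1:0:0:0): CAM status: Command timeout
Jul 17 00:17:53 S-STORE01 ada1: <SPCC Solid State Disk V0823A0> s/n AA230202S3128003036 detached
Jul 17 00:20:44 S-STORE01 (ada1:ahcich1:0:0:0): CAM status: Unconditionally Re-queue Request
"""

# Assumed line shapes, based only on the lines shown in this post.
CAM_STATUS = re.compile(r"\((\w+):(\w+):[\d:]+\): CAM status: (.+)$")
AHCI_RESET = re.compile(r"(\w+): AHCI reset: device not ready")


def summarize(log_text):
    """Count CAM status messages and failed AHCI resets per channel."""
    cam = Counter()
    resets = Counter()
    for line in log_text.splitlines():
        match = CAM_STATUS.search(line)
        if match:
            _periph, channel, status = match.groups()
            cam[(channel, status.strip())] += 1
            continue
        match = AHCI_RESET.search(line)
        if match:
            resets[match.group(1)] += 1
    return cam, resets


if __name__ == "__main__":
    cam, resets = summarize(LOG)
    for (channel, status), count in sorted(cam.items()):
        print(f"{channel}: {count} x CAM status '{status}'")
    for channel, count in sorted(resets.items()):
        print(f"{channel}: {count} x failed AHCI reset")
```

In the log above, every entry is on ahcich1, so the tally only confirms that the problem is confined to that one port; it cannot tell a bad cable from a bad drive.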
I want to try changing the cable first to see if that is the cause.
The board supports hot swapping.
Can anyone advise on the best way to do this?
- Can it be done while the system is powered on? When I click on the ellipsis (three-dot menu), I have two options: detach or replace.
or
- Power it off and swap the cable. Upon powering on the unit, will the system try to recognise the drive again?
If the drive is faulty, I was thinking of replacing it with an Intel® SSD DC S3500 Series 2.5" SATA 6Gb/s drive (SSDSC2BB120G4). I have seen these mentioned quite a lot here.
Any ideas on the best approach for this?
Thanks