I rolled the dice and picked up several used drives from ebay. As they were used drives, I wanted to do some additional testing before trusting them with any of my data. One of the drives is failing smart tests but passed a badblocks run.
Here are the most recent runs of smartctl:
SMART Self-test log
Num Test Status segment LifeTime LBA_first_err [SK ASC ASQ]
Description number (hours)
# 1 Background long Failed in segment --> 3 2319 - [0x1 0xb 0x96]
# 2 Background short Failed in segment --> 3 2314 - [0x1 0xb 0x96]
# 3 Background long Completed - 2236 - [- - -]
# 4 Background long Failed in segment --> 7 2193 175919960 [0x3 0x5d 0x1]
# 5 Background short Completed - 2192 - [- - -]
# 6 Background short Failed in segment --> 3 2192 - [0x1 0xb 0x96]
# 7 Background short Completed - 2169 - [- - -]
The drive showed up with one passed short test (#7). It failed my first test (#6), then passed (#5) then failed a long test (#4). At some point I did a run of badblocks, which it passed:
truenas_admin@truenas[~]$ time sudo badblocks -t random -w -s -b 4096 /dev/sda
Testing with random pattern: done
Reading and comparing: ^C9.76% done, 64:10:04 elapsed. (0/0/0 errors)
Interrupted at block 142973376
sudo badblocks -t random -w -s -b 4096 /dev/sda 247.30s user 1524.73s system 0% cpu 64:10:04.34 total
and now itās failing the smart tests again. What gives? Since badblocks writes then reads from every sector, isnāt that supposed to be more robust than a smart test? Would you consider this drive DOA?
If itās at all relevant, these are SAS drives and it took something like 2.5 days to run the badblocks test.