I’m still pretty new to TrueNAS, so please forgive me if I’m yet another bozo who didn’t take the time to research similar issues. I haven’t found anything similar to what I’m experiencing, so now I’m posting about it.
I have 5 drives in my TrueNAS server: 4x 12TB SAS HDDs and 1x 128GB NVMe boot drive. After configuring the server, I set up SMART tests on a regular interval and went to test my drives. The 4 SAS drives checked out, and in the TrueNAS web GUI I could see the successful tests under SMART Test Results. The only drive not listed there is my NVMe.
For my own peace of mind, I tried running smartctl -t short /dev/nvme0n1p1
and took a look at the results:
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.12.15-production+truenas] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Number: ORICO
Serial Number: P0C03PDTYQS2N4049VDB
Firmware Version: W0830B
PCI Vendor ID: 0x126f
PCI Vendor Subsystem ID: 0x2261
IEEE OUI Identifier: 0x5cd2e4
Total NVM Capacity: 128,035,676,160 [128 GB]
Unallocated NVM Capacity: 0
Controller ID: 1
NVMe Version: 1.3
Number of Namespaces: 1
Namespace 1 Size/Capacity: 128,035,676,160 [128 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: efcdab 0000000000
Local Time is: Mon Aug 4 00:36:35 2025 CDT
Firmware Updates (0x14): 2 Slots, no Reset required
Optional Admin Commands (0x0016): Format Frmw_DL Self_Test
Optional NVM Commands (0x0015): Comp DS_Mngmt Sav/Sel_Feat
Log Page Attributes (0x03): S/H_per_NS Cmd_Eff_Lg
Maximum Data Transfer Size: 64 Pages
Warning Comp. Temp. Threshold: 83 Celsius
Critical Comp. Temp. Threshold: 85 Celsius
Supported Power States
St  Op  Max    Active  Idle  RL  RT  WL  WT  Ent_Lat  Ex_Lat
0   +   9.00W  -       -     0   0   0   0   0        0
Supported LBA Sizes (NSID 0x1)
Id  Fmt  Data  Metadt  Rel_Perf
0   +    512   0       0
=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x00
Temperature: 42 Celsius
Available Spare: 100%
Available Spare Threshold: 42%
Percentage Used: 0%
Data Units Read: 150,195 [76.8 GB]
Data Units Written: 563,197 [288 GB]
Host Read Commands: 1,544,060
Host Write Commands: 15,201,384
Controller Busy Time: 59
Power Cycles: 21
Power On Hours: 314
Unsafe Shutdowns: 6
Media and Data Integrity Errors: 0
Error Information Log Entries: 0
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
Error Information (NVMe Log 0x01, 16 of 256 entries)
No Errors Logged
Self-test Log (NVMe Log 0x06)
Self-test status: No self-test in progress
Num  Test_Description  Status                      Power_on_Hours  Failing_LBA  NSID  Seg  SCT  Code
0    Short             Completed: failed segments  313             -            -     2    -    -
1    Short             Completed: failed segments  313             -            -     2    -    -
2    Short             Completed: failed segments  313             -            -     2    -    -
3    Short             Completed: failed segments  313             -            -     2    -    -
4    Short             Completed: failed segments  312             -            -     2    -    -
From a little bit of research, it sounds like “failed segments” could mean a failure to read the drive. All five logged short tests show the same status, each with a 2 in the Seg column. In the few weeks I’ve had the drive, no more than 300GB has been written to it, so I can’t understand why I’m running into this problem.
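To double-check the write figure, here is a rough Python sketch of the conversion. It assumes the usual NVMe convention that one data unit is 1,000 blocks of 512 bytes, which lines up with the bracketed value smartctl prints next to Data Units Written. The function name is just mine for illustration.

```python
# Assumption: one NVMe "data unit" is 1,000 blocks of 512 bytes each.
BYTES_PER_DATA_UNIT = 1000 * 512


def data_units_to_gb(units: int) -> float:
    """Convert an NVMe data-unit count to decimal gigabytes (10^9 bytes)."""
    return units * BYTES_PER_DATA_UNIT / 1e9


# Data Units Written, taken from the SMART/Health section of the output.
written_gb = data_units_to_gb(563_197)
print(f"written: {written_gb:.1f} GB")
```

For my drive this works out to a little over 288 GB, so comfortably under 300GB.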
Looking into this some more, I tried Multi-Report as a way to get more detailed SMART results for all of my drives, including the NVMe. It, too, flagged the failed segments as a critical error.
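In case it helps anyone reproduce the check, this is a minimal Python sketch of how I would pull the failed entries out of the self-test log. It is not how Multi-Report works internally (I haven't read its source). The regex, the field names, and the assumption that a clean run is reported as “Completed without error” are all mine, based on the log format in the smartctl output.

```python
import re

# Self-test log rows copied from the smartctl output in this post.
SELF_TEST_LOG = """\
0 Short Completed: failed segments 313 - - 2 - -
1 Short Completed: failed segments 313 - - 2 - -
2 Short Completed: failed segments 313 - - 2 - -
3 Short Completed: failed segments 313 - - 2 - -
4 Short Completed: failed segments 312 - - 2 - -
"""

# Columns: Num, Test_Description, Status, Power_on_Hours,
# Failing_LBA, NSID, Seg, SCT, Code. The status text may contain
# spaces, so it is matched lazily up to the power-on-hours number.
ROW = re.compile(
    r"^\s*(?P<num>\d+)\s+(?P<test>Short|Extended)\s+(?P<status>.+?)\s+"
    r"(?P<hours>\d+)\s+(?P<lba>\S+)\s+(?P<nsid>\S+)\s+(?P<seg>\S+)\s+"
    r"(?P<sct>\S+)\s+(?P<code>\S+)\s*$"
)


def parse_self_test_log(text):
    """Return one dict per log row that matches the expected layout."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            rows.append(match.groupdict())
    return rows


def failed_entries(rows):
    """Keep rows whose status is anything other than a clean completion."""
    # Assumption: a passing test is logged as "Completed without error".
    return [row for row in rows if row["status"] != "Completed without error"]


rows = parse_self_test_log(SELF_TEST_LOG)
failed = failed_entries(rows)
for row in failed:
    print(f"test {row['num']}: {row['status']} "
          f"(segment {row['seg']}, {row['hours']} power-on hours)")
```

Run against my log, every one of the five entries comes back as failed, all pointing at segment 2.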
I think it’s pretty clear that I don’t know what’s going on, but if anyone could point me in the right direction, I would truly appreciate the help.