hello
I have a problem.
The disks connected via lsi 9206-16e fall off after a while. If you connect the disks directly to the motherboard, then there is no problem. There is a suspicion that the problem is in the settings. But which ones are not clear. I tried connecting two different 9206 controllers, the behavior is identical. Can you tell me what the problem is?
root@rpc-nas[~]# uname -a
FreeBSD rpc-nas.local 13.1-RELEASE-p9 FreeBSD 13.1-RELEASE-p9 n245432-de4561397a1 TRUENAS amd64
root@rpc-nas[~]# sas2flash -listall
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reservedAdapter Selected is a LSI SAS: SAS2308_2(D1)
Num Ctlr FW Ver NVDATA x86-BIOS PCI Addr
0 SAS2308_2(D1) 20.00.07.00 14.01.00.06 07.39.02.00 00:84:00:00
1 SAS2308_2(D1) 20.00.07.00 14.01.00.06 No Image 00:86:00:00Finished Processing Commands Successfully. Exiting SAS2Flash.
root@rpc-nas[~]# sas2flash -c 1 -list
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reservedAdapter Selected is a LSI SAS: SAS2308_2(D1) Controller Number : 1 Controller : SAS2308_2(D1) PCI Address : 00:86:00:00 SAS Address : 5000d31-0-0050-fadd NVDATA Version (Default) : 14.01.00.06 NVDATA Version (Persistent) : 14.01.00.06 Firmware Product ID : 0x2214 (IT) Firmware Version : 20.00.07.00 NVDATA Vendor : LSI NVDATA Product ID : SAS9206-16e BIOS Version : N/A UEFI BSD Version : N/A FCODE Version : N/A Board Name : SAS9206-16E Board Assembly : H3-25553-01A Board Tracer Number : SV42822840 Finished Processing Commands Successfully. Exiting SAS2Flash.
dmesg
(da0:mps1:0:0:0): SCSI sense: NOT READY asc:4,0 (Logical unit not ready, cause not reportable)
(da0:mps1:0:0:0): Retrying command (per sense data)
(da0:mps1:0:0:0): READ(6). CDB: 08 00 00 28 01 00
(da0:mps1:0:0:0): CAM status: SCSI Status Error
(da0:mps1:0:0:0): SCSI status: Check Condition
(da0:mps1:0:0:0): SCSI sense: NOT READY asc:4,0 (Logical unit not ready, cause not reportable)
(da0:mps1:0:0:0): Error 5, Retries exhausted
GEOM_PART: da0 was automatically resized.
Usegpart commit da0
to save changes orgpart undo da0
to revert them.
GEOM_PART: integrity check failed (da0, GPT)
mps1: Controller reported scsi ioc terminated tgt 0 SMID 783 loginfo 31170000
mps1: Controller reported scsi ioc terminated tgt 0 SMID 782 loginfo 31170000
mps1: mpssas_prepare_remove: Sending reset for target ID 0
da0 at mps1 bus 0 scbus13 target 0 lun 0
da0: <ATA Netac SSD 1TB 915a> s/n AA202410111T21444225 detached
mps1: No pending commands: starting remove_device
(da0:mps1:0:0:0): Periph destroyed
da0 at mps1 bus 0 scbus13 target 0 lun 0
da0: <ATA Netac SSD 1TB 915a> Fixed Direct Access SPC-4 SCSI device
da0: Serial Number AA202410111T21444225
da0: 600.000MB/s transfers
da0: Command Queueing enabled
da0: 953869MB (1953525168 512 byte sectors)
mps1: Controller reported scsi ioc terminated tgt 0 SMID 913 loginfo 31110d00
(da0:mps1:0:0:0): WRITE(10). CDB: 2a 00 14 37 4e 10 00 00 08 00
(da0:mps1:0:0:0): CAM status: SCSI Status Error
(da0:mps1:0:0:0): SCSI status: Check Condition
(da0:mps1:0:0:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, reset, or bus device reset occurred)
(da0:mps1:0:0:0): Retrying command (per sense data)
I changed the thermal paste on the controller. cables too.