Possible HBA failure after upgrade or am I barking up the wrong tree?

So first ever post please be gentle :grinning:

I have a system that’s been running for many years, but I suddenly have an issue. I wanted to change the boot drives from USB sticks to SSDs whilst upgrading the software. I created an install stick and ran the installer to create a boot drive on a new SSD. I then went into the BIOS and changed the boot order to the SSD.

Since then I’ve had massive issues trying to get the system to boot. I can get it to boot, but only with the SSD connected directly to the MOBO’s SATA ports and with my HBA removed. If the HBA is installed, or the SSD is connected via the HBA, the system hangs during boot with different errors. This isn’t workable, as I need the HBA to have enough drive ports.

My current thought is that the HBA has failed and I need to get a new one, but I thought I would ask here before taking that step.

As it stands I have taken the following steps:

  • Reset MOBO CMOS
  • Check MOBO has latest BIOS
  • Upgrade Firmware on LSI HBA
  • Tried removing all drives, then removing the cables from the HBA, then removing the HBA from the PCIe slot. Boot only succeeds when the card is physically removed.
  • Tried moving HBA to a different PCIe slot
  • There are probably other steps I’m forgetting, as it’s been a bit of a wild ride getting to this point.

I appreciate you will likely need screenshots of the errors to diagnose this properly, but I just wondered if this rings any bells with anyone, or if you have any initial thoughts on what I should try. I will try to follow up with screenshots in the next few days, and if you need any more info to help diagnose, please just ask.

Thanks in advance.

Basic Hardware:

  • MB = Supermicro X10SLH-F
  • HBA = LSI 9211-8i
  • OS Drives = Formerly USB Sticks / Now 256GB SanDisk SSD

What are the errors?

Full listing of Hardware, OS version, etc.

Run sudo sas2flash -list and paste the results back as Preformatted Text, using (</>) on the toolbar or Ctrl + E.

Make sure you have enough cooling air flow over the HBA. 200 linear feet per minute!

I’ll get those details for you tomorrow. If it helps, it should be on the latest firmware now and flashed in IT mode.

It had been working for around 8 years up to this point with no issue.

Regarding airflow, I honestly wouldn’t know how to measure the rate of air going over it. There are a number of fans in the system, but none blowing air directly over the HBA. If it turns out I’ve cooked it, I will rectify that when a new one is installed. Any thoughts on how to measure the airflow?

Ta

If you have been running the same HBA for 8 years and have been putting the same load on the system, I doubt it was an airflow issue.

If you can’t boot off the HBA with just a boot drive in the system, I would just plan on another HBA. Download a backup of your system configuration and try reinstalling with only the boot drive attached to the HBA.

If you can’t boot a new install with only the HBA and a boot drive, the HBA is most likely the problem.

When you say the system fails to boot while the HBA is connected, are we talking about fully failing to boot? As in, you can’t even get into the BIOS? If so, I’d argue the HBA is likely toast.