I’ve had an X20 since 2019, long enough for the service contract to go EOL. Now iXsystems tells me to go to the forum for help. Other than bad power supplies, which were quickly replaced, it’s been rock solid. Because of that, I haven’t actually learned much about how it works beyond using and updating TrueNAS. This is my first “datacenter” server, so I’m totally new to this world.
About 2 weeks ago it sent me an email saying it rebooted and then a few hours later it went down. I went and physically disconnected it from power and reconnected it and it came back up. I foolishly thought all was good and forgot about it.
But a few days later it went down again and now I can’t get it back up. By down, I mean I can’t connect to the TrueNAS IP. The unit itself powers up and there are no blinking or amber LEDs, so it looks completely normal. There’s no video out, so I’m restricted to serial and Ethernet connectivity. I can connect to the server using the USB-to-serial cable and with ipmitool via Ethernet.
ipmitool sol activate gives me an “ESM” prompt. According to the docs, typing “$%^0” followed by 2 returns is supposed to give me the x86 console, but it doesn’t. I get no feedback until I type “$%^2”, which takes me back to the ESM prompt. I’ve done lots of debugging with ESM and ChatGPT but can’t figure out why it won’t boot. Everything looks normal according to ChatGPT, except that it sometimes says a CPU board is missing, and I’m not sure I believe it.
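Since ipmitool over Ethernet still works, a few read-only IPMI queries might show whether the BMC thinks the x86 host is powered on and whether it logged anything around the crashes. The following is only a sketch under assumptions: the helper names are made up for illustration, the BMC is assumed to accept the lanplus interface, and whether this particular BMC answers `chassis status`, `sel elist`, or `sol info` meaningfully is not confirmed anywhere in this post. The password is read from the IPMI_PASSWORD environment variable via ipmitool's `-E` option.

```python
import subprocess


def ipmi_cmd(host, user, *args):
    """Build an ipmitool command line (hypothetical helper).

    -I lanplus selects the IPMI v2.0 LAN interface (assumed to be supported),
    -E makes ipmitool read the password from the IPMI_PASSWORD env variable.
    """
    return ["ipmitool", "-I", "lanplus", "-H", host, "-U", user, "-E", *args]


def parse_chassis_status(text):
    """Turn the 'Key : value' lines printed by `chassis status` into a dict."""
    status = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            status[key.strip()] = value.strip()
    return status


def host_power_is_on(text):
    """True if the 'System Power' field of `chassis status` output reads 'on'."""
    return parse_chassis_status(text).get("System Power", "").lower() == "on"


def run_checks(host, user):
    """Run a few read-only queries and return {subcommand: output text}."""
    results = {}
    for args in (("chassis", "status"), ("sel", "elist"), ("sol", "info")):
        proc = subprocess.run(
            ipmi_cmd(host, user, *args), capture_output=True, text=True
        )
        # Keep stderr when stdout is empty so unsupported commands stay visible.
        results[" ".join(args)] = proc.stdout or proc.stderr
    return results
```

With IPMI_PASSWORD exported, something like `run_checks("192.0.2.10", "admin")` would collect the three outputs (the address and username are placeholders), and `host_power_is_on(results["chassis status"])` gives a quick yes/no on host power. An event log with entries near the reboot times would be worth posting along with the rest.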
screen /dev/ttyUSB0 38400 gives me access to a Linux OS running on ARM, but my server is x86 and runs TrueNAS Core, which is FreeBSD. ChatGPT says the Linux OS is an embedded BMC/IPMI service processor OS. It’s running, but I can’t see any way to get from that to the x86 console either. I don’t really understand how these servers are built (I’ve never built my own server), so I’m not sure how the embedded processor and the main board communicate. I’m just used to the BIOS being the thing that starts once it receives power. But I can’t view the BIOS. I’ve never had a computer that had no video out, so I’m unsure what to do next.
I have purchased an identical SAS controller canister from eBay (I retrieved the model number from ESM’s fru get). I swapped the M.2 NVMe disks and installed it, but that didn’t work either. However, I didn’t try everything while it was installed, so I should try it all again. But first I’m going to see if I can read the M.2 NVMe disk that came with the new canister and whether I can install TrueNAS on it. Then I’m going to try everything all over.
Anyway, ChatGPT says the controller board or boot device might have failed. But it would be nice if I could see something to confirm that. So I’m trying the forum. Does anyone have any info they can give me to try to view the BIOS? I’ve still got all the debugging output from ipmitool, the ESM, and the serial connection and can post it if it might be useful. Well, fru get might be useful. It says ESCE B because I swapped it from the first bay to the second to see if reseating it helped. It didn’t.
```
ESM B => fru get
--- EL LOBO Enclosure ---
[Product Info]
Product Name: PUMA LFF BMC NO HA
Product Manufacturer Name: CELESTICA-CSS
Product Serial Number: bla-does-it-matter
Product Part: P3217-B
[Incumbent Canister ID]
ICID = CLS PUMA
Total 24 bytes:
43 4c 53 20 20 20 20 20 50 55 4d 41 20 20 20 20 CLS PUMA
20 20 20 20 20 20 20 20
[Chassis]
Chassis Part Number: R0930-F0105-01
Chassis Serial Number: bla-does-it-matter
Chassis Product Name: PUMA LFF
[DriveBoard]
Drive Board Product Name: PUMA 3.5 INCH
Drive Board Serial Number: bla-does-it-matter
Drive Board Manufacturer: CELESTICA-CSS
Drive Board Part Number: R0930-G1036-01
Drive Board HW Version:
Drive Board MFG Serial Number:
Drive Board SAS Seed: 50-0E-0E-CA-06-C0-9E-00-3E-3D
--- ESCE A ---
NotIstall
--- ESCE B ---
[General]
Product Name: PUMA
Canister ID: CLS PUMA
SAS Address: 50-0E-0E-CA-06-C0-9E-7E
Running Time: 1 day 1 hours 19 minutes 55 seconds
[Board]
Manufacture Name: CELESTICA-CSS
Part Number: R0930-G0006-01
Serial Number: bla-does-it-matter
[Revision]
FW Revision 4.0.3.3
Tamer r662 Built 2018/06/28
CFG Revision 4.0.3.3
CPLD Revision Code: 0.1.0.3
HW EC LEVEL: 03
--- Power Supply 0 ---
PS Type: 800W-JBOD-PSU
Power Capacity: 800W
PS Manufacturer: DELTA-THAILAND
PS Serial Number: bla-does-it-matter
PS Part Number: TDPS-800EB A
PS Firmware Version: 010=
--- Power Supply 1 ---
PS Type: 800W-JBOD-PSU
Power Capacity: 800W
PS Manufacturer: DELTA-THAILAND
PS Serial Number: bla-does-it-matter
PS Part Number: TDPS-800EB A
PS Firmware Version: 010=
```
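Because there are two canisters involved (the original and the eBay one), comparing their fru get dumps field by field might show a part number or firmware mismatch that is easy to miss by eye. Below is a minimal sketch of such a comparison. The function names are invented for illustration, and the parsing rules are only inferred from the dump above: `--- name ---` starts a section, `[name]` subsection headers are ignored (assuming keys do not repeat within a section), and lines without a colon are kept verbatim.

```python
import re

# A section header looks like "--- ESCE B ---" in the dump above.
SECTION = re.compile(r"^---\s*(.+?)\s*---$")


def parse_fru(text):
    """Group 'Key: value' lines of a `fru get` dump by their section name."""
    sections = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        match = SECTION.match(line)
        if match:
            current = sections.setdefault(match.group(1), {})
            continue
        # Skip the prompt line, blank lines, and "[Subsection]" headers.
        if current is None or not line or line.startswith("["):
            continue
        key, sep, value = line.partition(":")
        if sep:
            current[key.strip()] = value.strip()
        else:
            # Lines without a colon (e.g. "FW Revision 4.0.3.3") are kept as-is.
            current.setdefault("_raw", []).append(line)
    return sections


def diff_fru(old, new):
    """Return (section, key, old_value, new_value) for every differing field."""
    changes = []
    for section in sorted(set(old) | set(new)):
        a, b = old.get(section, {}), new.get(section, {})
        for key in sorted(set(a) | set(b)):
            if a.get(key) != b.get(key):
                changes.append((section, key, a.get(key), b.get(key)))
    return changes
```

Saving each canister's output to a text file and running `diff_fru(parse_fru(old_text), parse_fru(new_text))` would list only the fields that differ. Serial numbers and running time will always differ, so the interesting rows would be part numbers and the revision lines.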
