This was mentioned on old forum and never resolved. I have also encountered the issue and think I found the issue, so wanted to share with any who may experience this now or in the future.
I receive a large number of IPMI / SEL (Server Event Log) errors of the form:
Voltage #0x0d Asserted Lower Non-critical going low
(and other similar forms, all for “0x0d”)
There were mentions to view current IPMI sensor values and alert/status:
freenas# ipmitool sensor list all
CPU1 Temp | 0x0 | discrete | 0x0000| na | na | na | na | na | na
CPU2 Temp | 0x0 | discrete | 0x0000| na | na | na | na | na | na
Sys Temp | 34.000 | degrees C | ok | na | na | na | 76.000 | 78.000 | 80.000
CPU1 Vcore | 1.120 | Volts | ok | 0.560 | 0.576 | 0.592 | 1.384 | 1.392 | 1.400
CPU2 Vcore | 1.096 | Volts | ok | 0.560 | 0.576 | 0.592 | 1.384 | 1.392 | 1.400
CPU1 DIMM | 1.544 | Volts | ok | 1.144 | 1.152 | 1.160 | 1.656 | 1.664 | 1.672
CPU2 DIMM | 1.544 | Volts | ok | 1.144 | 1.152 | 1.160 | 1.656 | 1.664 | 1.672
1.5V | 1.528 | Volts | ok | 1.312 | 1.328 | 1.344 | 1.656 | 1.672 | 1.688
5V | 5.024 | Volts | ok | 4.416 | 4.448 | 4.480 | 5.536 | 5.568 | 5.600
12V | 12.243 | Volts | ok | 10.653 | 10.706 | 10.759 | 13.250 | 13.303 | 13.356
5VSB | 5.088 | Volts | ok | 4.416 | 4.448 | 4.480 | 5.536 | 5.568 | 5.600
3.3V | 3.264 | Volts | ok | 2.880 | 2.904 | 2.928 | 3.672 | 3.696 | 3.720
3.3VSB | 3.288 | Volts | ok | 2.880 | 2.904 | 2.928 | 3.672 | 3.696 | 3.720
VBAT | 2.832 | Volts | nr | 2.880 | 2.904 | 2.928 | 3.672 | 3.696 | 3.720
Fan1 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan2 | 2500.000 | RPM | ok | 200.000 | 300.000 | 400.000 | na | na | na
Fan3 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan4 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan5 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan6 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan7 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Fan8 | na | RPM | na | 200.000 | 300.000 | 400.000 | na | na | na
Intrusion | 0x0 | discrete | 0x0100| na | na | na | na | na | na
Power Supply | 0x0 | discrete | 0x0000| na | na | na | na | na | na
CAT Error | 0x0 | discrete | 0x0000| na | na | na | na | na | na
IOH Error | 0x0 | discrete | 0x0000| na | na | na | na | na | na
CPU Overheat | 0x0 | discrete | 0x0000| na | na | na | na | na | na
Thermal Trip1 | 0x0 | discrete | 0x0000| na | na | na | na | na | na
Thermal Trip2 | 0x0 | discrete | 0x0000| na | na | na | na | na | na
If seems if you “number” the sensors in the list (using hex):
0x00 - CPU1 Temp |
0x01 - CPU2 Temp |
0x02 - Sys Temp |
0x03 - CPU1 Vcore |
0x04 - CPU2 Vcore |
0x05 - CPU1 DIMM |
0x06 - CPU2 DIMM |
0x07 - 1.5V |
0x08 - 5V |
0x09 - 12V |
0x0a - 5VSB |
0x0b - 3.3V |
0x0c - 3.3VSB |
0x0d - VBAT |
0x0e - Fan1 |
0x0f - Fan2 |
0x10 - Fan3 |
0x11 - Fan4 |
0x12 - Fan5 |
0x13 - Fan6 |
0x14 - Fan7 |
0x15 - Fan8 |
0x16 - Intrusion |
0x17 - Power Supply |
0x18 - CAT Error |
0x19 - IOH Error |
0x1a - CPU Overheat |
0x1b - Thermal Trip1 |
0x1c - Thermal Trip2 |
So sensor “Voltage 0x0d” = “VBAT” (CMOS/BIOS battery).
Looking at only this line of output, shows status is “nr” (non recoverable) (vs. the rest that are “ok” or “na”):
VBAT | 2.832 | Volts | nr | 2.880 | 2.904 | 2.928 | 3.672 | 3.696 | 3.720