Need IX-Systems definitive answer for shutdown commands

truenas-fan · January 22, 2025, 8:46pm

So, it “appears”, that the only “proper” way to shutdown a TrueNAS Scale machine, is with:

/usr/bin/midclt call system.shutdown

This came from the very developers in their youtube channel, BUT,
I’m having a hard time believing that IX-Systems did not use the proper systemd setup in order to properly shutdown their middleware for ANY shutdown or reboot commands (power button, poweroff, reboot, etc.).

So, I’m here asking for the “official” answer to this important process:

Should all people with scripts CHANGE their shutdown commands to the above middleware call?

Thanks.

John · January 23, 2025, 1:03am

In another thread about a similar task on CORE, neofusion tracked that it will have to be an API call on SCALE but I tracked down that it’s safe to call shutdown on CORE (the rc.d system will do what’s right).

Link to neofuson’s post about SCALE.

kris · January 23, 2025, 1:15pm

Its always been recommended to try and use the API calls for specific tasks on TrueNAS, even back on the CORE days. TrueNAS is not some generic BSD/Linux distro. It’s a pretty specifically built appliance, designed to be used through the UI and APIs. While there is access to a shell prompt, it is always recommended to use the API commands for those kinds of operations, since that’s how we test and validate the right things happen here internally. We don’t spend cycles wiring up every shell command (of which their are many) to hook back into the API to do the correct and tested shutdown procedures.

Calling the API will essentially run the exact shutdown procedure you would get had you clicked the “Shutdown” button in the UI. Calling that means as we make changes and improvements to the backend across new versions, your scripts will always be following the recommended and tested procedures for whatever task you are trying to automate.

John · January 23, 2025, 1:32pm

Out of curiosity (mainly because I found this a fun “thread to pull on”/diversion/sidebar), why did you choose to not use the ready-made solution (rc.d) (I mean, I know the rc.d scripts are there, did you find those “not enough” -i.e., did you find a deficiency)?

kris · January 23, 2025, 1:41pm

We used selected rc.d scripts in various places for specific parts of the middleware. But relying on it to do everything right was out of scope and not really feasible. We already had to fork rc.d for that limited usage in the product as it was. So the version in BSD base was already insufficient for our use-case and it was apparent years gone past that we just needed to have finer grained control over what happens, what order, and timers / triggers, etc.

Same goes for systemd. Systemd is far more powerful than rc.d and we possibly could have used it “out of box” to do what we needed here, but its an effort vs impact calculation. Would take a fair bit of work to make that all work properly, and become the supported thing, but we already had a working API which did the right things in the right sequences, so not worth the effort at this time.

John · January 23, 2025, 1:49pm

Thanks. I’m going to try and find that fork and play around a bit (just for the sake of playing around). Seems like fun. Any and all tips welcome (if you remember any “odd ducks”, feel free to point them out). …not sure why that was such a fun diversion though; it was kind of like the first time you wrote a program to parse a sentence.

John · January 23, 2025, 2:15pm

Did a quick search on my coffee break and I can’t find the fork. I’ll have to search after work.

Whattteva · January 23, 2025, 2:43pm

Worth noting; I’m not sure how SCALE does it, but in Proxmox, it helps to have qemu-guest-agent installed on all your VM’s to make the shutdown process a lot smoother and more graceful as well.

John · January 23, 2025, 3:03pm

Didn’t know what that was, googled and found basic information.

See! Fascinating.

The only role QEMU plays in guest commands is unmarshalling and remarshalling the input and output. This means that data from the guest is not being sent directly to a management tool which significantly decreases the guest attack surface.

In the inner workings of a shutdown/startup “thing” I can foresee problems with dealing with things like interrupts and/or hangs. And after a very quick github search on freebsd’s rc.d I found something touching on that topic right away.

# If requested, start a watchdog timer in the background which
# will terminate rc.shutdown if rc.shutdown doesn't complete
# within the specified time.
#
_rcshutdown_watchdog=
if [ -n "$rcshutdown_timeout" ]; then
	debug "Initiating watchdog timer."

Obviously, I don’t have any context at the moment (I’ve spent longer typing this post than searching/reading) but, this is fun.

neofusion · January 23, 2025, 4:28pm

The shutdown command used when the UPS service in SCALE needs to bring the system down appears to be /sbin/shutdown -p now, unless the user has overridden it with their own command, is that intentional or an oversight?

John · January 23, 2025, 4:39pm

I had the same thought when we first starting pulling on this thread.

kris · January 23, 2025, 5:05pm

If that exists in the nightly versions of what will become Fangtooth, let us know and we should fix.

neofusion · January 23, 2025, 5:56pm

Ticket: [NAS-133743] - iXsystems TrueNAS Jira

Stux · January 23, 2025, 10:56pm

AFAIK, ASPI shutdowns (ie power buttons and VMs) also trigger /sbin/shutdown, rather than the API.

(Actually that’s a simplification, it’s really handled internally by systemd)

The point is, the power button should trigger a proper shutdown, and if a proper shutdown should involve the API, then the API needs to be tied back into systemd via a before/after type service.

M3NACE · January 24, 2025, 12:08am

im going to follow this, I have a need to shutdown my NAS at midnight everynight and using the built in cronjob im using shutdown -h now, if thats not correct I need to update that.

neofusion · January 24, 2025, 6:19pm

In my humble opinion, it’s not very ambiguous.

They tell you to use the midclt command in scripts in the T3 video, and kris even came here to say that you should use the midclt command to shut your system down instead of directly calling shutdown, poweroff or similar.

My Jira bug report is not related to that, it’s focused on a possible oversight in the UPS service.

You should change your scripts.

John · January 24, 2025, 9:39pm

I finally got around to following up on the UPS for CORE and I verified the shutdown command is in fact being called if the end user does not fill in the ‘shutdown command’ value in the UI.

shutdown_cmd = '/sbin/shutdown -p now'

github.com

truenas/middleware/blob/truenas/13.3-u1-stable/src/middlewared/middlewared/etc_files/local/nut/upsmon.conf

<%
	import re

	ups_config = middleware.call_sync('ups.config')
	user = re.sub(r'([#$])', r'\\\1', ups_config['monuser'])
	powerdown = '/etc/killpower' if ups_config['powerdown'] else '/etc/nokillpower'
	for field in filter(
		lambda f: not ups_config[f],
		['monpwd', 'identifier', 'mode', 'monuser']
	):
		middleware.logger.debug(f'UPSMON: {field} field empty, upsmon will fail to start.')
	ident = ups_config['complete_identifier']
	if not ups_config['shutdowncmd'] and not (
		not (middleware.call_sync('system.is_freenas')) and (
			middleware.call_sync('truenas.get_chassis_hardware')).startswith('TRUENAS-X')
	):
		shutdown_cmd = '/sbin/shutdown -p now'
	else:
		shutdown_cmd = ups_config['shutdowncmd'] or ''
%>\

This file has been truncated. show original

History show this was added in 2019.

I checked the documentation and there is NOT a recomended value for this setting in the documentation

NOTE (for 13.3):

My recommendations (I know my recommendations are not worth much but this is a force of habit):

(lowest impact) Leave defult hard coded, and modify documentation for 13.0 and 13.3.
(highest impact) Change default.

NOTE: I typed and checked as much as I could but you may/should still quickly double check (links provided for your convience). I only spent 5 minutes on this task (I was moving quick and may have missed something).

Also, I was not able to locate the rc.d fork you mentioned. I know you’re busy but if you happen to have that location handy I’d still be interested in toying around.

EDIT: I forgot to mention; I said #1 was lowest impact becuase I previously verified there is a rc.d shutdown script which calls the middleware shutdown script (two links below).

github.com

truenas/middleware/blob/truenas/13.3-stable/src/freenas/etc/ix.rc.d/ix-shutdown

#!/bin/sh
#
# $FreeBSD$
#

# PROVIDE: ix-shutdown
# REQUIRE: middlewared LOGIN
# KEYWORD: shutdown

. /etc/rc.subr

do_shutdown()
{
	/usr/local/bin/midclt call core.event_send system ADDED '{"id": "shutdown"}' > /dev/null

	/usr/local/bin/midclt call -job initshutdownscript.execute_init_tasks SHUTDOWN > /dev/null 2>&1
}

name="ix_shutdown"
start_cmd=':'

This file has been truncated. show original

github.com

truenas/middleware/blob/truenas/13.3-stable/src/middlewared/middlewared/plugins/init_shutdown_script.py

import asyncio
import os
import stat
import subprocess
import tempfile

from middlewared.schema import Bool, Dict, File, Int, Patch, Str, ValidationErrors, accepts
from middlewared.service import CRUDService, job, private
import middlewared.sqlalchemy as sa
from middlewared.utils import Popen
from middlewared.validators import Range


class InitShutdownScriptModel(sa.Model):
    __tablename__ = 'tasks_initshutdown'

    id = sa.Column(sa.Integer(), primary_key=True)
    ini_type = sa.Column(sa.String(15), default='command')
    ini_command = sa.Column(sa.String(300))
    ini_script = sa.Column(sa.String(255), nullable=True)

This file has been truncated. show original

csj · January 25, 2025, 2:46pm

Why do you think this is a bug? What are you trying to solve?

winnielinnie · January 25, 2025, 2:51pm

It’s inconsistent behavior.

On the one hand you have iX informing its users to invoke the midclt method of shutting down the server via scripting or the command-line. “Do not use the OS’s shutdown command.”

On the other hand, the default command for shutting down the server in the UPS configuration is /sbin/shutdown -P now

It should default to invoking the midclt method.

csj · January 25, 2025, 2:57pm

So there isn’t really a bug that is trying to be fixed. You all are just worried about inconsistent messaging?