Application(s) have failed to start / all apps are gone

Hi,
I just did a upgrade to 24.04.2 (from 24.04.1.1). All my apps are now gone and this error shows.

Application(s) have failed to start:
[EFAULT] Unable to configure node: Containerd socket is not available
Failed to sync TRUENAS catalog: [EFAULT] Failed to clone 'https://github.com/truenas/charts.git' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truenas_charts_git_master' destination: [EFAULT] Failed to clone 'https://github.com/truenas/charts.git' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truenas_charts_git_master' destination: Cloning into '/mnt/APPS/ix-...
Failed to sync TRUECHARTS catalog: [EFAULT] Failed to clone 'https://github.com/truecharts/catalog' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truecharts_catalog_main' destination: [EFAULT] Failed to clone 'https://github.com/truecharts/catalog' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truecharts_catalog_main' destination: Cloning into '/mnt/APPS/ix-...

I need some help to get this going again.

My system:
CPU: N6005
RAM: 64GB
APPS: 1 x DISK | 1 wide | 223.57 GiB
DATA: 1 x RAIDZ1 | 5 wide | 7.28 TiB

Do a search for Truecharts on forum. See latest postings.

That’s not the reason for the failure to clone the TrueNAS repo, nor for the containerd socket to not be available.

@padre, is your apps pool online?

Truecharts have pulled out - which is why the catalogue no longer updates.

That is entirely separate from why your existing apps have disappeared.

As previously suggested, check that all your pools are online.

You might also ask other experts whether restoring your old configuration file might help.

@Protopia @dan
Yes, the pools are online. I had to Unset Pool and then Choose Pool to get it working again. But that feels like a adhoc solution.

@SmallBarky thank you

[this is SOLVED for me btw, I am even more sorry for opening this again. Look at bottom for solution.]

Hi,
i hate to open topics again but I can’t get to fixing this Problem and the found “adhoc solution” is not working at all. It is still the same problem as stated at the start of this thread.
The Error was thrown after my daily reboot, the Synchting was running for a few days and flame was installed and running yesterday.
Unsetting and setting the pool won’t work with the side-effect of my applications being gone for some time which is really bad because Syncthing was among them.

The Error log:
**2024-08-16 11:19:08** 0/1 nodes are available: 1 node(s) had untolerated taint {ix-svc-stop: }. preemption: 0/1 nodes are available: 1 Preemption is not helpful for scheduling..

The Error notification
[EFAULT] Unable to configure node: Containerd socket is not available

For my setup:
TrueNAS running on Proxmox
Proxmox scripts to note: System shutdown at 00:00, TrueNAS has 10min timeout.
CPU: ntel(R) Xeon(R) CPU E5-2683 v3 (16 threads for TrueNAS)
RAM: 32GB ECC (16GB for TrueNAS)
1 Pool with 3 Data-Datasets and 2 Apps-Datasets (1 extra for Syncthing storage)
→ Pool has 1 Raidz1 3x2TB = 3.63TiB, no SLOG, no L2ARC
–>Apps: Syncthing, flame

Edit: gone from the apps menu, not the Dataset
Edit: Apps not permanently gone, logs were available again.

Important Edit:
I had an init script running which set the zfs_txg_timeout to 600 because I hate the HDD noise. I am living in the same room as my server. Apparently this kept the Apps from deploying and produced said errors. This is 100% reproducible with (de)activating the script. Editing zfs_txg_timeout after initializing the apps is working fine.

PSA: That’s not the proper way of making the drives less noisy. Leave the transaction groups alone.

I just have the “HeavyScript” scheduled via cron. No others.The system just rebooted and the apps are not doing well.

Failed to sync TRUENAS catalog: [EFAULT] Failed to clone 'https://github.com/truenas/charts.git' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truenas_charts_git_master' destination: [EFAULT] Failed to clone 'https://github.com/truenas/charts.git' repository at '/mnt/APPS/ix-applications/catalogs/github_com_truenas_charts_git_master' destination: Cloning into '/mnt/APPS/ix-...
2024-08-17 15:32:13 (Europe/Stockholm) 
Failed to start kubernetes cluster for Applications: [EFAULT] Unable to configure node: Containerd socket is not available
2024-08-17 15:33:35 (Europe/Stockholm) 

I have to unset the pool and then choose it again to get the apps working.

Definetely. Thats really not best practice or practice at all.
I just don’t think that vibration-proofing my enclosure would absorb the absurdly loud seek noises of the HDDs. And at the moment I only have 3/5 active.
I am a student and the server is in my 1 room flat where I try to work and live where the brrrt every 5 seconds is really distracting.

I was referring to spinning them down.

They don’t spin down, I deactivated thatt energy-saving feature. I know that it would wear out the drives faster.

Not necessarily, but it depends on your priorities and your use case. However, we are going offtopic.

1 Like