24.10 Nvidia driver installation error

I upgraded from 24.04.2.3 to 24.10 today on two different systems. When I tried to install the Nvidia drivers via the app config checkbox, I got the following error on both.

[EFAULT] Command apt -y remove gcc make pkg-config failed (code 100):
Reading package lists...
Building dependency tree...
Reading state information...
Package 'gcc' is not installed, so not removed
Package 'make' is not installed, so not removed
WARNING: apt does not have a stable CLI interface. Use with caution in scripts.
E: Unable to locate package pkg-config

Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/middlewared/plugins/nvidia.py", line 93, in _install_packages
    subprocess.run(["apt", "-y", "install"] + temporary + permanent, **kwargs)
  File "/usr/lib/python3.11/subprocess.py", line 571, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['apt', '-y', 'install', 'gcc', 'make', 'pkg-config', 'libvulkan1', 'nvidia-container-toolkit', 'vulkan-validationlayers']' returned non-zero exit status 100.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 488, in run
    await self.future
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 535, in __run_body
    rv = await self.middleware.run_in_thread(self.method, *args)
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/main.py", line 1364, in run_in_thread
    return await self.run_in_executor(io_thread_pool_executor, method, *args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/main.py", line 1361, in run_in_executor
    return await loop.run_in_executor(pool, functools.partial(method, *args, **kwargs))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/plugins/nvidia.py", line 56, in install
    with self._install_packages(
  File "/usr/lib/python3.11/contextlib.py", line 137, in __enter__
    return next(self.gen)
           ^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/plugins/nvidia.py", line 98, in _install_packages
    subprocess.run(["apt", "-y", "remove"] + temporary, **kwargs)
  File "/usr/lib/python3.11/subprocess.py", line 571, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['apt', '-y', 'remove', 'gcc', 'make', 'pkg-config']' returned non-zero exit status 100.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 488, in run
    await self.future
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 533, in __run_body
    rv = await self.method(*args)
         ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/schema/processor.py", line 49, in nf
    res = await f(*args, **kwargs)
          ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/schema/processor.py", line 179, in nf
    return await func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3/dist-packages/middlewared/plugins/docker/update.py", line 106, in do_update
    await (
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 436, in wait
    raise self.exc_info[1]
  File "/usr/lib/python3/dist-packages/middlewared/job.py", line 492, in run
    raise handled
middlewared.service_exception.CallError: [EFAULT] Command apt -y remove gcc make pkg-config failed (code 100):
Reading package lists...
Building dependency tree...
Reading state information...
Package 'gcc' is not installed, so not removed
Package 'make' is not installed, so not removed

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

E: Unable to locate package pkg-config
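
A note for anyone reading the traceback: the error shown in the UI is about "apt -y remove", but the first exception in the chain is the failed "apt -y install" at line 93 of nvidia.py. The remove step at line 98 then fails too, and that second failure is the one that gets reported. The sketch below reproduces that chaining pattern. It is not the middlewared code: install_packages, fake_run and demo are hypothetical names, and the try/finally layout is an assumption based only on the two line numbers in the traceback.

```python
import subprocess
from contextlib import contextmanager


@contextmanager
def install_packages(temporary, permanent, run=subprocess.run):
    """Hypothetical stand-in for the install/cleanup pattern in the traceback."""
    try:
        # Install build-time (temporary) and runtime (permanent) packages together.
        run(["apt", "-y", "install"] + temporary + permanent, check=True)
        yield
    finally:
        # Cleanup runs even when the install failed. If cleanup also fails,
        # its exception propagates and the install error becomes its __context__.
        run(["apt", "-y", "remove"] + temporary, check=True)


def fake_run(cmd, check=True):
    # Simulated apt that always exits with status 100, so no real apt is needed.
    raise subprocess.CalledProcessError(100, cmd)


def demo():
    """Return the exception a caller would see when both steps fail."""
    try:
        with install_packages(["gcc", "make", "pkg-config"], ["libvulkan1"], run=fake_run):
            pass
    except subprocess.CalledProcessError as exc:
        return exc


if __name__ == "__main__":
    exc = demo()
    print("reported:  ", " ".join(exc.cmd))
    print("root cause:", " ".join(exc.__context__.cmd))
```

So when chasing this error, the install failure is the one to investigate; the "remove" message is a follow-on.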

I am using an Nvidia T600 in one NAS and an RTX A2000 in the other, both for Plex. The GPUs are listed when I run lspci | grep NVIDIA.

NAS1

02:00.0 VGA compatible controller: NVIDIA Corporation TU117GL [T600] (rev a1)

NAS2

43:00.0 VGA compatible controller: NVIDIA Corporation GA106 [RTX A2000] (rev a1)
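
If you want to run the same check from a script, here is a minimal sketch that filters lspci output for NVIDIA devices. The function name nvidia_devices and the Intel line in the sample are made up for illustration; only the T600 line comes from the output above. Keep in mind that a card showing up in lspci only means it is visible on the PCI bus, not that a driver is installed or loaded.

```python
import re

# Matches lines such as:
# 02:00.0 VGA compatible controller: NVIDIA Corporation TU117GL [T600] (rev a1)
LSPCI_LINE = re.compile(r"^(?P<slot>\S+)\s+(?P<cls>[^:]+):\s+(?P<device>.+)$")


def nvidia_devices(lspci_output):
    """Return (slot, device description) pairs for NVIDIA entries in lspci text."""
    devices = []
    for line in lspci_output.splitlines():
        match = LSPCI_LINE.match(line.strip())
        if match and "NVIDIA" in match.group("device"):
            devices.append((match.group("slot"), match.group("device")))
    return devices


if __name__ == "__main__":
    sample = (
        "02:00.0 VGA compatible controller: NVIDIA Corporation TU117GL [T600] (rev a1)\n"
        "00:1f.3 Audio device: Intel Corporation Device 1234\n"  # hypothetical line
    )
    for slot, device in nvidia_devices(sample):
        print(slot, device)
```

To use it on a real system, pass in the text captured from running lspci instead of the sample string.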

Any help in resolving this issue would be much appreciated.

One of these actions resolved my issue: I refreshed apps, deleted heavyscript from the root user directory, and deleted the cron job for it. After that I was able to install the Nvidia driver, and from there I just followed HoneyBadger’s fix.

Could you elaborate, please? What’s the HoneyBadger fix? I have nothing in cron jobs and get the same error. I use the card for Frigate and Jellyfin, neither of which now shows a GPU in its options.

This is the fix I used after I resolved my driver installation issue, the one I referred to as HoneyBadger’s fix:
Docker Apps and UUID issue with NVIDIA GPU after upgrade to 24.10

Regarding the driver installation issue, I’m honestly not sure what the cause was or how I actually fixed it, just that I did those three things. Perhaps the third-party scripts I had in the root user directory were preventing some sort of check from working, or the driver download wasn’t available at that time. Either way, I’m not Linux-savvy enough to know. Perhaps someone else in the community can shed some light on this.


Thanks, that got me going!