I’m trying to use docker compose and for awhile everything was working. I was able to load the module for nvidia_drm and setup nvidia-modeset, but all of a sudden, no reboot or anything, it’s gone. nvidia_drm is not found to load and /dev/nvidia-modeset is missing. I tried using nvidia-smi and it sees the video card. When i attempt to use it in plex for transcoding, i get a transcode error and no processes show up. it’s like it attempts to load a process but fails. Any help would be apprecaited!
additional details:
this is what i see when i try to load nvidia_drm
root@pyra:/# modprobe -r nvidia_drm ; modprobe nvidia_drm modeset=1
modprobe: FATAL: Module nvidia_drm not found.
modprobe: FATAL: Module nvidia_drm not found in directory /lib/modules/6.6.29-production+truenas
when i check that library, there is no module nvidia_drm.
whoops. sorry. at the time, no there was no reboot. I did reboot to see if the driver would load, but it did not.
I’m running Dragonfish-24.04.1.1
the GPU is a 3050
±--------------------------------------------------------------------------------------+
| NVIDIA-SMI 545.23.08 Driver Version: 545.23.08 CUDA Version: 12.3 |
|-----------------------------------------±---------------------±---------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA GeForce RTX 3050 Off | 00000000:07:00.0 Off | N/A |
| 0% 56C P0 34W / 130W | 1MiB / 8192MiB | 2% Default |
| | | N/A |
±----------------------------------------±---------------------±---------------------+
±--------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| No running processes found |
±--------------------------------------------------------------------------------------+
I can do this, but no process can use the card. It tried, the “no running processes found” disappears for a few seconds, then comes back and an error is presented in the application.
root@pyra:~# ls -la /dev | grep nvidia
drwxr-xr-x 2 root root 80 Jun 23 11:49 nvidia-caps
crw-rw-rw- 1 root root 239, 0 Jun 23 11:49 nvidia-uvm
crw-rw-rw- 1 root root 239, 1 Jun 23 11:49 nvidia-uvm-tools
crw-rw-rw- 1 root root 195, 0 Jun 23 11:49 nvidia0
crw-rw-rw- 1 root root 195, 255 Jun 23 11:49 nvidiactl
additionally, here’s the error i get in dockage trying to load fileflows node with GPU support:
Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as ‘legacy’
nvidia-container-cli: mount error: stat failed: /dev/nvidia-modeset: no such file or directory: unknown