Hello everybody, my NVIDIA driver installation has some sort of problem. After I tried (and failed) to install some custom apps, the system can no longer use the graphics card.
The error seems to be in CUDA, as my Frigate deployment log shows:
Generating yolov7x-640.trt. This may take a few minutes.
2025-02-24 07:39:19.359145+00:00 2025-02-24T07:39:19.359145118Z
2025-02-24 07:39:19.672104+00:00 Traceback (most recent call last):
2025-02-24 07:39:19.672144+00:00 File "/usr/local/src/tensorrt_demos/yolo/onnx_to_tensorrt.py", line 244, in <module>
2025-02-24 07:39:19.672263+00:00 main()
2025-02-24 07:39:19.672309+00:00 File "/usr/local/src/tensorrt_demos/yolo/onnx_to_tensorrt.py", line 229, in main
2025-02-24 07:39:19.672335+00:00 engine = build_engine(
2025-02-24 07:39:19.672353+00:00 File "/usr/local/src/tensorrt_demos/yolo/onnx_to_tensorrt.py", line 114, in build_engine
2025-02-24 07:39:19.672395+00:00 with trt.Builder(TRT_LOGGER) as builder, builder.create_network(EXPLICIT_BATCH) as network, trt.OnnxParser(network, TRT_LOGGER) as parser:
2025-02-24 07:39:19.672421+00:00 TypeError: pybind11::init(): factory function returned nullptr
2025-02-24 07:39:19.701560+00:00 [02/24/2025-08:39:19] [TRT] [W] Unable to determine GPU memory usage
2025-02-24 07:39:19.701611+00:00 [02/24/2025-08:39:19] [TRT] [W] Unable to determine GPU memory usage
2025-02-24 07:39:19.701621+00:00 [02/24/2025-08:39:19] [TRT] [W] CUDA initialization failure with error: 100. Please check your CUDA installation: http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html
2025-02-24 07:39:19.701633+00:00 Loading the ONNX file…
2025-02-24 07:39:19.701701+00:00 Available tensorrt models:
2025-02-24 07:39:19.703197+00:00 ls: cannot access '*.trt': No such file or directory
2025-02-24 07:39:19.703576+00:00 s6-rc: warning: unable to start service trt-model-prepare: command exited 2
Do you know a way to remove the current CUDA toolkit so that I can reinstall the proper one?
I already tried removing the NVIDIA drivers via Apps/Configuration/Settings/Install NVIDIA Drivers and restarting the system, but no luck.
Thanks in advance for your precious help.
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.127.05             Driver Version: 550.127.05     CUDA Version: 12.4     |
|-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  NVIDIA GeForce GTX 1060 6GB    Off |   00000000:08:00.0 Off |                  N/A |
|  0%   44C    P0             22W /  120W |       0MiB /   6144MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI        PID   Type   Process name                              GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+
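
For reference, error 100 in the TensorRT warning above is the code CUDA uses for "no CUDA-capable device is detected" (CUDA_ERROR_NO_DEVICE in the driver API). Below is a minimal Python sketch for repeating that check outside Frigate. It assumes the driver library is loadable as libcuda.so.1, which may not hold on every system. The helper names try_cuda_init and describe_cu_result are made up for illustration, and the code table is only a small subset of the values in cuda.h.

```python
import ctypes

# Small subset of CUresult codes from the CUDA driver API header (cuda.h).
CU_RESULT_NAMES = {
    0: "CUDA_SUCCESS",
    3: "CUDA_ERROR_NOT_INITIALIZED",
    100: "CUDA_ERROR_NO_DEVICE",
    101: "CUDA_ERROR_INVALID_DEVICE",
    999: "CUDA_ERROR_UNKNOWN",
}


def describe_cu_result(code):
    """Map a numeric CUresult to its name, if it is in the table above."""
    return CU_RESULT_NAMES.get(code, f"unlisted CUresult {code}")


def try_cuda_init(lib_name="libcuda.so.1"):
    """Call cuInit(0) and return (code, description).

    code is None when the driver library itself cannot be loaded.
    The default library name is an assumption about the system.
    """
    try:
        libcuda = ctypes.CDLL(lib_name)
    except OSError as exc:
        return None, f"could not load {lib_name}: {exc}"
    libcuda.cuInit.argtypes = [ctypes.c_uint]
    libcuda.cuInit.restype = ctypes.c_int
    code = libcuda.cuInit(0)
    return code, describe_cu_result(code)


if __name__ == "__main__":
    print(try_cuda_init())
```

Running this both where the nvidia-smi output above was taken and inside the Frigate container might show at which point the GPU stops being visible to CUDA.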