After upgrading to 25.10 yesterday, I finally tried to add an NVIDIA Tesla T4 that I have had running in a spare server (R640) that is too expensive to keep up 24/7.
I thought I'd add it to my main TrueNAS storage server, pass it through to a VM, and start playing with something available 24/7.
It went well at first: lspci displayed it, and I could pick it in the VM Edit dialog.
I had an issue adding it the first time; the UI spun for a very long time, and in the end it wasn't added.
I tried again, and then starting the VM errored. I realized I had two passthrough devices added and thought that was the cause.
I couldn't remove the extra passthrough device; the UI errored with a Python trace, and then refreshing the VM page led to a red message and no VMs.
Uh-oh. I rebooted TN, hoping it would help. It rarely does, but it did this time.
Everything looked good after I removed the extra passthrough device, and I decided to boot the VM.
Then bam, it hard-crashed the server, and after reboot only an unknown device was shown in the TN VM GPU Edit dialog.
lspci didn't show anything for the NVIDIA device anymore.
I cleaned up the device from the VM and started it, and it went OK.
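For what it's worth, when a card drops off the bus like this, a PCIe rescan from the shell can sometimes bring it back without a full reboot. A hedged sketch (run as root; on my box the device did not come back this way, so no guarantees):

```shell
# Is the T4 visible at all right now?
lspci -nn | grep -i nvidia

# Ask the kernel to re-enumerate the PCIe bus (root required).
echo 1 > /sys/bus/pci/rescan

# Check again after the rescan.
lspci -nn | grep -i nvidia
```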
Today, incidentally, I asked ChatGPT about power draw and how to save some watts with my main box. As it proved resourceful in its advice, I decided to have a go at tweaking the BIOS settings (thinking I needed to reboot anyway, and maybe remove the NVIDIA GPU and reinsert it to solve the issue).
That's when the BIOS started to freeze, oddly.
I shared that with ChatGPT and it told me this, which is what I ultimately want to share:
You didn’t hit a random PCIe incompatibility.
You hit a known failure mode of GPU passthrough on Dell 13G + NVIDIA T4. Let’s unpack what happened and what it means.
**Why it worked… until the VM started**

What you observed:

- T4 enumerates (`lspci` sees it)
- TrueNAS boots fine
- Crash only when the VM starts with the GPU attached

This tells us:

- Basic PCIe enumeration is fine
- The failure happens when the GPU transitions into “active / bus-master / DMA” mode
That transition is exactly what happens when:

- VFIO binds the device
- The VM powers on
- GPU firmware is initialized
- DMA engines start
- PCIe AER is exercised
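As a side note, you can watch that first step happen from the TrueNAS shell. This is my own hedged sketch, not part of ChatGPT's answer; the `06:00.0` slot address is a placeholder for your actual one:

```shell
# Show the device with its kernel driver; once passthrough is armed,
# the "Kernel driver in use" line should read vfio-pci instead of
# nvidia/nouveau.
lspci -nnk -s 06:00.0
```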
**What actually triggered the crash**

When the VM booted:

1. VFIO took ownership of the T4
2. The GPU enabled bus mastering, DMA, and MSI/MSI-X interrupts
3. The GPU tried to access host memory and negotiate PCIe power states
4. The R730xd firmware detected an uncorrectable PCIe error
5. iDRAC escalated → NMI
6. BIOS halted the system

This is exactly what your SEL shows:

> “PCIe fatal error on component at slot 6”
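If you want to read that SEL yourself from the host rather than through the iDRAC web UI, `ipmitool` can dump it. A sketch of mine (assumes `ipmitool` is installed and the host can talk to the BMC):

```shell
# Show the most recent System Event Log entries from the BMC/iDRAC.
ipmitool sel elist | tail -20
# Entries like "PCIe fatal error" point at the slot that faulted.
```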
**Root causes (this combo is the problem)**

1. T4 + Dell 13G + passthrough = unsupported
   - Dell 13G firmware has buggy PCIe AER handling
   - The NVIDIA T4 expects newer PCIe error handling and better ACS isolation
   - When passthrough activates full DMA → fatal error
   - This is a known issue on R730 / R730xd, R630, and some R720s
2. ACS / IOMMU limitations on 13G
   - PCIe slots often share IOMMU groups
   - GPU passthrough stresses ACS routing
   - Dell firmware reacts badly to GPU DMA remapping
   - This usually shows up only when the VM starts
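You can see that grouping concretely. This small sketch of mine (assumes a stock Linux sysfs layout, as on TrueNAS SCALE) prints every PCI device by IOMMU group; a GPU sharing a group with other devices is the symptom described above:

```shell
#!/bin/sh
# Print every PCI device, prefixed by the IOMMU group it belongs to.
base=/sys/kernel/iommu_groups
for dev in "$base"/*/devices/*; do
  [ -e "$dev" ] || continue
  group=$(basename "$(dirname "$(dirname "$dev")")")
  printf 'IOMMU group %s: %s\n' "$group" "$(basename "$dev")"
done
```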
**Power & reset behavior of the T4**

The T4:

- Does not tolerate incomplete PCIe resets well
- Expects modern hot-reset semantics
- Dell 13G does not fully support that

Result: the first VM boot might work, but subsequent resets → fatal error.
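One way to probe that reset behavior without involving a VM at all is to ask the kernel for a function-level reset through sysfs. This is my own hedged sketch, not ChatGPT's advice: `0000:06:00.0` is a placeholder address, root is required, and on a flaky platform this can itself hang the box, so only try it on a machine you can afford to crash:

```shell
# Issue a function-level reset on the card outside any VM.
dev=/sys/bus/pci/devices/0000:06:00.0
if [ -f "$dev/reset" ]; then
  echo 1 > "$dev/reset" && echo "reset issued"
else
  echo "device does not expose a reset method"
fi
```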
**Why this escalates to BIOS corruption / NMI**

After a PCIe fatal error, Dell does NOT gracefully recover. Instead:

1. The PCIe fabric enters an error state
2. Firmware flags a hardware exception
3. Subsequent BIOS access → hang
4. Watchdog → NMI halt

This is why things got worse after the crash.
Now I have to admit this is beyond my level of understanding; I come from software.
But this seems compelling enough to me that I won't try to insert the GPU in my box anymore!
Though I badly need a GPU, and I don't want to have that server running 24/7…
Could someone knowledgeable enough in this field validate that?