TrueNAS CORE crashes randomly when reading/writing through SMB

Hello, I hope I’m doing this right. I have had this problem at my business for about a month, so I’ll try my best to give the necessary details:

As the title says, sometimes when an employee tries to copy a folder or some files to a share, the server simply freezes. It does not reboot or respond in any way, and we need to reset it manually to bring it back up.

Hardware details:
CPU: Intel Core i7-9700K
Memory: 2 x 16GB Kingston sticks at 2666MT/s (no XMP enabled in BIOS)
Motherboard: Gigabyte Z390 UD
Storage: 2x HDD IronWolf ST12000VN0008-2YS101
1x Kingston NVMe SNV2S2000G

In the pool we have 1 data vdev with 2 disks in mirror mode and 1 cache vdev with an NVMe drive.

I have followed some tips on this very forum, but to no avail. The memory seems fine, the CPU does not overheat (it stays at a comfortable 40C), and the disks appear to function properly (I run scrub and S.M.A.R.T. tasks weekly). What else can be done?

In a reply, I have attached a photo of the Memory section in the “Reporting” area, taken right after one of our “freezing incidents”, as I thought it was strange for the system to be using swap right before the crash. Any ideas?

My employees use Windows 11 clients, if that matters at all. I really have no idea what to do now.

Thank you for this amazing community and software, please correct me if I missed anything.

It seems I can’t attach media items in this post, but I’ll do my best to describe the picture:

My RAM seems fine, but when the crash happens it goes from 1-5% free to 100% free, and at the same time the swap appears to have some utilization, even though I have 32 GB of RAM.

As mentioned before, I tested it with memtest86+ and the RAM passed with flying colors.

Hi and welcome to the forums.

Sorry to hear about your issue. I’ve increased your user level so you should be able to upload screenshots now as that should be helpful.

Does the issue only happen when writing data? Do you have compression enabled, and if so, which algorithm? Have you identified a particular action that breaks it, e.g. small file writes are OK but large file writes crash?

ZFS deduplication on? Just a guess.


Yep, good shout, worth checking.

Hello and thank you for the quick replies.

Attaching the picture mentioned and a bonus one, since it just happened AGAIN with my employees. The more data the better, right?

About ZFS deduplication: I have no clue what that is and don’t think I have enabled it (unless it is on by default). Where can I take a look?

If you could share a screenshot of your dataset properties that should include most of what we need at first.
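
If a screenshot is awkward, the same properties can also be read from the shell with `zfs get`. For what it's worth, dedup is off by default in ZFS unless someone turned it on. Below is a minimal Python sketch, assuming a dataset called `tank/share` (a made-up name, substitute your own). The helper names `parse_zfs_get` and `read_props` are mine, and the sample output is illustrative, not taken from this system.

```python
import subprocess

def parse_zfs_get(output):
    """Parse tab-separated `zfs get -H -o name,property,value` output into a dict."""
    props = {}
    for line in output.splitlines():
        if not line.strip():
            continue  # skip blank lines
        name, prop, value = line.split("\t")
        props.setdefault(name, {})[prop] = value
    return props

def read_props(dataset):
    """Run the zfs CLI for the given dataset; only works on the NAS itself."""
    out = subprocess.run(
        ["zfs", "get", "-H", "-o", "name,property,value",
         "dedup,compression", dataset],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_zfs_get(out)

# Hypothetical sample output for an assumed dataset "tank/share".
sample = "tank/share\tdedup\toff\ntank/share\tcompression\tlz4\n"
print(parse_zfs_get(sample))
```

On the NAS you would call `read_props("tank/share")` with your real dataset name instead of parsing the sample string.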

What version of Core?


Here are the properties of the dataset.

The version of CORE is:
TrueNAS-13.0-U6.3

Another guess: the Realtek GbE NIC with CORE (but that might just be me!).


So, changing to SCALE should fix the issue? I can’t change the NIC for the moment.

You might want to get a second or third opinion, since this is again a guess. If you have an Intel NIC you can add to the NAS, try it out. (You could also switch to SCALE, but I don’t know if that will work.)

Usually a reboot indicates a kernel panic. This might be related to a buggy driver, or it might be related to pool corruption or another pool issue. If you look through the logs, you can usually figure out what went wrong.
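
To check which NIC driver the box is actually using on CORE, the interface name is a quick hint: on FreeBSD, `re0` means the Realtek `re` driver, while `em`, `igb`, `ix` and `ixl` are Intel drivers. Here is a rough sketch, assuming you paste in the names printed by `ifconfig -l`. The `vendor_for` helper and its table are illustrative and only cover a few common drivers.

```python
import re

# Small, incomplete table of FreeBSD driver prefixes (illustrative only).
DRIVER_VENDOR = {
    "re": "Realtek",
    "em": "Intel",
    "igb": "Intel",
    "ix": "Intel",
    "ixl": "Intel",
}

def vendor_for(ifname):
    """Guess the NIC vendor from a FreeBSD interface name such as 're0'."""
    match = re.match(r"([a-z]+)\d+$", ifname)
    if not match:
        return "unknown"
    return DRIVER_VENDOR.get(match.group(1), "unknown")

# Example interface names; replace with the output of `ifconfig -l`.
for name in ["re0", "igb1", "lo0"]:
    print(name, vendor_for(name))
```

This only applies to CORE (FreeBSD); interface naming on SCALE (Linux) is different.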

What logs should I focus on?

I’m a beginner with this kind of issue. I have been using TrueNAS for a year now and never had something like this happen.

You can download a debug to review on your own via System -> Advanced -> Save Debug. It will gather your log files and basic information about your system. Often the crash directory will contain files generated by the kernel explaining why the reboot occurred. If that directory is empty, you can start with log/messages and the other log files.
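
As a rough starting point for going through log/messages, a small script can pull out the lines that look crash-related. This is only a sketch: the keyword list is my assumption about what might be worth flagging, the `suspicious_lines` helper is a made-up name, and the sample log lines are invented for illustration, not taken from this system.

```python
import re

# Assumed keywords; adjust to whatever actually shows up in your logs.
PATTERN = re.compile(r"panic|fatal trap|watchdog timeout|out of swap",
                     re.IGNORECASE)

def suspicious_lines(lines):
    """Return (line_number, text) for every line matching PATTERN."""
    return [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if PATTERN.search(line)
    ]

# Invented sample lines, only to show the shape of the result.
sample_log = [
    "Jan  1 10:00:00 nas kernel: re0: watchdog timeout\n",
    "Jan  1 10:00:05 nas smbd[1234]: ordinary message\n",
    "Jan  1 10:01:00 nas kernel: panic: example\n",
]

for number, text in suspicious_lines(sample_log):
    print(number, text)
```

To run it against the real file from the extracted debug, open it and pass the file object, e.g. `with open("log/messages") as f: suspicious_lines(f)`.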


Well, I would suggest you bet on some kind of lottery, my friend, because I did exactly that after our talk and, 15 days in, it just works!

I formatted the disk, installed TrueNAS SCALE, and imported my config, and it just works!

If anyone wants the solution: Use SCALE :+1:

It’s not really the best solution. Why? It’s still a Realtek NIC (SCALE has slightly better Realtek support than CORE, but still). If you’re going to do bigger transfers over the network, the best bet is to switch to an Intel NIC.