Data lost? or issue with drive?

hello,

i booted my truenas …during the boot and inside the dmesg i can see the following

da0: quirks=0x140<RETRY_BUSY,STRICT_UNMAP>
ses0: pass1,ada0 in 'Slot 00', SATA Slot: scbus33 target 0
ses0: pass2,ada1 in 'Slot 01', SATA Slot: scbus34 target 0
ses0: pass3,ada2 in 'Slot 05', SATA Slot: scbus38 target 0
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
ahcich35: Timeout on slot 9 port 0
ahcich35: is 00000000 cs 00000200 ss 00000000 rs 00000200 tfd d0 serr 00000000 cmd 0004c917
(ada2:ahcich35:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 00 00 00 00 00 00
(ada2:ahcich35:0:0:0): CAM status: Command timeout
(ada2:ahcich35:0:0:0): Retrying command, 0 more tries remain
Root mount waiting for: ada
Root mount waiting for: ada
ahcich35: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada

ahcich35: Timeout on slot 10 port 0
ahcich35: is 00000000 cs 00000400 ss 00000000 rs 00000400 tfd 80 serr 00000000 cmd 0004ca17
(aprobe0:ahcich35:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
(aprobe0:ahcich35:0:0:0): CAM status: Command timeout
(aprobe0:ahcich35:0:0:0): Retrying command, 0 more tries remain
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
ahcich35: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
Root mount waiting for: ada
intsmb0: <Intel PIIX4 SMBUS Interface> port 0x1040-0x104f at device 7.3 on pci0
intsmb0: intr SMI disabled revision 0
smbus0: <System Management Bus> on intsmb0

after some time it booted, and i unlocked the root zfs.

Is it possible to say what Ada ie specific drive is it? One of the 3 in raid1?

Is it possible that files on that zfs got corrupted?

I used Jail for a backup, the last one 6months ago, but now when i login into that jail i shows files from 2022.

Any idea what is that?

Thank you

Whats version of TrueNAS is this? Core?

Is the nas failing to boot entirely or does the Web UI come up? If the web UI comes up you can likely get the info you want from the Storage/Disks tab/button (depending on if you’re on Core or Scale). That’d be the lazy way I would do it.

@Fleshmauler thanks

its Core

It does boot after a while, web ui is up. I also did ssh to it as root to grab dmesg logs.

i am confused whats going on. and the most concern issue with ata during the boot.

i can confirm i found that proper jail - so it looks like no files corrupted inside zfs pool.

question would be the error in the dmesg?

Huh - I just realized that core has ada1/2/3/4/etc naming structure, so ‘ada’ itself gives no relevant info. To me those messages look like timeouts as the NAS is trying to contact drives on specific ports, though I have no clue what ‘slot 9 port 0’ and ‘slot 10 port 0’ correspond to, or if there are even disks connected to these slots.

yes i have ada0, ada1, ada2. so thats why i am confused why log shows only ada.

does it mean all drives might be problematic/have issue?

i cant run /execute smartctl long test on these?

So wait, all drives show up & everything is working fine? Is the issue that you see these messages during boot at all, or that boot is now suddenly slow & now suddenly you see those messages?

To me, and I could be wrong, the system is going per port to identify any drives & some ports aren’t responding until the command times out. If you have any ports without connected drives, this’d make sense to me.

Hopefully someone smarter than me can confirm my thoughts.

You’re always welcome to review smartctl results/tests, and zpool status/scrubs. Nothing here is jumping at me as any kind of corruption or data loss.

Yes, all drives and i unlocked encrypted pool.

During the boot there plenty of these messages, it took some time and afterwards system booted.

There are 3 sata unconnected ports. But it never does that before.

Zpool status i have provided says no errors.

Have to run smartctl long and Scrutity to evaluate results.

Any other ideas.

Appreciate again your support here.

Sounds like a failing boot device. What is your complete hardware?

First, save the configuration file.

Then hardware details, please, alongside a few outputs:

zpool status -v
camcontrol devlist
smartctl -x /dev/XXX

where XXX is the boot drive

Hello thanks,will do today. Why do u suspect booting drive?

Thx