Hello folks
As I said, one of my 16 × 10 TB disks died and another was on the verge of doing so too, so I had to shut the system down for a while.
So as soon as I could, I added two new disks to the system and rebooted, but…
Now the machine says that all disks are offline, with the zpool "exported" and the new disks as not available (normal, as they are not formatted yet).
What bugs me is that if I want to import the zpool as it was (named myzpool), I have to choose between "new pool" and "existing pool"… It sees that the disks belong to "myzpool", but when I need to choose that existing zpool there is nothing in the chooser!
How can I re-import my pool and repair it with the new disks without losing all my files?
Can I really import the disks into a new pool without losing anything?
And why can't I just re-import it, since it knows which zpool the disks belong to?
Thanks for a speedy answer please, as it's really urgent.
One of the problems with ZFS is that it does not like to import "damaged" pools. Now there could be a completely reasonable cause for that damage, like a bad disk that has not yet been replaced. It would be nice if the GUI had the ability to import such pools. But the intent is to keep the NAS running… which is different for SOHO users than for Enterprise users.
As with most UIs, the TrueNAS one only deals with the most common use cases, and there are just so many ways that pools can go wrong that the UI cannot hope to deal with all of them. And then you have to rely on the command line, and it is then easy to try the wrong thing and lose your pool completely.
I appreciate that you think doing something to get it back online is urgent, but please trust me when I say that taking it slower and getting your data back online again will be more important to you than rushing in, making a mistake and losing your data completely.
So we need some hard facts before we can help you and as @Arwen says, the output of sudo zpool import is the most important.
But also please run the following additional commands which will also help us:
Hmm… OP asks for "a speedy answer please as it's really urgent" and he gets 3 speedy answers within a couple of hours, however it doesn't really appear to be "really urgent" because here we are another 11 hours later and we still haven't had the information requested 12 hours ago.
hello there
Sorry for the late reply. I'm somewhat disabled, and one of my problems is that I fall asleep at any hour of the day.
Btw, looking at the output of the commands you asked for, I found that one of the degraded disks was not seen by TrueNAS SCALE, which is why it was not loading the zpool.
I've replaced the cable with a new one and now it sees the disk (damn 3.3 V stuff on new disks…), so it is resilvering and replacing the disks now. Loooong wait.
I really thank you all for replying so fast. I know that I can count on you when some problem occurs with TrueNAS.
Btw II:
My config is as follows:
Nas[~]$ sudo lspci
00:00.0 Host bridge: Intel Corporation 2nd Generation Core Processor Family DRAM Controller (rev 09)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200/2nd Generation Core Processor Family PCI Express Root Port (rev 09)
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)
00:16.0 Communication controller: Intel Corporation 6 Series/C200 Series Chipset Family MEI Controller #1 (rev 04)
00:19.0 Ethernet controller: Intel Corporation 82579V Gigabit Network Connection (rev 05)
00:1a.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #2 (rev 05)
00:1b.0 Audio device: Intel Corporation 6 Series/C200 Series Chipset Family High Definition Audio Controller (rev 05)
00:1c.0 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 1 (rev b5)
00:1c.4 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 5 (rev b5)
00:1c.5 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 6 (rev b5)
00:1c.6 PCI bridge: Intel Corporation 6 Series/C200 Series Chipset Family PCI Express Root Port 7 (rev b5)
00:1d.0 USB controller: Intel Corporation 6 Series/C200 Series Chipset Family USB Enhanced Host Controller #1 (rev 05)
00:1f.0 ISA bridge: Intel Corporation Z68 Express Chipset LPC Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 6 Series/C200 Series Chipset Family 6 port Desktop SATA AHCI Controller (rev 05)
00:1f.3 SMBus: Intel Corporation 6 Series/C200 Series Chipset Family SMBus Controller (rev 05)
01:00.0 Serial Attached SCSI controller: Broadcom / LSI SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
02:00.0 Ethernet controller: Mellanox Technologies MT27500 Family [ConnectX-3]
03:00.0 USB controller: ASMedia Technology Inc. ASM1042 SuperSpeed USB Host Controller
04:00.0 USB controller: ASMedia Technology Inc. ASM1042 SuperSpeed USB Host Controller
05:00.0 SATA controller: JMicron Technology Corp. JMB362 SATA Controller (rev 10)
With 17 (soon 18) 10 TB SATA disks, one 3 TB disk for the system pool and one 500 GB WD Red SSD for cache, on a SAS2008 card with a 24-port expander card.
Running on a 10 Gbit/s FTTH line.
My pool is RAIDZ2 with multiple datasets and SMB/NFS shares.
All in a neat Fractal Design Define XL R2 box (heavy, but very sturdy!).
I'll replace this one with a Define XL R7 soon (more room and better airflow).
Nas[~]$ sudo sas2flash -list
LSI Corporation SAS2 Flash Utility
Version 20.00.00.00 (2014.09.18)
Copyright (c) 2008-2014 LSI Corporation. All rights reserved
Adapter Selected is a LSI SAS: SAS2008(B2)
Controller Number : 0
Controller : SAS2008(B2)
PCI Address : 00:01:00:00
SAS Address : 500605b-0-05f6-f320
NVDATA Version (Default) : 14.01.00.08
NVDATA Version (Persistent) : 14.01.00.08
Firmware Product ID : 0x2213 (IT)
Firmware Version : 20.00.07.00
NVDATA Vendor : LSI
NVDATA Product ID : SAS9211-8i
BIOS Version : 07.39.02.00
UEFI BSD Version : N/A
FCODE Version : N/A
Board Name : SAS9211-8i
Board Assembly : N/A
Board Tracer Number : N/A
Finished Processing Commands Successfully.
Exiting SAS2Flash.
Nas[~]$ sudo sas3flash -list
Avago Technologies SAS3 Flash Utility
Version 16.00.00.00 (2017.05.02)
Copyright 2008-2017 Avago Technologies. All rights reserved.
No Avago SAS adapters found! Limited Command Set Available!
ERROR: Command Not allowed without an adapter!
ERROR: Couldn't Create Command -list
Exiting Program.
As soon as the resilvering has completed you need to run a scrub to try to clean up the 1636 data (checksum) errors - and once that has run you should do a sudo zpool status -v elZ2 to see what errors remain.
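For anyone who prefers to script those two steps, here is a minimal sketch. The pool name elZ2 comes from this thread; the helper names are made up for illustration, and the `-w` flag (wait for the scrub to finish) assumes an OpenZFS release that supports it (2.0 or later, as far as I know). It defaults to a dry run that only prints the commands.

```python
import subprocess

POOL = "elZ2"  # pool name as used in this thread


def post_resilver_commands(pool: str) -> list[list[str]]:
    """Return the argv lists for the scrub and the follow-up status check."""
    return [
        # Assumption: this OpenZFS version accepts -w (block until the scrub ends).
        ["zpool", "scrub", "-w", pool],
        # -v lists the files affected by any remaining errors.
        ["zpool", "status", "-v", pool],
    ]


def run(commands: list[list[str]], dry_run: bool = True) -> list[str]:
    """Print each command; execute it through sudo only when dry_run is False."""
    shown = []
    for argv in commands:
        line = " ".join(["sudo", *argv])
        shown.append(line)
        print(line)
        if not dry_run:
            subprocess.run(["sudo", *argv], check=True)
    return shown


if __name__ == "__main__":
    # Dry run only: nothing is executed until dry_run=False is passed.
    run(post_resilver_commands(POOL), dry_run=True)
```

Only start it once the resilver has completed, and expect the scrub itself to take a long time on a pool of this size.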
The sas2flash shows you are running IT firmware which is correct - I haven't checked whether this is the latest firmware for this device.
As an aside I note that this is a 16-wide RAIDZ2 vDev and the recommended maximum width for new vDevs is 12. The recommendation is only based on resilvering times, and I note that you are looking at c. 1 day for your existing resilvers (which is not too bad) so I am not suggesting that you need to do anything about this existing vDev, just bear it in mind for the future.
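To put rough numbers on the other side of that trade-off, here is a back-of-the-envelope sketch of how much raw capacity a RAIDZ2 vDev leaves for data at width 12 versus width 16. It is simple arithmetic only: it assumes 10 TB disks as in this system and ignores padding, metadata and reserved space, so real usable figures will be lower. The function names are mine.

```python
def raidz_data_fraction(width: int, parity: int) -> float:
    """Fraction of raw capacity left for data in one RAIDZ vdev (overheads ignored)."""
    if not 0 < parity < width:
        raise ValueError("parity must be between 1 and width - 1")
    return (width - parity) / width


def raidz_usable_tb(width: int, parity: int, disk_tb: float) -> float:
    """Raw data capacity in TB: every disk except the parity share holds data."""
    return (width - parity) * disk_tb


for width in (12, 16):
    frac = raidz_data_fraction(width, 2)
    usable = raidz_usable_tb(width, 2, 10)
    print(f"{width}-wide RAIDZ2 of 10 TB disks: ~{usable:.0f} TB before overhead ({frac:.1%} of raw)")
# 12-wide RAIDZ2 of 10 TB disks: ~100 TB before overhead (83.3% of raw)
# 16-wide RAIDZ2 of 10 TB disks: ~140 TB before overhead (87.5% of raw)
```

So the wider vDev gains only a few percentage points of efficiency, while every resilver has to read from more disks.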
good to know
In that case, how many dedicated metadata disks would you suggest for such a configuration? Would adding them help?
Same question for the cache disk: does my 500 GB SSD suffice?
You probably do not need any of the special types of vDev (SLOG, L2ARC, Dedup, Special Allocation (Metadata)).
SLOG is for synchronous writes only, and you should only do synchronous writes for the specific types of data that need them. But if those writes go to HDD, then you will definitely need an SLOG.
Dedup is renowned for having a very bad impact on performance and needing a LOT of memory. Don't do it, just don't. If you need to dedup, find or write a script which does dedup using block cloning.
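As a starting point for such a script, here is a minimal sketch of the first half only: finding files with identical content by comparing sizes and then SHA-256 hashes. It deliberately does not touch any files. Replacing the duplicates with block clones (for example through a reflink-capable copy) is left out, because whether that works depends on your OpenZFS version and settings, which I am only assuming here. All names in the sketch are hypothetical.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path: str, chunk_size: int = 1 << 20) -> str:
    """SHA-256 of a file, read in chunks so large files do not fill memory."""
    h = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root: str) -> list[list[str]]:
    """Group regular files under root that share both size and SHA-256."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_size[os.path.getsize(path)].append(path)

    groups = []
    for size, paths in by_size.items():
        # Only hash files whose size matches at least one other file.
        if size == 0 or len(paths) < 2:
            continue
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)
```

Calling `find_duplicates("/mnt/yourpool/dataset")` (path is a placeholder) returns a list of groups, each a list of paths with identical content, which you can review before deciding what to clone.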
L2ARC can help, but apparently only if you have >= 64GB of memory. You might be better off simply adding memory and using a script to scan the directories and read the metadata and cache it in memory.
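The kind of script meant here can be very small. Below is a minimal sketch that walks a directory tree and stats every entry, so the metadata gets read once. Whether it then stays in memory depends on memory pressure and ZFS tuning, so treat it as an experiment rather than a guarantee. The function name is made up, and the path is whatever you pass on the command line.

```python
import os
import sys


def warm_metadata(root: str) -> int:
    """Walk root and lstat every entry so its metadata is read once.

    Returns the number of entries visited; unreadable entries are skipped.
    """
    visited = 0
    for dirpath, dirnames, filenames in os.walk(root, onerror=lambda err: None):
        for name in dirnames + filenames:
            try:
                os.lstat(os.path.join(dirpath, name))
                visited += 1
            except OSError:
                continue
    return visited


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python3 warm_metadata.py /path/to/dataset
    print(f"{warm_metadata(sys.argv[1])} entries visited under {sys.argv[1]}")
```

It only reads metadata, never file contents, so it is safe to run on a live pool, though it will add load while it runs.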
A special allocation vDev (for metadata) can definitely help if you have a specific response time problem because your read activity is so random that the metadata isn't in the cache. You need to try this, but bear in mind that for RAIDZ you cannot later remove this. If you are going to do this, bear in mind that the Metadata vDev is critical to the pool and if you lose it then you lose your pool - so it needs to be redundant and ideally at least as redundant as the data vDevs.
OK, so as I have 32 GB RAM (mainboard maxed here), the 500 GB L2ARC SSD is good enough.
BTW, it surprises me that although it seems to have finished replacing/resilvering one of the two faulty drives, what I see is that the new replacement drive is "offline" while the faulty one is still "online"??? Shouldn't the faulty one be replaced by the new one and be "offline" instead?