Import zpool with missing slog device


 
Thread Tools Search this Thread
Operating Systems Solaris Import zpool with missing slog device
# 1  
Old 07-30-2010
Import zpool with missing slog device

Hello, I have a problem on my backup server. I lost my system hdd and my separate ZIL device while the system crashs and now I'm in trouble. The old system was running under the least version of osol/dev (snv_134) with zfs v22.
After the server crashs I was very optimistic of solving the problems the same day. It's a long time ago.
I was setting up a new systen (osol 2009.06 and updating to the least version of osol/dev - snv_134 - with deduplication) and then I tried to import my backup zpool, but it does not work.

Code:
# zpool import
  pool: tank1
    id: 5048704328421749681
 state: UNAVAIL
status: The pool was last accessed by another system.
action: The pool cannot be imported due to damaged devices or data.
   see: Sun Microsystems Message Lookup
config:

        tank1        UNAVAIL  missing device
          raidz2-0   ONLINE
            c7t5d0   ONLINE
            c7t0d0   ONLINE
            c7t6d0   ONLINE
            c7t3d0   ONLINE
            c7t1d0   ONLINE
            c7t4d0   ONLINE
            c7t2d0   ONLINE

# zpool import -f tank1
cannot import 'tank1': one or more devices is currently unavailable
        Destroy and re-create the pool from
        a backup source

Any other option (-f, -F, -X, -V, -D) and any combination of them doesn't helps too. An import with option -D only works, if the zpool.cache file is accessible.
I can not add / attach / detach / remove a vdev and the ZIL-device either, because the system tells me: there is no zpool 'tank1'.
In the last weeks I read lots of threads, guides to solve problems and best practice documentations of ZFS and so on, but I do not found a solution for my problem. I created a fake-zpool with separate ZIL-device to combine the new ZIL-file with my old zpool for importing them, but it doesn't work in course of the different GUID and checksum (the name I was modifiing by a binary editor).
The output of:
Code:
eee@opensolaris:~# zdb -e tank1

Configuration for import:
        vdev_children: 2
        version: 22
        pool_guid: 5048704328421749681
        name: 'tank1'
        state: 0
        hostid: 946038
        hostname: 'opensolaris'
        vdev_tree:
            type: 'root'
            id: 0
            guid: 5048704328421749681
            children[0]:
                type: 'raidz'
                id: 0
                guid: 16723866123388081610
                nparity: 2
                metaslab_array: 23
                metaslab_shift: 30
                ashift: 9
                asize: 7001340903424
                is_log: 0
                create_txg: 4
                children[0]:
                    type: 'disk'
                    id: 0
                    guid: 6858138566678362598
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@0,0:a'
                    whole_disk: 1
                    DTL: 4345
                    create_txg: 4
                    path: '/dev/dsk/c7t5d0s0'
                    devid: 'id1,sd@SATA_____SAMSUNG_HD103UJ_______S13PJ1BQ709050/a'
                children[1]:
                    type: 'disk'
                    id: 1
                    guid: 16136237447458434520
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@1,0:a'
                    whole_disk: 1
                    DTL: 4344
                    create_txg: 4
                    path: '/dev/dsk/c7t0d0s0'
                    devid: 'id1,sd@SATA_____SAMSUNG_HD103UJ_______S13PJDWQ317311/a'
                children[2]:
                    type: 'disk'
                    id: 2
                    guid: 10876853602231471126
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@2,0:a'
                    whole_disk: 1
                    DTL: 4343
                    create_txg: 4
                    path: '/dev/dsk/c7t6d0s0'
                    devid: 'id1,sd@SATA_____Hitachi_HDT72101______STF604MH14S56W/a'
                children[3]:
                    type: 'disk'
                    id: 3
                    guid: 2384677379114262201
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@3,0:a'
                    whole_disk: 1
                    DTL: 4342
                    create_txg: 4
                    path: '/dev/dsk/c7t3d0s0'
                    devid: 'id1,sd@SATA_____SAMSUNG_HD103UJ_______S13PJ1NQ811135/a'
                children[4]:
                    type: 'disk'
                    id: 4
                    guid: 15143849195434333247
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@4,0:a'
                    whole_disk: 1
                    DTL: 4341
                    create_txg: 4
                    path: '/dev/dsk/c7t1d0s0'
                    devid: 'id1,sd@SATA_____Hitachi_HDT72101______STF604MH16V73W/a'
                children[5]:
                    type: 'disk'
                    id: 5
                    guid: 11627603446133164653
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@5,0:a'
                    whole_disk: 1
                    DTL: 4340
                    create_txg: 4
                    path: '/dev/dsk/c7t4d0s0'
                    devid: 'id1,sd@SATA_____SAMSUNG_HD103UJ_______S13PJDWQ317308/a'
                children[6]:
                    type: 'disk'
                    id: 6
                    guid: 15036924286456611863
                    phys_path: '/pci@0,0/pci8086,244e@1e/pci11ab,11ab@9/disk@6,0:a'
                    whole_disk: 1
                    DTL: 4338
                    create_txg: 4
                    path: '/dev/dsk/c7t2d0s0'
                    devid: 'id1,sd@SATA_____Hitachi_HDS72101______JP2921HQ0KMEZA/a'
            children[1]:
                type: 'missing'
                id: 1
                guid: 0

The logfix tool doesn't works too, in course the GUID of the old slog device is missing.
Is there a solution to try any other OS (Linux with fuse or kernel module or BSD) which is not so pedantic to import and "repair" the zpool and then exporting it to the opensolaris Server or giving the zpool a fake slog or tell it there's no slog
part of the zpool. Maybe can I swich off the examination that all devices accessible during import only for one time.

I need some help, please!
Thanks for any replies.

Best regards
Ron

Last edited by pludi; 07-30-2010 at 01:48 PM.. Reason: code tags, please...
# 2  
Old 08-11-2010
Hello, I'm very frustrated now in course of solving my problem is far.Smilie
Is there anyone who knows if Sun/Oracle can help me and how much this support is?
Can I even get any support if I have an OpenSolaris system?
Thanks for any replies.

Ron
# 3  
Old 01-14-2011
I have the same problem. I accidentally overwrote the drive with ZIL when upgrading from OpenSolaris 2009.11, to Solaris 11 Express (151a). Now I can't import the pool, even with the -m option (which documentation explicitly says it to do just that).

Sorry I don't have an answer for you (yet), but your effort - and more specifically your post - has at least save ME from potentially weeks of hair-pulling similar to what you have done.

I love the IDEA of ZFS, but in truth I have seen a significant "negative" benefit over the years of using it. I.E., loss of data. Solaris is so damn fickle and exceptionally fragile (as an operating system...and as I'm learning, ZFS is too). I mean, come on! The whole idea of an separate ZIL was to improve performance...and hopefully not at the expense of reliability! From now on, I will be disabling the ZIL altogether. (And the moment btrfs is production-ready on Linux, will migrate everything to that in a heartbeat.)

Who would have thought that losing the ZIL would render all of your data useless!?? Talk about unnecessary additional points of failure. (Even if mirrored...still an additional point of failure.) This risk, IMO, vastly outweighs the risk of asynchronous IO, unless you are running something that explicitly (i.e. documented) requires synchronous IO such as apps/databases that life and/or lots of $ depends on.

Anyway. I'm going to try to import via Linuz/ZFS-FUSE. If that fails I'll have to try to reconstruct the data via sporadic offline backups. I'll keep ya posted.
Login or Register to Ask a Question

Previous Thread | Next Thread

8 More Discussions You Might Find Interesting

1. Solaris

How to clear a removed single-disk pool from being listed by zpool import?

On an OmniOS server, I removed a single-disk pool I was using for testing. Now, when I run zpool import it will show it as FAULTED, since that single disk not available anymore. # zpool import pool: fido id: 7452075738474086658 state: FAULTED status: The pool was last... (11 Replies)
Discussion started by: priyadarshan
11 Replies

2. HP-UX

Failed to open tape device /dev/rmt/0mn:Device busy (errno = 16)

Hi, Unable to make tape backup, please help. /opt/ignite/bin/make_tape_recovery -a /dev/rmt/?mn -I -v -m tar -x inc_entire=vg00 * Creating local directories for configuration files and archive. ======= 04/25/16 16:28:08 IST Started /opt/ignite/bin/make_tape_recovery. (Mon... (4 Replies)
Discussion started by: anuragr
4 Replies

3. Solaris

Zpool import/export error

A backup/clone script of ours was recently ran. It normally only clones the rpool and renames in rpoolA. Something must've changed as it found another one of our pools that it shouldn't have. It exported that pool unbeknownst to us. Later on when a coworker realized the other pool was missing he... (2 Replies)
Discussion started by: beantownmp
2 Replies

4. Solaris

Zpool device weirdness

Weird duplicate device in one of my nested raidz1s - device is shown as both online and unavailable, with a hot spare showing up as degraded as well. Any thoughts on how I should proceed? root@storage# zpool status tank pool: tank state: DEGRADED status: One or more devices are... (6 Replies)
Discussion started by: DoohanMcGirk
6 Replies

5. HP-UX

Cdrom device possibly missing?

Hello, I am following the HPUX 11.31 install/update guide and I am trying to install "Update-UX" from the installation media. I put the CD into the drive, and I am trying to mount the device. The instructions state:Find the DVD-ROM device file name: ioscan -C disk -f -n -k | more A typical... (5 Replies)
Discussion started by: bstring
5 Replies

6. Red Hat

Unable To Activate Ethernet Network Device in RHEL 5.5 - e100 device eth0 does not seem to be presen

Hi All, Could anyone please help to resolve the below problem. I installed RHEL5.5 in my desktop.But when i try to activate the ethernet connection then it gives me the error. I spent 2 days for the above and go through with several suggestion found by googling. But no luck. ... (0 Replies)
Discussion started by: Tanmoy
0 Replies

7. Solaris

zones: Use a zpool or mount as a raw device?

Hi, I need to create 4 zones. I have an array with a RAID5 volume. I have exported 4 LUNS from the array. Now my host see the 4 LUNS as raw devices. I am partitioning each LUN using format then mounting each slice on the global zone. Then I am using each slice as a file system for my non... (3 Replies)
Discussion started by: general_lee
3 Replies

8. Filesystems, Disks and Memory

hpux vg accessible but device files missing!

Hi all, I am looking at a MC cluster comprised of 2 nodes. There is a package named backup which does nothing more than mount a VG. At one point we noticed errors in syslog that look like this: cmclconfd: Could not access device file /dev/dsk/c20t0d0: No such file or directory There are... (1 Reply)
Discussion started by: mariusp
1 Replies
Login or Register to Ask a Question