Need assistance to replace root disk on Netra X4200


 
Thread Tools Search this Thread
Operating Systems Solaris Need assistance to replace root disk on Netra X4200
# 1  
Old 01-29-2014
Need assistance to replace root disk on Netra X4200

Good day.

I have a SUN Netra X4200 running on Solaris 10, with two disks in raid 1 configuration. HDD0 (c2t0d0) seems to have a problem but HDD1 (c2t1d0) is ok:
Code:
# for a in c2t0d0 c2t1d0 ; do raidctl -l $a; done
Volume                  Size    Stripe  Status   Cache  RAID
        Sub                     Size                    Level
                Disk                                    
----------------------------------------------------------------
c2t0d0                  136.5G  N/A     DEGRADED OFF    RAID1
                0.2.0   136.5G          GOOD    
                0.5.0   136.5G          FAILED                   <<< where did 0.5.0 come from and what happened to 0.0.0?


Volume                  Size    Stripe  Status   Cache  RAID
        Sub                     Size                    Level
                Disk                                    
----------------------------------------------------------------
c2t1d0                  136.5G  N/A     OPTIMAL  OFF    RAID1
                0.1.0   136.5G          GOOD    
                0.3.0   136.5G          GOOD

Is there a fix or do I have to replace the disk. I have some disks harvested from an old server, however, I need to be sure that if I replace the primary disk with an older disk, that the older disk does not screw up the mirror disk which is now the only valid one. If I have to replace the disk, does anyone have a procedure?

Regards,
Bjoern
# 2  
Old 01-29-2014
What does this show:
Code:
raidctl -S

# 3  
Old 01-29-2014
Thanks for your prompt reply, bartus11!

Code:
# raidctl -S
  2 "LSI_1064"
  c2t0d0 2 0.2.0 0.5.0 1 DEGRADED
  c2t1d0 2 0.1.0 0.3.0 1 OPTIMAL
  0.1.0 GOOD
  0.2.0 GOOD
  0.3.0 GOOD
  0.5.0 FAILED

# 4  
Old 01-29-2014
Firstly, the 0.5.0 will refer to the position in the SCSI chain; perhaps SCSI ID 5?

Don't worry about finding 0.0.0. It was how it was configured and which disks were selected by the sysadmin to go into the array.

So, your controller knows exactly what's going on; there are two RAID1 arrays and one of the disks has failed.

Do you know exactly which disk it is??? Pulling out the wrong disk will be fatal!!!!

Assuming you know EXACTLY which disk is in trouble, the first thing to try is to pull it out and simply push it back in. Then check the status again. If it says it's rebuilding then perhaps it was just a connection problem (poor contacts happen all the time). If it still says FAILED then replacement is needed.

Most disks show the number of LBA's on the label. LBA=logical blocks or sectors on the disk.

The replacement disk must be the same or greater number of LBA's for it to work. It stands to reason that you can't completely mirror a drive to one which is smaller. [Some disks which have exactly the same model number have different numbers of LBA's. Different versions of manufacture. So beware.]

If you do plug in a disk which is smaller then the RAID controller will refuse to do anything with it. It's not because the replacement is faulty.

If you plug in a disk that the controller is happy with then the status will go into REBUILD whilst the remirror is being done, followed by OPTIMAL when the remirror has finished.

These RAID controllers support hot-swap so no need to take the system down just make damned sure that you're pulling out the right disk.

Hope that helps.
# 5  
Old 01-29-2014
Thanks for your reply hicksd8.

All the nodes in my cluster have the same disk configuration. This is what it looked like about a week before the problem arose:
Code:
Volume                  Size    Stripe  Status   Cache  RAID
          Sub                     Size                    Level
                  Disk                                    
  ----------------------------------------------------------------
  c2t0d0                  136.5G  N/A     OPTIMAL  OFF    RAID1
                  0.0.0   136.5G          GOOD    
                  0.2.0   136.5G          GOOD    
   
   
  Volume                  Size    Stripe  Status   Cache  RAID
          Sub                     Size                    Level
                  Disk                                    
  ----------------------------------------------------------------
  c2t1d0                  136.5G  N/A     OPTIMAL  OFF    RAID1
                  0.1.0   136.5G          GOOD    
                  0.3.0   136.5G          GOOD

According to the information at hand, the faulty disk is HDD0 (c2t0d0). This is the one we need to replace. If reseating the drive does not work, and I replace it with another compatible drive, do I have to format or label the spare drive first?

Regards,
Bjoern

Last edited by DukeNuke2; 01-29-2014 at 05:10 PM..
# 6  
Old 01-29-2014
The RAID controller only looks at raw storage. It doesn't know anything about format, partition types, or filesystems. In rebuilding the mirror it will simply copy sector 0 to sector0, sector 1 to sector 1, thru' sector n to sector n. It doesn't give a stuff what's on the new drive is sees.

Historically, there used to be less able RAID controllers which looked for empty drives ie, expecting no format or partition table. For this reason, if I had to test a recycled drive prior to using it, I'll blow any partition table away before disconnecting from my test rig (to make it look empty) but this is really no longer necessary. Your RAID controller is a LSI and they're extremely good. It will just take care of everything.

Since your problem drive is marked "failed" by the controller, it won't even be trying to talk to it any more so it won't be flashing when the system is I/O'ing. That should tell you which drive to pull.

Don't be surprised if pulling out the drive and reinserting it starts the rebuild. After months/years of operation a poor connection can develop which is cured by reseating. If that doesn't work insert a replacement with the same or greater LBA's.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

SUn Netra X4200 M2 Drivers

Hi Guys, Could someone please share the dvd/cd that came with server Netra x4200 M2 since it is not more available in Oracle site? Or do you know where I can get that? Thanks! (4 Replies)
Discussion started by: CristianS
4 Replies

2. Hardware

Cable for SUN x4200 DC

Hello, I have a spare SUN server X4200 with a DC PSU (oracle part number 300-2186) The PSU is the one in the photo I was looking in the net about what cable i can order but i couldn't find anything that fit there :) Any idea? Thanks! (2 Replies)
Discussion started by: @dagio
2 Replies

3. Solaris

Lost Root Password on VXVM Encapsulated Root Disk

Hi All Hope it's okay to post on this sub-forum, couldn't find a better place I've got a 480R running solaris 8 with veritas volume manager managing all filesystems, including an encapsulated root disk (I believe the root disk is encapsulated as one of the root mirror disks has an entry under... (1 Reply)
Discussion started by: sunnyd76
1 Replies

4. Solaris

Mirror the root disk

Hi all I wish to mirror for the root disk, but last time i do, make the server cannot boot up. :p So this time, hope you guys can assist me on it. =) At the last code, is the step i wish to do. Please help to check and correct me if got any wrong. root@leo # format </dev/null Searching for... (17 Replies)
Discussion started by: SmartAntz
17 Replies

5. UNIX for Advanced & Expert Users

Assistance Needed With Find/Replace in Vi

Hello All I have always had a question about find and replace in Vi. As this uses Vi, sed, and RegEx I never knew how or where to post the question but I thought I would give it a shot here. Say I have a text file filled with the following: Sue, your IP address is 192.168.1.10 which is... (4 Replies)
Discussion started by: NoSalt
4 Replies

6. Solaris

x4200 solaris doesn't see new disk

I got two 2.5" disk for a x4200 and put it in. Just same these disk were, I remember doing this in the past, working.. Now when I put in the disk and run 'devfsadm' and also check with 'cfgadm -a', that disk doesn't seem to show up. When I pull out the disk I see this in the logs: Mar... (15 Replies)
Discussion started by: madvikas
15 Replies

7. Solaris

Netra V440 & Netra 1290 keyswitch

Hello, I wrote a script which monitor the keyswitch state repeatedly each 10 minutes. I'm extracting the keyswitch status by using prtdiag. The script works fine for Netra v440 , but I found that prtdiag under Netra 1290 don't give keyswitch status. unlike Netra 440 server , I found that... (2 Replies)
Discussion started by: Alalush
2 Replies

8. Shell Programming and Scripting

sed search and replace word assistance...

Hi, I am trying to write a shell script designed to take input line by line by line from a file with a word on each line for editing with sed. Example file: 1.ejverything 2.bllown 3.maikling 4.manegement 5.existjing 6.systems My design currently takes input from the user, and... (2 Replies)
Discussion started by: mkfitzwilliams
2 Replies

9. Solaris

Netra 240 Hard disk are not detected

Hi All, I have new Netra 240 box which is not detecting its harddisk equiped with itse same har disk are detected when I swap with another machine. Same time If I swap hardisk of other machine to this Netra 240 box, also do not work. I have checked chassis, cabling etc and all looks fine. ... (1 Reply)
Discussion started by: xpwistler
1 Replies

10. AIX

Replace a Failed Root Disk in AIX

My root disk is failed and how to replace the root disk in AIX. Can u give a detailed explanation in step wise. Pls give the answer taking different scenarios. Regards Praveen (1 Reply)
Discussion started by: chinnu_mulakala
1 Replies
Login or Register to Ask a Question