Sponsored Content
Full Discussion: Quorum and hdisk issue
Operating Systems AIX Quorum and hdisk issue Post 302827775 by bakunin on Sunday 30th of June 2013 04:39:07 PM
Old 06-30-2013
OK.

The output confirms that hdisk1 is the cause of your problem. Your volume group is definitely offline and gone with it are the filesystems it may have (once) contained. If they appear to be still mounted: don't believe it, they are gone.

What you see here is a description of the disk (hdisk1) in increasing detail:

Quote:
Originally Posted by newtoaixos
Code:
pmut3# lsdev -Cc disk
hdisk0 Available 06-08-01-3,0 16 Bit LVD SCSI Disk Drive
hdisk1 Available 06-08-01-4,0 16 Bit LVD SCSI Disk Drive
hdisk2 Available 06-08-01-5,0 16 Bit LVD SCSI Disk Drive
hdisk3 Available 06-08-01-8,0 16 Bit LVD SCSI Disk Drive

pmut3# lsattr -El hdisk1
PCM             PCM/friend/scsiscsd                    Path Control Module           False
algorithm       fail_over                              Algorithm                     True
dist_err_pcnt   0                                      Distributed Error Percentage  True
dist_tw_width   50                                     Distributed Error Sample Time True
hcheck_interval 0                                      Health Check Interval         True
hcheck_mode     nonactive                              Health Check Mode             True
max_transfer    0x40000                                Maximum TRANSFER Size         True
pvid            00c5c9cfcf30eee90000000000000000       Physical volume identifier    False
queue_depth     3                                      Queue DEPTH                   False
reserve_policy  single_path                            Reserve Policy                True
size_in_mb      73400                                  Size in Megabytes             False
unique_id       260800023B980AST373455LC08IBM   H0scsi Unique device identifier      False

And this is the probable cause for hdisk1 failing. I SNIPped to the interesting part:

Quote:
Originally Posted by newtoaixos
Code:
pmut3# errpt -aj 8647C4E2 | pg
<...SNIP....>
Resource tested:        hdisk1
Resource Description:   16 Bit LVD SCSI Disk Drive
Location:               U788C.001.AAB1650-P1-T11-L4-L0
SRN:                    000-129
Description:            Error log analysis indicates a SCSI bus problem.

Looks like your SCSI disk was failing somehow - this could be everything from a broken cable, a terminator gone bad to the disk itself become broken. First, make shure that the SCSI link is up again. Delete the hdisk1 devices and run "cfgmgr" to rediscover it. If it won't come back the disk is not connected (or broken), if it is in status "available" the disconnection is gone. You still should investigate, because a symptom gone is not a problem solved. Find the reason for the disconnection, only this will solve your problem.

Still, don't be shy to start repair action - this server will do nothing without the data necessary for carrying out its function anyway. If business complains: see above. If they are too greedy to pay for mirrored disks they will have to live with failing ones and the time necessary for repair. If the disks are indeed mirrored whoever forgot to (un)set the quorum is to blame and business will have every right to be angry. This is administration basics and should not happen at all.

I hope this helps.

bakunin
 

10 More Discussions You Might Find Interesting

1. AIX

vpath to an hdisk

Is there a simply way for me to map a vpath to an hdisk on AIX 5.2? (5 Replies)
Discussion started by: 2dumb
5 Replies

2. AIX

Check quorum for volume group

Hi all, I would like to ensure that a volume group has an effective quorum setting of 1 (or off). I know you can change the quorum setting using the chvg -Q command but want to know if the setting has been changed before the vg was varied on or a reboot. In other words how can I ensure that... (3 Replies)
Discussion started by: backslash
3 Replies

3. Emergency UNIX and Linux Support

AIX APPVG - QUORUM LOST, VOLUME GROUP CLOSING

Hi, I am running AIX 5.3 TL8. After a disk failure, one of my mirrored application volumegroups went down. Unfortunately we have quorum switched on on this VG and the defective disk holds the majority. I have set MISSINGPV_VARYON to TRUE and tried a forced varyon but it's still failing. I... (3 Replies)
Discussion started by: zxmaus
3 Replies

4. AIX

Dummy hdisk in AIX 6.1

How do you create a dummy hdisk with AIX 6.1? In previous versions, I've used this and works, but now I get this error. hostname:/:# mkdev -l hdisk57 -c disk -t osdisk -s scsi -p fscsi0 -w 0,10 -d Method error (/etc/methods/define): 0514-022 The specified connection is not valid. Any... (2 Replies)
Discussion started by: kah00na
2 Replies

5. AIX

LVM - Quorum

Hi all Just a question about quorum. I am running AIX 5.3 Rootvg has 2 PV - not mirrored. quorum is switched on. What happens when one disk fails?, can i replace the disk and bring the entire VG back up. with all the data intact. knowing that the VG will be unavailable until i replace the... (3 Replies)
Discussion started by: Andre54
3 Replies

6. AIX

Quorum in lsvg output

Hi there, I have three servers and I'm puzzled by the oputput I get from lsvg rootvg. Server 1 : QUORUM: 2 (Enabled) Server 2 : QUORUM: 1 (Disabled) Server 3 : QUORUM: 1 All VG are build on 2 PV and are mirroring. What could cause the number to be different?... (2 Replies)
Discussion started by: petervg
2 Replies

7. AIX

Flashcopy, ghost hdisk ??

Hi all, I'm getting some errors on AIX regarding Flashcopy and volume group hard disks. The script that activates flashcopy showed this errors: Recreating Flashcopy for lun01_A1 Performing syntax check... Syntax check complete. Executing script... Script execution complete. SMcli... (1 Reply)
Discussion started by: enux
1 Replies

8. Red Hat

Centos/rhel 5 cluster 3 nodes with out Quorum

Hi all, i have 3 nodes cluster (Centos 5 cluster suit) with out quorum disk, node vote = 1, the value of a quorum = 2, when 2 nodes going offline, cluster services are destoys. How i can save the cluster and all services(move all services to one alive node) with out quorum disk when other... (3 Replies)
Discussion started by: Flomaster
3 Replies

9. AIX

How can I map hdisk# to rhdisk#?

Some storage/disks have been added to an existing AIX 6.1 server. The admin sent me the list of hdisk#'s for the new disks, but I need the corresponding rhdisk# for the same hdisk. (I know from past experience that the rhdisk that maps to an hdisk is not always the same number. For instance,... (5 Replies)
Discussion started by: sbrower
5 Replies

10. Solaris

Precaution during Quorum Server Reboot

Hi I need to know what are the precaution we should take during quorum server reboot as this quorum server is providing quorum devices to five different solaris two node clusters. Also let me know do I have to follow below procedure as well before and after reboot of quorum server Do I... (3 Replies)
Discussion started by: sb200
3 Replies
All times are GMT -4. The time now is 04:58 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy