SVM metastat -- needs maintenance


 
Thread Tools Search this Thread
Operating Systems Solaris SVM metastat -- needs maintenance
# 8  
Old 01-26-2006
Sorry for being sparse on the details.

This is a 280R. The drives are internal.

The metadb state databases are on slice 7 of the mirrored disks. Here is the output:

Quote:
# metadb
flags first blk block count
a m p luo 16 8192 /dev/dsk/c1t0d0s7
a p luo 8208 8192 /dev/dsk/c1t0d0s7
a p luo 16 8192 /dev/dsk/c1t1d0s7
a p luo 8208 8192 /dev/dsk/c1t1d0s7
As for the failing partitions, the only indication of failures is the metastat output, the application and OS are running fine. Metastat reports problems with / (d10) and /smarts1 (d50). The application is EMC SMARTS.

I just noticed this issue because I wanted to implement SVM monitoring and happened to do a metastat. We have rebooted this machine a couple of times in the last few months, most recently about 2 weeks ago.

syslogd is running:
Quote:
root 250 1 0 Jan 12 ? 23:32 /usr/sbin/syslogd
And here is the contents of syslog.conf. Let me know if I should provide anything else.

Quote:
cat /etc/syslog.conf
#ident "@(#)syslog.conf 1.5 98/12/14 SMI" /* SunOS 5.0 */
#
# Copyright (c) 1991-1998 by Sun Microsystems, Inc.
# All rights reserved.
#
# syslog configuration file.
#
# This file is processed by m4 so be careful to quote (`') names
# that match m4 reserved words. Also, within ifdef's, arguments
# containing commas must be quoted.
#
#*.err;kern.notice;auth.notice /dev/sysmsg
#*.err;kern.debug;daemon.notice;mail.none /var/adm/messages

#*.alert;kern.err;daemon.err operator
#*.alert root

#*.emerg *

# if a non-loghost machine chooses to have authentication messages
# sent to the loghost machine, un-comment out the following line:
#auth.notice ifdef(`LOGHOST', /var/log/authlog, @loghost)

mail.debug /var/log/mail
#*.emerg;*.alert;*.crit;*.err;*.warn;*.info /var/log/syslog
*.emerg;*.alert;*.crit /var/log/syslog

#
# non-loghost machines will use the following lines to cause "user"
# log messages to be logged locally.
#
#ifdef(`LOGHOST', ,
#user.err /dev/sysmsg
#user.err /var/adm/messages
#user.alert `root, operator'
#user.emerg *
#)
# 9  
Old 01-26-2006
First post: "There are no errors in /var/adm/messages."

Last post: syslog.conf - the only lines not commented are

mail.debug /var/log/mail
and
*.emerg;*.alert;*.crit /var/log/syslog

If the devices are giving warnings, those may be lost - suggest you add/change

*.emerg;*.alert;*.crit /var/log/syslog
to
*.emerg;*.alert;*.crit;*.err;*.warn;*.info /var/log/syslog

and send a hup siginal to syslogd so it will re-read the config file - then check your /var/log/syslog file for possible errors. That may give you a better read on your issue. I don't see that /var/adm/messages would have had anything it in from syslogd.
# 10  
Old 01-26-2006
Also suggest you get the output of the following and save it

metastat -p
cat md.cf
cat md.tab

The last two files should be in /etc/lvm/
# 11  
Old 01-27-2006
Quote:
Originally Posted by RTM
First post: "There are no errors in /var/adm/messages."

Last post: syslog.conf - the only lines not commented are...

Yep, doesnt make much sense at all. At first, I looked in /var/adm/messages, and thats why I posted that path in my original post, but then later today saw that the application admin changed the log file to /var/log/syslog. Anyhow, I'll try your suggestions tomorrow and update here. Thanks for your help.
# 12  
Old 01-30-2006
I edited the /etc/syslog.conf file and sent a HUP signal to the syslogd process. I am now getting all messages going to /var/log/syslog. However, I'm still not getting any output related to the volumes that needs maintenance.
# 13  
Old 01-30-2006
It's hard to know what to suggest because I don't understand how the box arrived in the current state. That syslog.conf thing scares me. I guess that I would first verify that I have good backups. Then I would look at the two disks with prtvtoc to ensure that they are partitioned identicly. Then I would look at the special files for the disks to make sure that no one replaced them text files or something. If the disks are partitioned correctly, no hardware error are known, and the special files really point to the device, then it has to be ok to attempt a resync. Or least, I think so. So I would cross my fingers and try:
metareplace -e d50 c1t0d0s5

No moneyback guarantees. Objects in mirror may be closer than they seem. Packed by weight, not by volume. Your results may vary. etc...
# 14  
Old 01-30-2006
"saw that the application admin changed the log file to /var/log/syslog"

That's the scary part.

Suggest a call to SUN is in order for your issue - I've never seen such a problem and can not find anything on sunsolve showing this type of issue.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Solaris

Metastat shows state needs maintenance

Hi, We have a Solaris 10 machine with update 11 and was configured with IBM storage. It was assigned 2 LUNs (each 70GB) which were striped to make it 140GB. we have taken full backup of entire machine and Our storage team replaced IBM storage with Nimble storage (they did storage-level... (6 Replies)
Discussion started by: prvnrk
6 Replies

2. Solaris

metastat |grep Needs

Dear, Required an script such that : If metastat |grep Needs , results in some output then this command to be executed for the same : opcmsg object=metastat a=OS msg_grp=OpC severity=critical msg_text="Need maintenance for the system $line" With regards, Mjoshi (3 Replies)
Discussion started by: mjoshi87
3 Replies

3. Shell Programming and Scripting

Grep contents from metastat command

Hi, after issuing metastat command I am getting output as follows Actually these soft partitions are more than 100. I want output as Device Name & Size. for eg d4004 2.0 GB (4 Replies)
Discussion started by: tuxian
4 Replies

4. Solaris

restore metastat configuration after solaris upgrade

Greetings How can i restore metastat db after a solaris upgrade (9 to 10)? will it work if i save and restore /etc/vfstab and /etc/lvm/md.cf file? root@netad# metastat d35: Mirror Submirror 0: d38 State: Okay Submirror 1: d39 State: Okay Pass:... (2 Replies)
Discussion started by: kashif_islam
2 Replies

5. Solaris

SVM Solaris 8 Problem. Metastat output looping

Hi friends, I'm newbie to SVM. Just wanna try installed it on one of our server (to do mirroring for disk0 and disk1) but i think im lost until now. :( the steps i've taken is as below:- 1.prtvtoc /dev/rdsk/c1t0d0s2 | fmthard -s - /dev/rdsk/c1t1d0s2 2.metadb -a -c 3 -f c1t0d0s7... (3 Replies)
Discussion started by: kronenose
3 Replies

6. Solaris

Metastat shows "maintenance" and "last-erred"

Hi All, Sorry to post a problem for my first post but I'm in a bit of a pickle at the minute! I have an Ultra45 connected to a Storedge 3100 series, 2 internal, 2 external disks with a db application running on the external disks. Now everything is working fine and we've had no downtime or... (4 Replies)
Discussion started by: TheSteed
4 Replies

7. Solaris

Softpartition State: Errored in Command MetaStat

Hi people, I have on problem when execute the command METASTAT... d60: Soft Partition Device: d10 State: Errored Size: 12582912 blocks (6.0 GB) Someone help me? Thank you very much (4 Replies)
Discussion started by: denisgomes
4 Replies

8. Filesystems, Disks and Memory

What should I do with this metastat result? metareplace everything?

I have to rescue the volumes in a soloris 7 machine. The metastat returns the following result: d6 is dying with iostat -nE returns Harderror 3, Transports error 9 Can some offer me some help? Thank you very much. (6 Replies)
Discussion started by: nickychung
6 Replies

9. Solaris

SVM - metastat - Last Erred

My company is running a solaris 2.7 machine. The machine is getting slow recently. I have no expert in solaris. Please help. I checked the log in /var/adm/message: I also checked with the command iostat -nE. It returns: Metastat returns the followings: What should i do now?... (10 Replies)
Discussion started by: nickychung
10 Replies

10. Solaris

Help on metastat

hi all, can someone pls pass on your suggestion? Firs thing I am testing a script which checks for the pattern 'Needs Maintenance' from metastat output and prints some messages in the screen. So i need to simulate an error in mirrored disk for metastat to give this message 'Needs Maintenance'.... (3 Replies)
Discussion started by: srirammad007
3 Replies
Login or Register to Ask a Question