Solaris: Fault Management Service toggles online, offline


 
Thread Tools Search this Thread
Top Forums UNIX for Advanced & Expert Users Solaris: Fault Management Service toggles online, offline
# 1  
Old 02-03-2011
Solaris: Fault Management Service toggles online, offline

I have two Solaris 10 T2000 systems.

Platform sun8 has newer firmware than sun7.
Code:
sun8/user$ prtdiag -v | grep OBP
OBP 4.30.4.b 2010/07/09 13:48

sun7/user$ prtdiag -v | grep OBP
OBP 4.30.4.a 2010/01/06 14:56

The platform (sun8) with the newer firmware (OBP 4.30.4.b) has a Fault Management service which toggles online/offline repetitively.
Code:
sun8/user$ svcs fmd
STATE          STIME    FMRI
online         14:40:52 svc:/system/fmd:default

sun8/user$ svcs fmd
STATE          STIME    FMRI
offline*       14:40:55 svc:/system/fmd:default

sun8/user$ svcs fmd
STATE          STIME    FMRI
online         14:41:01 svc:/system/fmd:default

sun8/user$ svcs fmd
STATE          STIME    FMRI
offline*       14:41:04 svc:/system/fmd:default

The services which "fmd" are dependent upon are online.
Code:
sun8/user$ svcs -d fmd 
STATE          STIME    FMRI
online         Feb_02   svc:/system/filesystem/minimal:default
online         Feb_02   svc:/system/sysevent:default
online         Feb_02   svc:/network/rpc/bind:default
online         Feb_02   svc:/system/dumpadm:default

The error log is not useful.
Code:
sun8/user$ svcs -xv fmd
svc:/system/fmd:default (Solaris Fault Manager)
 State: offline since Thu Feb 03 14:45:27 2011
Reason: Start method is running.
   See: http://sun.com/msg/SMF-8000-C4
   See: man -M /usr/share/man -s 1M fmd
   See: /var/svc/log/system-fmd:default.log
Impact: This service is not running.

sun8/user$ tail /var/svc/log/system-fmd:default.log
[ Feb  3 14:45:09 Executing start method ("/usr/lib/fm/fmd/fmd") ]
[ Feb  3 14:45:15 Method "start" exited with status 0 ]
[ Feb  3 14:45:18 Stopping because all processes in service exited. ]
[ Feb  3 14:45:18 Executing stop method (:kill) ]
[ Feb  3 14:45:18 Executing start method ("/usr/lib/fm/fmd/fmd") ]
[ Feb  3 14:45:24 Method "start" exited with status 0 ]
[ Feb  3 14:45:27 Stopping because all processes in service exited. ]
[ Feb  3 14:45:27 Executing stop method (:kill) ]
[ Feb  3 14:45:27 Executing start method ("/usr/lib/fm/fmd/fmd") ]
[ Feb  3 14:45:33 Method "start" exited with status 0 ]

Steps already taken:
  • upgraded firmware (the box had the same problem with the older firmware)
  • disabled and enabled the service
  • rebooted the box
Google has a lot of people reporting this problem, even the identical problem on OpenSolaris site which indicated it could not be replicated.

What should be looked at, next, to narrow the issue?
# 2  
Old 02-05-2011
What is the contents, if any, of
Code:
/etc/fm/fmd

# 3  
Old 02-18-2011
Data

The contents of "/etc/fm/fmd" - nothing, consistently, even though "fmd" is constantly restarting.

Code:
sun8/user$ svcs fmd
STATE          STIME    FMRI
online         19:53:51 svc:/system/fmd:default

sun8/user$ svcs fmd
STATE          STIME    FMRI
offline*       19:53:54 svc:/system/fmd:default

sun8/user$ ls -al /etc/fm/fmd
total 4
drwxr-xr-x   2 root     sys          512 Apr 13  2006 .
drwxr-xr-x   3 root     sys          512 Apr 13  2006 ..

Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Red Hat

How do I know whether a service is online?

Hi! I'm a Solaris admin. I don't know how to check whether a service is online or not in Linux. example:i've started a service using #service named start #service http start How can i verify that these services are in running mode? (5 Replies)
Discussion started by: vamshigvk475
5 Replies

2. Solaris

Solaris (FMA) Fault Management Architecture

Does anyone have instructions how to install and configure this to generate SNMP alerts? Running Solaris 10 update 8. (6 Replies)
Discussion started by: soupbone38
6 Replies

3. UNIX for Dummies Questions & Answers

How to recover services from offline to online?

Dear all, How to recover the Dns service from offline to online? I used the command " svcadm enable dns/client" the command is executing but again its showing service is offline. pls clarify and give me suggesstion... Thanks in advance (1 Reply)
Discussion started by: masthan25
1 Replies

4. UNIX for Advanced & Expert Users

how to find any device online/offline

what is the command for checking which device active and which is not active (1 Reply)
Discussion started by: RahulJoshi
1 Replies

5. Shell Programming and Scripting

Server Staus offline to online -Shell script

Hi, We put cron entry :whenever server is offline(checks every 5 minutes) it sends status to mailer group. The number of messages (offline)were growing in our mailbox.How to avoid not to send offline messages after the first one through shell script. Thanks in advance. Chowdary (2 Replies)
Discussion started by: chowdary_m
2 Replies

6. Solaris

All dependicies of nfs service is online but nfs is offline

Hi all in my server all nfs dependices are online but nfs client is offline root@BIWAPP1 # svcs -a|grep nfs disabled Sep_05 svc:/network/nfs/server:default online Sep_05 svc:/network/nfs/rquota:default online Sep_05 svc:/network/nfs/mapid:default online ... (9 Replies)
Discussion started by: spandhan
9 Replies

7. Solaris

NFS service : offline after enabling it .

Hi all, i Followed these steps & still not able to get nfs service online ???:( bash-3.2# svcs -a | grep nfs disabled 17:48:02 svc:/network/nfs/cbd:default disabled 17:48:02 svc:/network/nfs/client:default disabled 20:56:06 svc:/network/nfs/status:default offline ... (4 Replies)
Discussion started by: saurabh84g
4 Replies

8. UNIX for Advanced & Expert Users

command for CPU online/offline status in bash shell

Hi , How do i check that the CPU is online/offline in a multi CPU machine in Linux ? i tired /proc/cpuinfo dmesg nothing gave me the currect CPU status. Pls help !! (5 Replies)
Discussion started by: sars
5 Replies

9. Solaris

Cluster group offline then online in 2 minutes

Hi , we are using Veritas cluster server on solaris 8. Suddenly one application group was offline and online after 2 minutes.. I have checked VCS log.. "VCS INFO V-16-1-50135 User reuters fired command: hares -offline middleman_proc server01 from 127.0.0.1" Does anyone faced this... (0 Replies)
Discussion started by: ddk2oo5
0 Replies
Login or Register to Ask a Question