Solaris 10 10/08 fmd core dump


 
Thread Tools Search this Thread
Operating Systems Solaris Solaris 10 10/08 fmd core dump
# 1  
Old 11-20-2008
Solaris 10 10/08 fmd core dump

There's something strange with fmd in update 6. On four identical machines (Netra T2000) all installed thru jumpstart (same configuration, same packages, same patches, same firmware) fmd randomly refuses to start after reboot.

Code:
$ cat /var/svc/log/system-fmd:default.log
...
[ Nov 20 09:47:03 Executing start method ("/usr/lib/fm/fmd/fmd") ]
[ Nov 20 09:47:16 Method "start" exited with status 0 ]
[ Nov 20 09:47:21 Stopping because process dumped core. ]
[ Nov 20 09:47:21 Executing stop method (:kill) ]

$ svcs -xv
svc:/system/fmd:default (Solaris Fault Manager)
 State: offline since Thu Nov 20 10:58:38 2008
Reason: Start method is running.
   See: http://sun.com/msg/SMF-8000-C4
   See: man -M /usr/share/man -s 1M fmd
   See: /var/svc/log/system-fmd:default.log
Impact: This service is not running.

And it loops ...

Running with debugging on I get:

Code:
$ sudo  /usr/lib/fm/fmd/fmd -o fg=true -o debug=all -o client.debug=true
fmd DEBUG: ereport.io.pci.sserr dispatched to 1 queues
fmd DEBUG: fabric-xlate: Bit 0x4000 First Err 0x4000
fmd DEBUG: fabric-xlate: Sending Target Ereport: type 0x4 addr 0x0 fltbdf 0x700
Segmentation Fault (core dumped)

Core dumps unfortunately don't make much sense to me.

Has anyone experienced similar problems?
Code:
$ uname -a
SunOS hostname 5.10 Generic_137137-09 sun4v sparc SUNW,Netra-T2000

$ cat /etc/release 
                      Solaris 10 10/08 s10s_u6wos_07b SPARC
           Copyright 2008 Sun Microsystems, Inc.  All Rights Reserved.
                        Use is subject to license terms.
                            Assembled 27 October 2008

sc> showhost
System Firmware 6.6.7 Netra[TM] T2000 2008/09/29 09:49

Host flash versions:
   OBP 4.29.0.a 2008/09/15 12:01
   Hypervisor 1.6.7.a 2008/09/29 09:29
   Netra[TM] T2000 POST 4.29.0.a 2008/09/15 12:30

Thank you,
Luka
# 2  
Old 11-22-2008
After applying the latest kernel patches, you will need to perform a re-configuration reboot to bring the system to a consistent state. I think you missed that. bring the system down to single user and try the following

#reboot -- -r
# 3  
Old 11-22-2008
Quote:
Originally Posted by incredible
After applying the latest kernel patches, you will need to perform a re-configuration reboot to bring the system to a consistent state. I think you missed that. bring the system down to single user and try the following

#reboot -- -r
after a kernel patch is installed there will be a file called "reconfigure" in the / filesystem. so a "reboot" or "init 6" should be enough cause this is the same like "reboot -- -r".
# 4  
Old 11-22-2008
Nope, the reconfigure file is not always there in the / fs. Anyway, there is an instruction for the kernel patch for 137137-09 to use a re-configuration boot after patching is completed.
# 5  
Old 11-22-2008
Quote:
Originally Posted by incredible
Nope, the reconfigure file is not always there in the / fs. Anyway, there is an instruction for the kernel patch for 137137-09 to use a re-configuration boot after patching is completed.
jope...
# 6  
Old 11-24-2008
Reconfiguration reboot was done prior to opening this thread.
# 7  
Old 11-24-2008
whats the status now? did you perform any fsck on the filesystem in single user?
Worse come to worst, you will need to do a reinstallation. Are u using the same jumpstart config for other systems running this solaris 10 update?

Last edited by incredible; 11-24-2008 at 11:27 AM..
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Core dump while doing Solaris packaging

Hi All, I am doing Solaris packaging. I got core dump error while doing it. I have identified particular folder causing it but not sure about the reason. Without that folder i can able to do the package. I have attached the proto file and other related files for your reference. Pls... (1 Reply)
Discussion started by: kamal_456
1 Replies

2. Solaris

Solaris 10, GhostPDL , PCL to PDF , pcl6 core dump

I have compiled GhostPDL that provides a shell script pcl2pdf that calls the executable pcl6. pcl6 compiled with no errors using gmake and gcc. Running pcl2pdf generates a core dump when trying to convert a pcl file to pdf. I have installed the Ghost fonts. Does anyone have GhostPDL compiled... (1 Reply)
Discussion started by: gduncan
1 Replies

3. Solaris

core dump

Hi guys, just want to know which core file pattern is best to set for core dumps: 1) per-process file name pattern or 2) global file name pattern. I will really appreciate an explanation why the chosen one is better. Thanks a lot guys. (2 Replies)
Discussion started by: cjashu
2 Replies

4. Programming

core dump

how to view core dumped file using gdb and how to extract information from the coredumped file.can we get similar information from the other utilites like strace or ptrace. (2 Replies)
Discussion started by: Manabhanjan
2 Replies

5. AIX

core dump

My application gives core dump. When i am debugging with dbx getting instructions below: pthdb_session.c, 818: 695445 PTHDB_INTERNAL (internal error) pthreaded.c, 1941: PTHDB_INTERNAL (internal error) Illegal instruction (illegal opcode) in . at 0x0 warning: Unable to access address 0x0... (1 Reply)
Discussion started by: bapi
1 Replies

6. Solaris

Urgent - core dump in solaris 5.10

anybody know abt Segmentation Fault - core dumped in solaris 5.10..wats this error?? (4 Replies)
Discussion started by: sanjana
4 Replies

7. Programming

about core dump

MY friends: my program under sco unix have a problem? it create a core dump file on the path when execute program , but i can't find the error of the C program ,i don't know how to see the error about my program use core, please help me or give me some suggest and what tools can use... (1 Reply)
Discussion started by: zhaohaizhou
1 Replies

8. UNIX for Dummies Questions & Answers

core dump

does any one have read a core dump? is there any reader for that? or may i know what is the use of that core which takes sometimes memory in GBs? :) (6 Replies)
Discussion started by: sskb
6 Replies

9. UNIX for Advanced & Expert Users

bcp core dump on Solaris 5.7

I am importing data into a new database using bcp (ASE 11.0) on Solaris 5.7. I keep getting a core dump error when my ksh script attempts to load one table. All other tables load fine. If I load the table directly from the command line without a script the table loads fine. Does anybody know... (1 Reply)
Discussion started by: gerald_agoi
1 Replies

10. UNIX for Dummies Questions & Answers

help, what is the difference between core dump and panic dump?

help, what is the difference between core dump and panic dump? (1 Reply)
Discussion started by: aileen
1 Replies
Login or Register to Ask a Question