Server crashes when not in use.


 
Thread Tools Search this Thread
Operating Systems Solaris Server crashes when not in use.
# 1  
Old 02-09-2011
Server crashes when not in use.

Production server crashes when there is no traffic on it. Can only recover by going to ALOM and reboot the server. This seems to happen about the same time every month. The only good thing is the server is not in production at the time of the crash.
I have been unable to locate any information in the logs. Cause could be from remote backups or security scans but can't prove it. All hardware testing shows no problems.
Nothing shows in var/adm/messages files.
I would like to know how to setup logs that may save information. Any information will help.
System is a Sun V210.

Thanks,
# 2  
Old 02-09-2011
What version of Solaris? Does anything show up in prtdiag? I would check for a cron job since it goes down about the same time each month. Is pmconfig running? If so check /etc/power.conf, but I can't imagine this causing a problem only once a month. After you bring it up, run "last" to see who was logged on when it went down. Also check /var/adm/sulog to see if anyone was using su when it went down.
# 3  
Old 02-09-2011
Running Solaris 10.
This is what I checked using your suggestions.
pmconfig = does not show up in ps -ef
sulog = no info the day and time of last crash
last = Mon Sep 18 (most current date).

Can prtdiag be run while the server is in production.

Thanks
# 4  
Old 02-09-2011
Are you getting a crash dump from the kernel?

Or is the server not actually crashing but just hanging?
# 5  
Old 02-09-2011
Nothing on the screen.
Not getting any crash dumps.
The system looks like it is hung.
Have to go to alom sc> to reset it.

I was thinking of maybe opening a telnet session to the box and leaving a alarm window open. Maybe it will catch something when it does it again.

Thanks
# 6  
Old 02-09-2011
prtdiag can be run while the server is in production and it's a good idea to do so every now and then. It's easy on Solaris 10. As root, just run prtdiag -v

--edit--
I just plugged a serial cable into the management port of a V210 to look around the Service Processor (aka ALOM). I see a consolehistory command that might be worth trying the next time it happens before you reboot. And yes I am in ALOM while the box is up and running.

Last edited by Perderabo; 02-09-2011 at 05:25 PM.. Reason: add some stuff
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Web Development

Apache2 Crashes

The Apache server suddenly stops. I am running Debian Jessie Here are some diagnostics: root@meow:/var/www# apachectl configtest AH00558: apache2: Could not reliably determine the server's fully qualified domain name, using 127.0.1.1. Set the 'ServerName' directive globally to suppress... (4 Replies)
Discussion started by: Meow613
4 Replies

2. UNIX and Linux Applications

Apcupsd crashes

I am trying to run apcupsd, but it will not retain contact the UPS root@meow:/home/ethan/UPS/gapcmon-0.8.9# /etc/init.d/apcupsd start Starting UPS power management: A copy of the daemon is still running. If you just stopped it, please wait about 5 seconds for it to shut down.... (0 Replies)
Discussion started by: Meow613
0 Replies

3. Shell Programming and Scripting

Perl/TK script crashes in MainLoop

I am running a perl/tk function that suddenly crashes in MainLoop. The gui just disappears and an X error is thrown when the code tries to update the non-existent GUI. Any suggestions to debug the root cause which destroys the mw gui? -D- <block name> : <block_name> - 1 -D- row_column_id... (0 Replies)
Discussion started by: bob.pepple
0 Replies

4. Programming

C++ program crashes

Hi, Can anyone tell me why the below program is crashing? and where exactly it is crashing. What is the corrective measure to be taken to make it work fine? #include <iostream> #include <cstring> using namespace std; class CString { char* m_data; public: CString() :... (6 Replies)
Discussion started by: royalibrahim
6 Replies

5. Programming

Program crashes on calling __libc_msgrcv()

Hi, I am a newbie to linux programming. I have implemented msgqueue in C. msgrcv() call at the client end is as below: msgrcv( msgqid, msgptr, msgsize, msgtype, 0 ); My program works fine when msgrcv () from /lib/libc.so.6 is called. However it crashes when __libc_msgrcv() is called. ... (3 Replies)
Discussion started by: praasanna
3 Replies

6. Red Hat

7z crashes system

Can someone tell my why every time I try to use 7z it freezes my system? I can't move my mouse, I can't type, I can't kill my xsession. I then restart my system and everything returns to normal. When I try to use 7z my system again freezes. (11 Replies)
Discussion started by: cokedude
11 Replies

7. SuSE

Chromium flashes and crashes

I am running openSUSE 11.2 with KDE4.5 on my eMachines e525. I just did a "zypper up" on my system and it reported that Chromium was going to be updated. I agreed and the when the update was finished I shut down Chromium and started it again. It flickered on the screen for a moment and crashed.... (6 Replies)
Discussion started by: Druonysus
6 Replies

8. Programming

C++ program crashes

Hi, Could anyone tell me the reason why the following program crashes? class A { int x; public: A() { cout << "from A()" << endl; } ~A() { cout << "from ~A()" << endl; } }; class B : public A { public: B() { cout << "from B()"... (2 Replies)
Discussion started by: royalibrahim
2 Replies

9. Linux

gethostnameby_r crashes

Hello all, I'm trying to use gethostbyname_r function with 6 arguments in one of my functions. But the call to this crashes the program. Kindly help me in resolving this... Compiler Info: Linux target: i686-hardhat-linux version: 3.3.1 The following is the piece of code I'm trying to... (1 Reply)
Discussion started by: rajans
1 Replies

10. HP-UX

Program crashes with optimization level O2

I am experiencing a difficulty undersatnding why my program (C++, HP UNIX) crashes. It crashes only when I build it with -O (+O2) optimization switch (used in aCC compiler). It works ok with +O0 or +O1 optimization. Also, I see that local variables are shown incorrecly when program is built... (3 Replies)
Discussion started by: Yuriy07
3 Replies
Login or Register to Ask a Question