Server hung


 
# 1  
Old 01-19-2006
Server hung

So my server was hung when I came in this morning. It was responding to pings, but the console and telnet sessions would not respond. There was no disk activity. The display said FA1F, and I discovered that the "A" represents a high CPU load. I tried several things to get it going but was forced to reboot the server.

The first time I tried to boot the server it said "Terminating selection process. No boot device found".

*** Let me just clarify that I am an AIX guy and I just inherited this server and do not know much about HPUX.***

Having said that, I didn't know what to do so I booted the server again.

This time it came up. So I started doing some troubleshooting when it hung again. I was running the swapinfo command when it hung. I'm not sure if it was because of the swapinfo command or just a coincidence.

The display went to FA1F again so I waited for about 15 min and was forced to boot it. It seems to be working now.

Any idea what is going on here? Help with diags or troubleshooting steps would be great.

Thanks.

# uname -r
B.10.20
# 2  
Old 01-19-2006
Type "dmesg" to see the last few kernel messages. But that "Terminating selection process. No boot device found" says it all. Your root disk is failing. Make sure you have a good backup.

10.20 is a very old OS and it is not supported anymore.

F**F = os is running
FA*F = load is 10 or above
F*1F = only one cpu

Boxes with displays like that are pretty old. A 9000/E240 or something like that is my guess. That is not supported anymore either.
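
For anyone who wants to read these display codes from a script, here is a minimal sketch of a decoder. It covers only the three patterns listed above and treats everything else as unknown; the function name `decode_led` is made up for illustration and is not an HP-UX command.

```shell
# decode_led is a hypothetical helper, not part of HP-UX.
# It knows only the three patterns from this thread:
#   F**F = OS is running, FA*F = load is 10 or above, F*1F = only one CPU
decode_led() {
    code="$1"
    # Anything that is not four characters of the form F??F is unknown here.
    case "$code" in
        F??F) ;;
        *) echo "unknown code: $code"; return 1 ;;
    esac
    echo "OS is running"
    # Second character A: load is 10 or above.
    case "$code" in
        FA?F) echo "load is 10 or above" ;;
    esac
    # Third character 1: only one CPU.
    case "$code" in
        F?1F) echo "only one CPU" ;;
    esac
}

decode_led FA1F
```

For FA1F this prints all three lines, which matches the display described in the first post.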
# 3  
Old 01-19-2006
Quote:
Originally Posted by Perderabo
Type "dmesg" to see the last few kernel messages. But that "Terminating selection process. No boot device found" says it all. Your root disk is failing. Make sure you have a good backup.

Boxes with displays like that are pretty old. A 9000/E240 or something like that is my guess. That is not supported anymore either.

Strange that it would fail once and then work fine after a reset. I am aware that this server is old and I am working to replace it. I do have a good backup.

In AIX I could run online diagnostics by doing "diag -a". Is there a similar command in HPUX?

Thanks.
# 4  
Old 01-19-2006
It depends on whether the diagnostics are installed. As root, try:
cstm
and see if you get to the cstm prompt. Then try:
map
to see the devices. Pick a device and look at the device number in the first column. Let's say you pick device 29...
sel dev 29
info
infolog
unselal

info gathers some information about the selected device, infolog displays it, and unselal unselects the device. There is more you can do if that much works.
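
If you need to repeat this for several devices, a small helper can print the same command sequence for a given device number. This is only a sketch: `cstm_info_commands` is a made-up name, and feeding its output to cstm on standard input is an assumption that has not been verified on 10.20.

```shell
# cstm_info_commands is a hypothetical helper. It only prints the cstm
# commands from this post for one device number; it does not run cstm.
cstm_info_commands() {
    dev="$1"
    # Accept only a non-empty, purely numeric device number.
    case "$dev" in
        ''|*[!0-9]*)
            echo "usage: cstm_info_commands DEVICE_NUMBER" >&2
            return 1
            ;;
    esac
    printf '%s\n' "sel dev $dev" "info" "infolog" "unselal"
}

cstm_info_commands 29
# Possible use, assuming cstm accepts commands on standard input:
#   cstm_info_commands 29 | cstm
```

Printing the commands first lets you check them against the device numbers shown by map before anything is sent to cstm.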
# 5  
Old 01-19-2006
Thanks for your help.
# 6  
Old 01-19-2006
Here is some more cstm stuff. Tested on an old 10.20 of our own...

From the cstm prompt, run logtool:
runu logtool

You should be at a logtool prompt.
sl
This switches to a new log. As a side effect, you learn the name of the old log: it prints "log2.raw.cur will be renamed to log2.raw" or something like that. Enter log2.raw when asked after you run:
sr
This gives you a summary of the errors in the log file you select. It then shows a misleading prompt: all this command can do is redisplay the summary. Get out of this command and format the log file:
fr
You pick the directory; it picks the name and remembers that name for the next command. You get another summary and another chance to reprint it. Get out again. Now display the formatted log with fl:
fl
Finally you see the errors.
# 7  
Old 01-20-2006
Turns out the server has a bad processor. Maybe this information will help someone one day.

*** UPDATE ***

It also had a bad root disk.

Last edited by biznatch; 01-22-2006 at 12:42 PM..