Server hung


Login or Register for Dates, Times and to Reply

 
Thread Tools Search this Thread
Operating Systems HP-UX Server hung
# 1  
Server hung

So my server was hung when I came in this morning. It was responding to pings, but the console and telnet sessions would not respond. There was no disk activity. The display said FA1F which I discovered that the "A" represents a high CPU load. I tired several things to get it going but was forced to boot server.

The first time I tried to boot the server it said "Terminating selection process. No boot device found".

*** Let me just clarify that I am an AIX guy and I just inherited this server and do not know much about HPUX.***

Having said that, I didn't know what to do so I booted the server again.

This time it came up. So I started doing some troubleshooting when it hung again. I was running the swapinfo command when it hung. Iím not sure if it was because of the swapinfo command or just a coincidence.

The display went to FA1F again so I waited for about 15 min and was forced to boot it. It seems to be working now.

Any idea what is going on here? Help with diags or troubleshooting steps would be great.

Thanks.

# uname -r
B.10.20
# 2  
Type "dmesg" to see the last few kernel messages. But that "Terminating selection process. No boot device found" says it all. Your root disk is failing. Make sure you have a good backup.

10.20 is a very old OS and it is not supported anymore. .

F**F = os is running
FA*F = load is 10 or above
F*1F = only one cpu

Boxes with displays like that are pretty old. A 9000/E240 or something like that is my guess. That is not supported anymore either.
# 3  
Quote:
Originally Posted by Perderabo
Type "dmesg" to see the last few kernel messages. But that "Terminating selection process. No boot device found" says it all. Your root disk is failing. Make sure you have a good backup.

Boxes with displays like that are pretty old. A 9000/E240 or something like that is my guess. That is not supported anymore either.
Strange that it would fail once and then work fine after a reset. I am aware that this server is old and I am working to replace it. I do have a good backup.

In AIX I could run online diagnostics by doing "diag -a". Is there a similar command in HPUX?

Thanks.
# 4  
Depend if the diagnostics are installed or not. As root try:
cstm
and see if you get to cstm prompt. Then try:
map
to see the devices. Pick a device a look at the device number, first column. Let's say you pick device 29...
sel dev 29
info
infolog
unselal

Info gathers some info. infolog displays it. unselal unselects the device. There is more you can do if that much works.
# 5  
Thanks for your help.
# 6  
Here is some more cstm stuff. Tested on an old 10.20 of our own...

from the cstm prompt, run logtool...
runu logtool

You should be at a logtool prompt.
sl
This will switch to a new log, as a side effect, you learn the name of the old log. "log2.raw.cur will be renamed to log2.raw" or something like that. So enter log2.raw when asked after you run:
sr
this will give you a summary of the errors in the log file you select. Then it prompts you with a misleading prompt. All this command can do is redisplay the summary...get out of this command and format the log file:
fr
You will pick the directory, it will pick the name. (It will remember that name for the next command.) Another summary and another chance to reprint the summary...get out. Now display the formatted log with fl:
fl
finally you see the errors.
# 7  
Turns out the server has a bad processor. Maybe this information will help someone one day.

*** UPDATE ***

It also had a bad root disk.

Last edited by biznatch; 01-22-2006 at 01:42 PM..
Login or Register for Dates, Times and to Reply

Previous Thread | Next Thread
Thread Tools Search this Thread
Search this Thread:
Advanced Search

Test Your Knowledge in Computers #631
Difficulty: Medium
iPhone OS 1.0 was first released with great fanfare during the US Super Bowl in 2007.
True or False?

10 More Discussions You Might Find Interesting

1. Infrastructure Monitoring

Can HPOV monitor server hung state ?

Hi, We have Solaris-10 running on VMware (x86). It is being monitored by HP Openview. Sometimes when this server hungs, while ping still works, HPOpenview can't alert that server is down (which is actually unresponsive). First symptom we see is, login failure. It will ask user name and after... (7 Replies)
Discussion started by: solaris_1977
7 Replies

2. Linux

Server hung, is this a stack trace?

Hi everyone, Our Red Hat server hung yesterday, and I managed to log into the console and see the following message: RIP: 0010: mwait_idle_with_hints+0x66/ 0x67 RSP: 0018:ffffffff80457f40 EFLAGS: 00000046 RAX: 0000000000000010 RBX: ffff810c20075910 RCX: 0000000000000001 RDX:... (6 Replies)
Discussion started by: badoshi
6 Replies

3. Solaris

Hung ILOM - e2900

While trying to perform a firmware upgrade on this ilom, the system refused to allow uploading of more than ~900k or so... We tried multiple approaches using FTP and all failed. I decided to start this morning by firing up Apache on another box and trying the same process via HTTP but when I went... (8 Replies)
Discussion started by: DustinT
8 Replies

4. SuSE

Server hung with firmware error

Hi all We've had an issue over the weekend when one of the SUSE Linux Enterprise Server 11 hung and had to be rebooted. The thing is that I got the ticket alert for a FS exceeding its usage at about 22:41:49 PM on 23 March. I checked the dmesg, the messages log and the boot.msg but all I found... (1 Reply)
Discussion started by: hedkandi
1 Replies

5. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Our network administrators implemented some sort of check to kill idle sessions and now burden is on us to run some sort of keep alive. Client based keep alive doesn't do a very good job. I have same issue with ssh. Does solution 2 provided above apply for ssh sessions also? (1 Reply)
Discussion started by: yoda9691
1 Replies

6. Programming

Subroutine Hung

Hi friends I am Administrator for a system works with uinx OS but, many times I get messages from server console inform me about Subroutine is Hanging so what can I do to reset this Subroutine? Note: always when I got that I restart the server but I think that is not professional solution. (3 Replies)
Discussion started by: bintaleb
3 Replies

7. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Hi Guys, Just wondering if anyone of you have been in a situation where you end up having around 100 close_wait connections and seems to me those connections are locking up resources/processes in the server so unless the server is rebooted those processes won't be released by the close_wait... (3 Replies)
Discussion started by: hariza
3 Replies

8. HP-UX

Hung Port

Here's a difficult one for you. I am on HPUX 11.0. I have a hung port, it is stuck in FIN_WAIT_2. Short of rebooting, how can I terminate the connection the system thinks still exists? I found a script online that uses adb but have not been able to get it to work. I am still open to using... (6 Replies)
Discussion started by: TioTony
6 Replies

9. UNIX for Dummies Questions & Answers

SCSI bus is hung

Anyone have any ideas what could be causing me to get the error "The SCSI bus is hung. Perhaps an external device is turned off"? This is occuring when I try to boot off of my external CDROM. I have tried this CDROM on 3 different systems, tried different SCSI cables, and different terminators... (2 Replies)
Discussion started by: saucierm
2 Replies

10. IP Networking

system hung

I installed sco unix v 5.0.5 in the hp server lc 2000 HD size 27gb & ram 256 , used micro focus cobol for unix , aplication system bank , the problem is the system hung 2 or 3 times a day will you pls. give me the solution. thanks talib alsaadi :( (2 Replies)
Discussion started by: talib alsaadi
2 Replies

Featured Tech Videos