06-26-2019
Yes, we can assume it was dead slow. I waited 20 minutes before I hit the reset button. Since this is a critical application server, I could not wait any longer. I checked the memory graph in the VMware console, and it was not showing peak utilization. But I have seen cases where a server froze because of a memory crunch and the VMware console did not show it, so your theory could be true in this case too.
The VMware platform is stable (over 300 VMs are running on it). No other VM has complained, and there is still a large amount of memory available. Memory is capped per VM. For example, this affected VM has 8 GB of memory allocated.
As a preventive measure, I am looking for a way to get an alert if it happens again. One option is a small script running on another server that keeps logging in to the affected server and reports "Login OK". As soon as the login is delayed or gets no response for 10-20 seconds, it sends an email to the admins. But we want to do this for many servers, and that would become a messy solution. HP OpenView is handled by a different team, and they are not taking any initiative to move this forward. So I am looking for a solution, if something can be done with this tool.
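To make the script idea concrete, here is a rough sketch of what I have in mind. It is only an outline under some assumptions: instead of a full login it connects to the SSH port (22 assumed) and waits for the service banner, on the guess that a frozen server will not send one in time. The function names `probe` and `check_all` and the 15-second timeout are just placeholders of mine.

```python
import socket
import time


def probe(host, port=22, timeout=15.0):
    """Return (ok, detail). ok is True only if the service sends a banner in time.

    Assumption: a server frozen by a memory crunch may still accept the TCP
    connection but will not deliver the service banner within the timeout.
    The timeout applies to the connect and to the read separately.
    """
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.settimeout(timeout)
            banner = sock.recv(256)
    except OSError as exc:  # timeouts, refused connections, DNS failures
        return False, f"{host}: no response ({exc})"
    elapsed = time.monotonic() - start
    if not banner:
        return False, f"{host}: connection closed without a banner"
    return True, f"{host}: Login OK in {elapsed:.1f}s"


def check_all(hosts, port=22, timeout=15.0):
    """Probe every host in turn and return only the failure messages."""
    failures = []
    for host in hosts:
        ok, detail = probe(host, port, timeout)
        if not ok:
            failures.append(detail)
    return failures
```

Run from cron on a monitoring server, `check_all` would take the server list from one file, and any non-empty result could be mailed to the admins (for example with `smtplib`). Keeping the host list in one place is what should stop it from getting messy across many servers.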