Can HPOV monitor server hung state ? Post: 303036442

Sponsored Content

Special Forums UNIX and Linux Applications Infrastructure Monitoring Can HPOV monitor server hung state ? Post 303036442 by solaris_1977 on Wednesday 26th of June 2019 03:40:19 PM

06-26-2019

Registered User

Yes, we can assume that it was dead slow. I waited for 20 minutes, before I hit reset button. since this was critical application server, I was not able to wait longer. From VMWare console, I checked and memory graph was not showing peak utilization. But I have seen behavior where server was frozen due to memory crunch and VMWare console doesn't show that. So your theory can be true in this case too.

VMWare platform is stable (over 300 VM servers are running on it) and none other VM complained about it and still have large amount of memory. Memory is capped to each VM. For example, for this affected VM, 8 GB of memory is allocated.

As a preventive action, I am looking for alert, if it happens again. There could be a small script running from other server and keep login to affected server and say "Login OK". As soon as it delays or not responding for 10-20 seconds, it will send email to admins. But we want to do it many servers and that will become a messy solution. HPOpenview is handled by different team, but they are not taking any initiative to advance on it. So I am looking for solution, if something can be done with this tool.

solaris_1977

View Public Profile for solaris_1977

Find all posts by solaris_1977

10 More Discussions You Might Find Interesting

1. HP-UX

Server hung

So my server was hung when I came in this morning. It was responding to pings, but the console and telnet sessions would not respond. There was no disk activity. The display said FA1F which I discovered that the "A" represents a high CPU load. I tired several things to get it going but was forced...

2. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Hi Guys, Just wondering if anyone of you have been in a situation where you end up having around 100 close_wait connections and seems to me those connections are locking up resources/processes in the server so unless the server is rebooted those processes won't be released by the close_wait...

3. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Our network administrators implemented some sort of check to kill idle sessions and now burden is on us to run some sort of keep alive. Client based keep alive doesn't do a very good job. I have same issue with ssh. Does solution 2 provided above apply for ssh sessions also?

4. Solaris

How to clear maintenance state for apache2 server?

Can any one of you suggest me the method to get apache server in online from maintenance mode. I tried in the following way, but couldn't get that service to online. bash-3.00# svcs -a | grep apache legacy_run 9:51:55 lrc:/etc/rc3_d/S50apache offline 9:51:22...

5. Solaris

HPOV in solaris

Good Day all i have a solaris 8 server and i want the procedure for how to install HPOV becuse dont have any small info about solaris .

6. Shell Programming and Scripting

How can i get the state of weblogic server

Hi, all Now i want write a shell to get the state of weblogic server,and when the Managed Server's state is not ok, after 3 times checking, i will send msg to the system administrator by sms. BTW, my environment is : Linux ,Redhat 5.4 64bit weblogic version: 10.3.3 the count number...

7. SuSE

Server hung with firmware error

Hi all We've had an issue over the weekend when one of the SUSE Linux Enterprise Server 11 hung and had to be rebooted. The thing is that I got the ticket alert for a FS exceeding its usage at about 22:41:49 PM on 23 March. I checked the dmesg, the messages log and the boot.msg but all I found...

8. Red Hat

How to find the process which is caused system hung state?

when system is hung state due to swap, we will reboot it through ILO. i want to know which process caused system hung.

9. Linux

Server hung, is this a stack trace?

Hi everyone, Our Red Hat server hung yesterday, and I managed to log into the console and see the following message: RIP: 0010: mwait_idle_with_hints+0x66/ 0x67 RSP: 0018:ffffffff80457f40 EFLAGS: 00000046 RAX: 0000000000000010 RBX: ffff810c20075910 RCX: 0000000000000001 RDX:...

10. UNIX for Dummies Questions & Answers

Flow control state changed in server logs

Hi, We observe below logs from switch - the database servers rebooted becaause they couldn't do I/O on vfiler -Any pointers looking at below logs please? Switch logs: 2016 Apr 30 07:41:16.729 EAG-ECOM-POD111GPU-SWF1 %ETHPORT-5-IF_DOWN_LINK_FAILURE: Interface Ethernet152/1/8 is down (Link...

LEARN ABOUT DEBIAN

mergelogs

MERGELOGS(1)						      General Commands Manual						      MERGELOGS(1)

NAME

       mergelogs - merge and consolidate web server logs

SYNOPSIS

       mergelogs -p penlog [-c] [-d] [-j jitter] [-t seconds] server1:logfile1 [server2:logfile2 ...]

EXAMPLES

       mergelogs -p pen.log 10.0.0.1:access_log.1 10.0.0.2:access_log.2

       mergelogs -p pen.log 10.0.18.6:access_log-10.0.18.6 10.0.18.8:access_log-10.0.18.8

DESCRIPTION

       When pen is used to load balance web servers, the web server log file lists all accesses as coming from the host running pen. This makes it
       more difficult to analyze the log file.

       To solve this, pen creates its own log file, which contains the real client address, the time of the access, the target server address  and
       the first few bytes of the requests.

       Mergelogs reads pen's log file and the log files of all load balanced web servers, compares each entry and creates a combined log file that
       looks as if the web server cluster were a single physical server.  Client addresses are replaced with the real client addresses.

       In the event that no matching client address can be found in the pen log, the server address is used instead. This should never happen, and
       is  meant  as  a debugging tool. A large number of these indicates that the server system date needs to be set, or that the jitter value is
       too small.

       You probably don't want to use this program. Penlog is a much more elegant and functional solution.

OPTIONS

       -c     Do not cache pen log entries. The use of this option is not recommended, as it will make mergelogs search the  entire  pen  log  for
	      every line in the web server logs.

       -d     Debugging (repeat for more).

       -p penlog
	      Log file from pen.

       -j jitter
	      Jitter  in  seconds (default 600). This is the maximum variation in time stamps in the pen and web server log files. A smaller value
	      will result in a smaller pen log cache and faster processing, at the risk of missed entries.

       -t seconds
	      The difference in seconds between the time on the pen server and UTC.  For example, this is 7200 (two hours) in Finland.

       server:logfile
	      Web server address and name of log file.

AUTHOR

       Copyright (C) 2001-2003 Ulric Eriksson, <ulric@siag.nu>.

SEE ALSO

       pen(1), webresolve(1), penlog(1), penlogd(1)

								       LOCAL							      MERGELOGS(1)

10 More Discussions You Might Find Interesting

1. HP-UX

Server hung

Discussion started by: biznatch

2. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Discussion started by: hariza

3. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Discussion started by: yoda9691

4. Solaris

How to clear maintenance state for apache2 server?

Discussion started by: Sesha