Can HPOV monitor server hung state ? Post: 303036448

Sponsored Content

Special Forums UNIX and Linux Applications Infrastructure Monitoring Can HPOV monitor server hung state ? Post 303036448 by Neo on Wednesday 26th of June 2019 09:58:41 PM

06-26-2019

Administrator

HPOV like most centralized network management systems (NMS) rely primarily on polling to update their management system.

This means you cannot get information by polling an unresponsive server.

However, most of these same systems also can be configured to send traps.

This means, your HPOV team needs to set up your system to send traps back to the management system before the system being monitored slows to a crawl (soft failure) and cannot respond to polling,

In other words, most novices set up network management systems like HPOV to use only polling; but experienced network management people will also set up traps to be sent back to the NMS for certain critical processes which need to alert the NMS prior to overall system "failure" (also meaning a soft failure, not only a hard failure).

I have extensive hands-on experience in NMS including debugging HPOV when it was a black and white versus decades ago. All well configured NMS will be configured to poll (95% +) and to trap (< 5%) alerts.

Bottom Line: Set up traps for the critical processes to alert the NMS before the system soft-fails (slows to a crawl).

Hope this helps.

Cheers.

Neo

View Public Profile for Neo

Visit Neo's homepage!

Find all posts by Neo

10 More Discussions You Might Find Interesting

1. HP-UX

Server hung

So my server was hung when I came in this morning. It was responding to pings, but the console and telnet sessions would not respond. There was no disk activity. The display said FA1F which I discovered that the "A" represents a high CPU load. I tired several things to get it going but was forced...

2. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Hi Guys, Just wondering if anyone of you have been in a situation where you end up having around 100 close_wait connections and seems to me those connections are locking up resources/processes in the server so unless the server is rebooted those processes won't be released by the close_wait...

3. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Our network administrators implemented some sort of check to kill idle sessions and now burden is on us to run some sort of keep alive. Client based keep alive doesn't do a very good job. I have same issue with ssh. Does solution 2 provided above apply for ssh sessions also?

4. Solaris

How to clear maintenance state for apache2 server?

Can any one of you suggest me the method to get apache server in online from maintenance mode. I tried in the following way, but couldn't get that service to online. bash-3.00# svcs -a | grep apache legacy_run 9:51:55 lrc:/etc/rc3_d/S50apache offline 9:51:22...

5. Solaris

HPOV in solaris

Good Day all i have a solaris 8 server and i want the procedure for how to install HPOV becuse dont have any small info about solaris .

6. Shell Programming and Scripting

How can i get the state of weblogic server

Hi, all Now i want write a shell to get the state of weblogic server,and when the Managed Server's state is not ok, after 3 times checking, i will send msg to the system administrator by sms. BTW, my environment is : Linux ,Redhat 5.4 64bit weblogic version: 10.3.3 the count number...

7. SuSE

Server hung with firmware error

Hi all We've had an issue over the weekend when one of the SUSE Linux Enterprise Server 11 hung and had to be rebooted. The thing is that I got the ticket alert for a FS exceeding its usage at about 22:41:49 PM on 23 March. I checked the dmesg, the messages log and the boot.msg but all I found...

8. Red Hat

How to find the process which is caused system hung state?

when system is hung state due to swap, we will reboot it through ILO. i want to know which process caused system hung.

9. Linux

Server hung, is this a stack trace?

Hi everyone, Our Red Hat server hung yesterday, and I managed to log into the console and see the following message: RIP: 0010: mwait_idle_with_hints+0x66/ 0x67 RSP: 0018:ffffffff80457f40 EFLAGS: 00000046 RAX: 0000000000000010 RBX: ffff810c20075910 RCX: 0000000000000001 RDX:...

10. UNIX for Dummies Questions & Answers

Flow control state changed in server logs

Hi, We observe below logs from switch - the database servers rebooted becaause they couldn't do I/O on vfiler -Any pointers looking at below logs please? Switch logs: 2016 Apr 30 07:41:16.729 EAG-ECOM-POD111GPU-SWF1 %ETHPORT-5-IF_DOWN_LINK_FAILURE: Interface Ethernet152/1/8 is down (Link...

LEARN ABOUT MOJAVE

send_trap_vars

NETSNMP_TRAP_API(3)						     Net-SNMP						       NETSNMP_TRAP_API(3)

NAME

       send_easy_trap, send_trap_vars, send_v2trap - send TRAPs or INFORMs from a Net-SNMP MIB module

SYNOPSIS

       #include <net-snmp/agent/agent_trap.h>

       void send_easy_trap(int trap, int specific);

       void send_trap_vars(int trap, int specific, struct variable_list *vars);

       void send_v2trap(struct variable_list *vars);

DESCRIPTION

       These three routines may be used to send traps from a MIB module within the Net-SNMP agent (including an AgentX subagent).

       send_easy_trap()  sends	an SNMPv1 trap (or the SNMPv2 equivalent) to the list of configured trap destinations (or "sinks"), using the pro-
       vided values for the generic trap type, and specific trap value.

       send_trap_vars() is similar, but appends the supplied list of variable bindings to the traps that are sent.

       send_v2trap() uses the supplied list of variable bindings to form an SNMPv2 trap, which is sent to SNMPv2-capable sinks on  the	configured
       list.  An equivalent INFORM is sent to the configuredq list of inform sinks.  Sinks that can only handle SNMPv1 traps are skipped.

       The  various "send_trap()" calls allow you to specify traps in different formats.  And the various "trapsink" directives allow you to spec-
       ify destinations to receive different formats.  But *all* traps are sent to *all* destinations, regardless of how they were specified.
       I.e. it's
						___  trapsink
					       /
	   send_easy_trap ___  [  Trap	    ] ____  trap2sink
			   ___	[ Generator  ]
	   send_v2trap	  /	[	     ] ----- informsink
					       ____
						     trapsess

       *Not*
	    send_easy_trap  ------------------->  trapsink
	    send_v2trap     ------------------->  trap2sink
	    ????	    ------------------->  informsink
	    ????	    ------------------->  trapsess

WARNINGS

       These routines are used to send the traps immediately they are called.  Invoking them at the appropriate time is left  to  the  MIB  module
       programmer.

SEE ALSO

       snmpd.conf(5), snmptrapd(8)

V5.6.2.1							    13 Aug 2010 					       NETSNMP_TRAP_API(3)

10 More Discussions You Might Find Interesting

1. HP-UX

Server hung

Discussion started by: biznatch

2. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Discussion started by: hariza

3. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Discussion started by: yoda9691

4. Solaris

How to clear maintenance state for apache2 server?

Discussion started by: Sesha