Sponsored Content
Special Forums UNIX and Linux Applications Infrastructure Monitoring Can HPOV monitor server hung state ? Post 303036442 by solaris_1977 on Wednesday 26th of June 2019 03:40:19 PM
Old 06-26-2019
Yes, we can assume that it was dead slow. I waited for 20 minutes, before I hit reset button. since this was critical application server, I was not able to wait longer. From VMWare console, I checked and memory graph was not showing peak utilization. But I have seen behavior where server was frozen due to memory crunch and VMWare console doesn't show that. So your theory can be true in this case too.

VMWare platform is stable (over 300 VM servers are running on it) and none other VM complained about it and still have large amount of memory. Memory is capped to each VM. For example, for this affected VM, 8 GB of memory is allocated.

As a preventive action, I am looking for alert, if it happens again. There could be a small script running from other server and keep login to affected server and say "Login OK". As soon as it delays or not responding for 10-20 seconds, it will send email to admins. But we want to do it many servers and that will become a messy solution. HPOpenview is handled by different team, but they are not taking any initiative to advance on it. So I am looking for solution, if something can be done with this tool.
 

10 More Discussions You Might Find Interesting

1. HP-UX

Server hung

So my server was hung when I came in this morning. It was responding to pings, but the console and telnet sessions would not respond. There was no disk activity. The display said FA1F which I discovered that the "A" represents a high CPU load. I tired several things to get it going but was forced... (6 Replies)
Discussion started by: biznatch
6 Replies

2. UNIX for Advanced & Expert Users

close_wait connections causing a server to hung

Hi Guys, Just wondering if anyone of you have been in a situation where you end up having around 100 close_wait connections and seems to me those connections are locking up resources/processes in the server so unless the server is rebooted those processes won't be released by the close_wait... (3 Replies)
Discussion started by: hariza
3 Replies

3. HP-UX

ssh session getting hung (smilar to hpux telnet session is getting hung after about 15 minutes)

Our network administrators implemented some sort of check to kill idle sessions and now burden is on us to run some sort of keep alive. Client based keep alive doesn't do a very good job. I have same issue with ssh. Does solution 2 provided above apply for ssh sessions also? (1 Reply)
Discussion started by: yoda9691
1 Replies

4. Solaris

How to clear maintenance state for apache2 server?

Can any one of you suggest me the method to get apache server in online from maintenance mode. I tried in the following way, but couldn't get that service to online. bash-3.00# svcs -a | grep apache legacy_run 9:51:55 lrc:/etc/rc3_d/S50apache offline 9:51:22... (3 Replies)
Discussion started by: Sesha
3 Replies

5. Solaris

HPOV in solaris

Good Day all i have a solaris 8 server and i want the procedure for how to install HPOV becuse dont have any small info about solaris . (1 Reply)
Discussion started by: thecobra151
1 Replies

6. Shell Programming and Scripting

How can i get the state of weblogic server

Hi, all Now i want write a shell to get the state of weblogic server,and when the Managed Server's state is not ok, after 3 times checking, i will send msg to the system administrator by sms. BTW, my environment is : Linux ,Redhat 5.4 64bit weblogic version: 10.3.3 the count number... (1 Reply)
Discussion started by: wangsk
1 Replies

7. SuSE

Server hung with firmware error

Hi all We've had an issue over the weekend when one of the SUSE Linux Enterprise Server 11 hung and had to be rebooted. The thing is that I got the ticket alert for a FS exceeding its usage at about 22:41:49 PM on 23 March. I checked the dmesg, the messages log and the boot.msg but all I found... (1 Reply)
Discussion started by: hedkandi
1 Replies

8. Red Hat

How to find the process which is caused system hung state?

when system is hung state due to swap, we will reboot it through ILO. i want to know which process caused system hung. (1 Reply)
Discussion started by: Naveen.6025
1 Replies

9. Linux

Server hung, is this a stack trace?

Hi everyone, Our Red Hat server hung yesterday, and I managed to log into the console and see the following message: RIP: 0010: mwait_idle_with_hints+0x66/ 0x67 RSP: 0018:ffffffff80457f40 EFLAGS: 00000046 RAX: 0000000000000010 RBX: ffff810c20075910 RCX: 0000000000000001 RDX:... (6 Replies)
Discussion started by: badoshi
6 Replies

10. UNIX for Dummies Questions & Answers

Flow control state changed in server logs

Hi, We observe below logs from switch - the database servers rebooted becaause they couldn't do I/O on vfiler -Any pointers looking at below logs please? Switch logs: 2016 Apr 30 07:41:16.729 EAG-ECOM-POD111GPU-SWF1 %ETHPORT-5-IF_DOWN_LINK_FAILURE: Interface Ethernet152/1/8 is down (Link... (0 Replies)
Discussion started by: admin_db
0 Replies
ntpd(8) 						      System Manager's Manual							   ntpd(8)

Name
       ntpd - network time protocol (NTP) daemon

Syntax
       /usr/etc/ntpd [ -a threshold ][ -c file ][ -d ][ -D level ][ -l ][ -n ][ -s ]

Description
       The  University of Maryland's daemon synchronizes the local clock with a set of distributed time servers.  The daemon distributes accurate,
       reliable time from the best time source available at your site to hosts on wide area networks (WAN) and local  area  networks  (LAN).   The
       three  recommended  time  sources in decreasing order of accuracy are:  Internet NTP service, local radio clock, and wristwatch.  Note that
       the daemon does not require time servers to be on the same LAN as time clients, and does not create a heavy broadcast load on the  network.
       If  the	NTP servers are not on your Local Area Network (LAN), you must run the daemon before running the daemon.  To run remove the number
       signs (#) from in front of the following lines in your file:
       #[ -f /etc/routed ] && {
       #       /etc/routed & echo 'routed'	       >/dev/console
       #}
       The daemon will be invoked when you reboot your system.	To start without rebooting, type the following on the command line:
       /etc/routed
       For information on setting up the network time services, see the Guide to System and Network Setup.

       The daemon automatically splits the nodes running the daemon into a dynamically reconfigurable hierarchy of nodes.  The nodes  at  the  top
       level  of the hierarchy (low stratum numbers) are connected to the most accurate sources available.  This information is transferred to the
       lower-level nodes (higher stratum numbers) which set their clocks based on the calculated offset from a remote server, and then	distribute
       this time to lower levels of the hierarchy.

       The daemon provides a solution for distributing time to a large number of individual workstations.  It can also be used in conjunction with
       a master daemon to distribute NTP time to workstations running If is run with the and options on at least one system that is  also  running
       then  all other systems on the network running can receive time updates from a host running Although is easier to set up on clients, NTP is
       recommended because it is more accurate and more secure.

       Normally, the daemon is invoked at boot time from the file.  When is started, it reads configuration information from the file, unless  you
       have specified another configuration file with the option.  The configuration file either specifies the list of NTP servers with which this
       host should synchronize, or identifies this host as a local reference clock.  See the reference page for more information on the configura-
       tion file.

       The daemon uses the system call to gradually adjust the local clock for small clock offsets (< 0.128 seconds).  If the local clock lags the
       time on the server by more than 0.128 seconds, the system call is used to make a forward step adjustment of the local  clock.   Clocks  are
       never  stepped  backwards; they are adjusted gradually, which can take a very long time.  Therefore, it is important to initialize the time
       using the command before running the daemon.

Options
       -a threshold
	    Sets the threshold (in seconds) which limits how far the daemon can change the local clock.  By default, the threshold  is	1000  sec-
	    onds.   This  is  set  to avoid propagating major mistakes throughout the network.	If you specify the string instead of a number, the
	    daemon can change the local clock by any amount.

       -c file
	    Specifies a configuration file for the daemon.  By default, the configuration file is

       -d   Increments the debug level by one.	The option can be specified more than once.  Higher debug levels provide more diagnostic  informa-
	    tion.

       -D level
	    Sets the debug level to the specified value.

       -l   Causes  the daemon to log a message each time the local clock is adjusted.	Specify this option only if you want to gather statistical
	    information to analyze the local clock behavior.  If the option is set, a message may be  logged  every  two  minutes.   Messages  are
	    logged to

       -n   Inhibits the program from being swapped out of memory.  Using the option is recommended for both time servers and time clients.

       -s   Prevents the daemon from altering the time on the local host.  The daemon participates as an NTP server with the flag set, but it does
	    not change the time of the local host.

Examples
       Before starting the daemon, either manually or from the file, you must edit the file with the appropriate information.  If your system is a
       client  you  must specify the time servers for it to query.  If it is a time server, you must specify the time servers with which it peers.
       See the reference page for more information.

       To start the daemon manually (on a time client), enter the following commands:
       # /etc/rdate -s
       # /usr/etc/ntp -s -f server1 server2 server3
       # /usr/etc/ntpd -n
       The command initializes your time to the average network time.  The command further refines the initial time to the NTP time.  The  servers
       specified on the command line are the same as those specified in the file.

       To  start  the daemon from the file (on a time client), place the following entries in the file.  Multiple servers are included in case one
       of the servers crashes, or is brought down.  The servers specified in the file are the same as those specified in the file.  These  entries
       should be placed after the entry:

       [-f /etc/syslog] && {
	    /etc/syslog   & echo -n ' syslog'		>/dev/console
       }
       [-f /etc/rdate] && {
	    /etc/rdate -s   & echo -n ' rdate'		>/dev/console
       }
       [-f /usr/etc/ntp] && {
	    /usr/etc/ntp -s -f server1 server2 server3 
			     & echo -n ' ntp'		>/dev/console
       }
       [-f /usr/etc/ntpd] && {
	    /usr/etc/ntpd -n & echo -n ' ntpd'		>/dev/console

       }

Diagnostics
       The  daemon  logs errors, major state changes, and statistics reports using the daemon; the log entries appear in the file with the word on
       each   relevant line.  Normal log entries show when gains or loses synchronization with a lower-stratum host.  Also, once an hour issues  a
       entry that gives information about its state.

       Once  an  hour,	if is synchronized, it updates the file.  This file shows the estimated clock drift for each of the past 5 hours, with the
       most recent hour listed first.  (The 6th number in this file is the number of hours has been running).  To  convert  the  drift	values	to
       parts  per  million  (ppm), divide them by 4096 and multiply by 1000000.  For example, +0.0107001161 means that estimates that the clock is
       drifting by about 2.61 ppm, or is losing about 0.226 seconds per day.  [(2.61/1000000) * 24 * 60 * 60 = 0.226].	Negative drift values mean
       that the clock is gaining time.	If is working, your computer's clock should be accurate to within a few seconds per day.

       Another	diagnostic  tool is the command.  You can use this to look at any host running The following command line returns the state of the
       remote host's server:
       % /usr/etc/ntpdc hostname

       The value that is returned for the should contain values not greater than 100 milliseconds.  See the reference page for more information.

       If the daemon sets the time as frequently as every 10 minutes (indicated by messages in the file), then you should kill the daemon,  remove
       the file, run the command to initialize the time, and restart the daemon.

       If  your  system  clock	is ahead of the server time by more than 1 second, you should kill the daemon, remove the file, run the command to
       initialize the time, and restart the daemon.

       If your clock is more than 1000 seconds off from the server time, and you did not specify the option, will not  change  your  system  time.
       Rather, will repeatedly log messages to the file, indicating that the time is too far off to reset.

See Also
       ntp(1), adjtime(2), settimeofday(2), ntp.conf(5), ntpdc(8), timed(8)
       RFC 1129--Internet Time Synchronization:  the Network Time Protocol
       Guide to System and Network Setup
       Introduction to Networking and Distributed System Services

																	   ntpd(8)
All times are GMT -4. The time now is 11:49 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy