Opinion on auto-restart of failed apps/services


 
Thread Tools Search this Thread
Top Forums UNIX for Advanced & Expert Users Opinion on auto-restart of failed apps/services
# 8  
Old 05-23-2011
Quote:
Originally Posted by Corona688
I don't know why you quoted me, you didn't answer either question.
I think I did answer the first but in a round-about way. It's not that leaving it down performs some function. It's that leaving it down keeps it out of the pool. So if the app is bad I don't start sending customer to it again.

The second question just go wrapped in with the first but my answer applies here as well. No it doesn't prevent me from debugging them. Just prevents me from returning a potentially bad server to the pool.

---------- Post updated at 03:37 PM ---------- Previous update was at 03:26 PM ----------

Quote:
Originally Posted by Neo
Debugging why processes fail is another topic and certainly should not be used as an excuse to shave uptime down.
I agree with you on this. Where I work, however, "uptime" is measured as an SLA. So if I'm hosting a web service, we're not judged by the individual uptime of the hosts and services that make up a pool, we're judged by the availability of the service. So "Does the service respond with the proper data in < 1 sec" is far more important than "Did all the servers in the web service pool stay up this year".

If I restart apache (as an example) automatically, and there is something wrong with that instance that causes it to respond in 5 sec instead of <1, I'll have to answer for that.

So my general philosophy is that you provide high availability with hot spares, load balancing, or some other type of redundancy. Not with auto-restarts. But it sounds like I may be alone in this.

Are most people being measured by uptime on individual hosts/apps as opposed to service availability?
# 9  
Old 05-23-2011
If a server going down doesn't break anything then I guess it's not quite as important. You never hinted about that until now though, many things aren't in pools like that. Presumably some servers are more important than others, too -- you can't avoid having some sort of physical storage somewhere...
# 10  
Old 05-25-2011
Most of the servers I administer are behind a load balancer just like yours and they come out of a pool when they are acting up. However, what I tend to do (for tomcat applications for example) is get a thread dump of what the application is doing at the time, get all the logs, as well as what processes including memory/cpu usage are running on the system. I then use this to debug it. If this is a single point of failure I go for the quick restart after collecting as much data as possible.

Our uptime is managed by service availability, but our SLA's include hosts/services on those hosts. So I am required to respond to them even if the service is still functioning properly.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Debian

How do i correct restart network-services in Debian?

Hello, I would like to do follow steps. Set a static IP-Adress on eth0 (For Testing) Set DHCP on eth0 All steps should be done without a single reboot. /etc/network/interfaces iface eth0 inet static address 192.0.2.7/24 gateway 192.0.2.254How do i perform... (3 Replies)
Discussion started by: int3g3r
3 Replies

2. Shell Programming and Scripting

Script to auto restart java for 100 percent

I want the tomcat to restart when java goes 100% cpu utilize and remain on this , Get pid kill and start tomcat . top | grep java We can get pid and cpu utilize , But how can we do on run time. Please use code tags as required by forum rules! (1 Reply)
Discussion started by: kaushik02018
1 Replies

3. Red Hat

Restart of services if port no is changed in /etc/services in RHEL

I had a doubt if any services need to be restarted if port no in /etc/services in an RHEL setup is changed. For eg, the port no of 443 for SSL may need to be changed. I hope my query is clear whether any services need to be restarted if port no in /etc/services is changed. Please revert with... (10 Replies)
Discussion started by: RHCE
10 Replies

4. Shell Programming and Scripting

Need script to restart the services

Hi Guys, I need bash script to restart the service. 1. Disable the service called SASM svcadm disable sasm 2. if service went to maintenance mode then it shuld clear it with below command svcadm clear sasm 3.or else it should restart the mysql service /etc/init.d/mysql stop... (1 Reply)
Discussion started by: bapu1981
1 Replies

5. AIX

problem to restart services from /etc/inittab in AIX6.1

hello, i have an AIX6.1.7.2 machine that it was upgraded recently from AIX5.3.9.4. when i kill system services that should restart automatically like /usr/sbin/cron it doesnt start. i checked my /etc/inittab file and i confirmed that this service is in respawn status so when i kill this process... (2 Replies)
Discussion started by: omonoiatis9
2 Replies

6. Shell Programming and Scripting

Auto restart script does not work

I have a service that are not 100% stable and stops from time to time. So I have a script that do restart the service if it does not run. This script works win on Ubuntu 9.04 but will not start the service in Ubuntu 10.10 If I run the part that do starts the service from CLI, it starts... (2 Replies)
Discussion started by: Jotne
2 Replies

7. HP-UX

Script to auto restart a service

Hi All, May i please know if it is possible to write a script to check the log messages and automatically restart a service if it is failed or it is stopped. Appreciate your suggestions. Thanks in advance. regards, Eajaz (2 Replies)
Discussion started by: ajazshariff
2 Replies

8. Linux

file location for GNOME auto startup apps

I know how to add an apps to auto-start in GUI, but I'd like to know how to do it mannualy. So where is the file saved to by GUI ? (1 Reply)
Discussion started by: honglus
1 Replies

9. Shell Programming and Scripting

Auto Detection/Restart of Sybase Deadlocks

Does anyone have an example of a ksh script that executes a Sybase stored procedure, via the ISQL command, and can detect a deadlock and loop until the process completes successfully? I'm a little confused on where to start. Thanks in advance for any assistance you can provide. (0 Replies)
Discussion started by: BCarlson
0 Replies
Login or Register to Ask a Question