Quote:
Originally Posted by Corona688
I don't know why you quoted me, you didn't answer either question.
I think I did answer the first, but in a roundabout way. It's not that leaving it down performs some function; it's that leaving it down keeps it out of the pool. So if the app is bad, I don't start sending customers to it again.
The second question just got wrapped in with the first, but my answer applies here as well. No, it doesn't prevent me from debugging them; it just prevents me from returning a potentially bad server to the pool.
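To make that concrete, here is a minimal sketch of the idea (the hostnames, health URLs, and 10-second interval are placeholders, not anything I actually run): a periodic health check drops a failing backend out of the pool and deliberately never puts it back on its own, so a human gets to look at it before it sees customer traffic again.
Code:
import time
import urllib.request

# Hypothetical backends behind a load balancer; True means "in the pool".
BACKENDS = {
    "http://app01.example.com:8080/health": True,
    "http://app02.example.com:8080/health": True,
}

def healthy(url, timeout=2):
    """Return True if the backend answers its health URL with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        return False

while True:
    for url, in_pool in BACKENDS.items():
        # Only probe backends still taking traffic. A backend that fails
        # stays out of the pool until someone re-enables it after debugging;
        # there is no auto-restart and no automatic re-add here.
        if in_pool and not healthy(url):
            BACKENDS[url] = False
            print(f"removed {url} from pool; leaving it down for inspection")
    time.sleep(10)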
---------- Post updated at 03:37 PM ---------- Previous update was at 03:26 PM ----------
Quote:
Debugging why processes fail is another topic and certainly should not be used as an excuse to shave uptime down.
I agree with you on this. Where I work, however, "uptime" is measured against an SLA. So if I'm hosting a web service, we're not judged by the individual uptime of the hosts and services that make up a pool; we're judged by the availability of the service. "Does the service respond with the proper data in < 1 sec" is far more important than "Did all the servers in the web service pool stay up this year".
If I automatically restart Apache (as an example), and there is something wrong with that instance that causes it to respond in 5 seconds instead of under 1, I'll have to answer for that.
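That kind of SLA check boils down to "right answer, within the latency budget" in a single probe. A rough sketch of what I mean (the URL, expected body, and 1-second budget are made-up placeholders): a slow response from a limping, auto-restarted instance fails the check just as surely as no response at all.
Code:
import time
import urllib.request

SERVICE_URL = "http://webservice.example.com/api/status"  # placeholder URL
EXPECTED = b'"status":"ok"'                               # placeholder body check
BUDGET_SECS = 1.0                                         # SLA latency budget

def sla_probe():
    """Return (ok, elapsed); ok only if the response is correct AND fast enough."""
    start = time.monotonic()
    try:
        with urllib.request.urlopen(SERVICE_URL, timeout=BUDGET_SECS) as resp:
            body = resp.read()
    except OSError:
        return False, time.monotonic() - start
    elapsed = time.monotonic() - start
    return (EXPECTED in body and elapsed < BUDGET_SECS), elapsed

ok, elapsed = sla_probe()
print(f"SLA {'met' if ok else 'violated'} ({elapsed:.3f}s)")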
So my general philosophy is that you provide high availability with hot spares, load balancing, or some other type of redundancy, not with auto-restarts. But it sounds like I may be alone in this.
Are most people being measured by uptime on individual hosts/apps as opposed to service availability?