Uptime Monitoring Every Second - Bad For the Server?

Question

I'm wondering if there's avantages of checking if a server is UP by doing a "HTTP GET Request" every second?

Can any server handle it?

Another option is doing the opposite: rather than monitoring the server from outside, monitor the server from inside, such as with http://www.r-u-on.com/. You basically install a small script on your server that pings another server very often, so you can monitor your uptime without making life harder for your web server. — Maxim Zaslavsky, Apr 03 '11 at 20:37
@Maxim, there are several problems with your suggestion. First, it doesn't check that the HTTP service is running on the server. Second, there is the problem of what happens when the server itself is down. That still needs to monitored. Besides, the same result could be had by a simple wget against the local machine. — John Gardeniers, Apr 04 '11 at 08:38

Joseph Kern · Accepted Answer · 2011-04-03T14:19:43.867

Can "any" server handle it? Probably.

Should you do it? Probably not.

Ask yourself a few questions:

How fast will you be to respond to an outage?
How many pageviews do you normally receive per second?
How many consecutive errors are you willing to see before calling it "Down" and sending an alert?
Do you have any SLA with internal or external customers that needs to be honored?
Based the questions listed above what seems like a reasonable monitoring and response time?

When I was first learning to program, I decided I wanted to make a stopwatch. When I finally got a working application, I noticed the CPU usage on my laptop was at 100% whenever I ran it.

My execution loop didn't have a wait cycle. It just kept executing over the time function.

On that day I learned a valuable lesson: there is no such thing as an infinitely accurate measurement.

score 6 · Answer 2 · answered Apr 03 '11 at 22:32

While I, like just about everyone else, question the reason for wanting to monitor so frequently the technical side is a non-issue. One GET request every second is absolutely trifling compared to a typical page load.

Can your server handle it? We have nothing upon which to answer such a question but if your server has an issue handling it then I would suggest it would be completely inadequate for whatever else it's serving.

score 3 · Answer 3 · answered Apr 03 '11 at 14:05

3

Nagios or munin can probably handle running the test every second but it's a bit obsessive. Is there a reason you need to check so frequently? If your server is that unstable you probably have deeper problems.

answered Apr 03 '11 at 14:05

slillibri

1,633
1
9
8

score 1 · Answer 4 · answered Apr 03 '11 at 19:33

1

Most commercial monitoring software offer 1-min or 5-min interval by default. That seems to be a good check interval.

answered Apr 03 '11 at 19:33

sapguy

149
3

Pingdom, for instance, allows you to set an interval and then on detecting the first outage, increase the frequency at which it pings the server to see if it's back up. – Ankur Banerjee Apr 03 '11 at 20:47
>, increase the frequency ..=> but minimum is still 1 min, or? – sapguy Apr 03 '11 at 20:56
On free accounts I think the lowest that Pingdom offers is 1 minute. I don't have a premium account so I cannot say whether they offer an option for even more frequent checks for them. – Ankur Banerjee Apr 03 '11 at 22:00

score 1 · Answer 5 · answered Apr 05 '11 at 20:38

There's nothing wrong with monitoring the server every second, it's just not very efficient especially on servers with high load where an Apache query may hang for a couple of seconds causing your requests to either backup, or issue false alerts for that particular moment, but it's not 'wrong'. One second checks won't make you any faster at responding, and in 99.9% of all circumstances, a 10 or 30 second check is just as important.

score 0 · Answer 6 · answered Apr 03 '11 at 16:40

I agree 100% with Joseph here. If you still want to do some sort of real-time monitoring you could consider sniffing the webserver log for both server errors and absence of new entries in the log for a period of time. It won't put a load on the server, but triggering alerts based on this is a challenge :)

score 0 · Answer 7 · answered Apr 04 '11 at 05:58

0

1 second resolution is really high and probably not needed. However I prefer collectd as it was designed for much higher resolution (ever 10 seconds) then other OSS tools like munin (5 minutes).

answered Apr 04 '11 at 05:58

n8whnp

1,316
7
9

Uptime Monitoring Every Second - Bad For the Server?

7 Answers7