Questions tagged [high-availability]

High availability is an architectural consideration often involving degrees of redundancy to insure availability in case of system or component failure.

High availability is an architectural consideration often involving degrees of redundancy to insure availability in case of system or component failure. Highly available systems are commonly implemented in situations where the risk outweighs the cost.

Best practices such as change control, standardized procedures, and monitoring can also enable high availability.

965 questions
80
votes
11 answers

Multiple data centers and HTTP traffic: DNS Round Robin is the ONLY way to assure instant fail-over?

Multiple A records pointing to the same domain seem to be used almost exclusively to implement DNS Round Robin as a cheap load balancing technique. The usual warning against DNS RR is that it is not good for high availability. When 1 IP goes down…
38
votes
6 answers

Windows 2008 ignores Gratuitous ARP requests

We recently saw an issue after a fail over of our router where our Windows 2008 Boxes didn't start talking to the primary router after fail-back. When we did some digging they still had the ARP entry from the secondary router. According to the…
Zypher
  • 36,995
  • 5
  • 52
  • 95
27
votes
9 answers

Alternatives to Heartbeat, Pacemaker and CoroSync?

Are there any major alternatives for automatic failover on Linux besides the typical Heartbeat/Pacemaker/CoroSync combinations? In particular, I'm setting up failover on EC2 instances, which only supports unicast - no multicast or broadcast. I'm…
organicveggie
  • 1,061
  • 3
  • 14
  • 27
22
votes
8 answers

Avoiding DNS timeouts when a DNSserver fails

We have a small datacenter with about a hundred hosts pointing to 3 internal DNS servers (bind 9). Our problem comes when one of the internal DNS servers becomes unavailable. At that point all the clients that point to that server start performing…
Neil Katin
  • 321
  • 1
  • 2
  • 3
21
votes
2 answers

What is the difference between keepalive and heartbeat?

I want to structure a high available server cluster . Now I want to know detail about keepalive and heartbeat, what is the difference between both, and How to choice one.
aboutstudy
  • 957
  • 3
  • 10
  • 10
21
votes
3 answers

What is the difference between Anycast and GeoDNS / GeoIP wrt HA?

Based on the Wikipedia description of Anycast, it includes both the distribution of a domain-name-to-many-IP-mapping across many DNS servers as well as replying to clients with the most geographically close (or fastest) server. In the context of a…
20
votes
6 answers

DNS Round Robin: Do browsers stick to one IP as long as it is online?

How do most browsers behave if they get multiple A-records from the DNS server? Do the stick to one IP as long as it is reachable (and only use another if the IP is down)? Or do they switch all the time for no reason? If the majority current…
17
votes
1 answer

Highly-available, Web-accessible and scalable deployment of statsd and graphite

I'd like to setup statsd/graphite so that I can log JS apps running on HTML devices (ie. not in a contained LAN environment, and possibly with a large volume of incoming data that I don't directly control). My constraints: entry point must speak…
David142
  • 353
  • 1
  • 2
  • 9
16
votes
3 answers

Multi-site high availability

We have a SaaS application that we need to be highly available. We already have an expensive, well-maintained Hyper-V failover cluster, but today the datacenter where we host that cluster had a five-hour power outage that knocked us completely…
Mike
  • 1,261
  • 5
  • 18
  • 31
16
votes
8 answers

When is the right time to introduce high availability for web site?

When is the right time to introduce high availability for web site? There are many articles on High Availability options. It’s not that obvious however WHEN is the right time to switch from single server to high availability configuration. Please…
Dennis Gorelik
  • 331
  • 2
  • 8
15
votes
2 answers

Options for Multisite High Availability with Puppet

I maintain two datacenters, and as more of our important infrastructure starts to get controlled via puppet, it is important the the puppet master work at the second site should our primary site fail. Even better would be to have a sort of active /…
Kyle Brandt
  • 82,107
  • 71
  • 302
  • 444
15
votes
5 answers

When my A web server gets unplugged, how do I automatically redirect all the users to my B web server in another city, and vice versa?

When my A web server gets unplugged, how do I automatically redirect all the users to my B web server in another city, and vice versa? A load-balancing switch does what I want, except I can't figure out how to get it to work unless both web servers…
David Cary
  • 398
  • 3
  • 16
15
votes
1 answer

Replicating beanstalkd for High Availability

Title says it all. Does anyone know of a way to replicate beanstalkd such that if a beanstalk server went down, others slaves could take over? Here's one approach I thought of: I could make beanstalk write its binlog (with the -b) to a shared…
Josh Nankin
  • 722
  • 11
  • 27
13
votes
2 answers

How to do client side NFS failover in Linux?

I have a CentOS 6.3 client that needs to access NFS storage. There are two NFS servers that serve up the same content stored on a SAN with a clustered filesystem. How do I set up CentOS to failover to the backup NFS server if needed? When I…
Doug
  • 371
  • 2
  • 5
  • 15
13
votes
5 answers

How to setup STONITH in a 2-node active/passive linux HA pacemaker cluster?

I am trying to setup an active/passive (2 nodes) Linux-HA cluster with corosync and pacemaker to hold a PostgreSQL-Database up and running. It works via DRBD and a service-ip. If node1 fails, node2 should take over. The same if PG runs on node2 and…
MMore
  • 523
  • 2
  • 6
  • 12
1
2 3
64 65