[26118] in North American Network Operators' Group


home	help	back	first	fref	pref	prev	next	nref	lref	last	post

Re: How to achieve application reliability

daemon@ATHENA.MIT.EDU (Sean Donelan)
Sun Dec 5 11:38:29 1999

Date: 5 Dec 1999 08:37:09 -0800
Message-ID: <19991205163709.1764.cpmta@c004.sfo.cp.net>
Content-Type: text/plain
Content-Disposition: inline
Mime-Version: 1.0
To: nanog@merit.edu
From: Sean Donelan <sean@donelan.com>
Errors-To: owner-nanog-outgoing@merit.edu

On Sat, 04 December 1999, James Smith wrote:
> Our only alternative is to eliminate every single-point failure with stuff
> like high availability clustering, redundant feeds, battery backups,
> nuclear reactors, physical separate sites on different planets, etc. :-)
> (Pardon me, it's 2:00am and I'm getting punching)

If you are using Microsoft products in your nuclear reactor, its not going
to be very reliable.  They aren't designed for that purpose.

The tools exist to make very reliable network applications, but we can't
force people to use them.  So long as applications neglect to use the other
information provided by the network, they are going to be vulnerable to
single points of failure.

Multiple A records exist for a reason.  Even if you have high availability
clustering, redundant feeds, battery backups, multi-homing, multi-sites; if
you are depending on a single global network announcement there is nothing
to prevent another ISP from announcing the same prefix with a shorter AS
path length, and effectively blackholing your network.  For people with
ultra-high reliablility requirements, a /19 isn't the solution.


home	help	back	first	fref	pref	prev	next	nref	lref	last	post

[26118] in North American Network Operators' Group

Re: How to achieve application reliability

daemon@ATHENA.MIT.EDU (Sean Donelan)Sun Dec 5 11:38:29 1999

daemon@ATHENA.MIT.EDU (Sean Donelan)
Sun Dec 5 11:38:29 1999