[10962] in North American Network Operators' Group
Re: NSI bulletin 097-004 | Root Server Problems
daemon@ATHENA.MIT.EDU (Gary R Wright)
Thu Jul 17 18:43:21 1997
To: nanog@merit.edu
In-reply-to: Your message of "Thu, 17 Jul 1997 15:43:57 EDT."
             <199707171944.PAA13467@jekyll.piermont.com>
Date: Thu, 17 Jul 1997 18:05:47 -0400
From: Gary R Wright <gwright@connix.com>

"Perry E. Metzger" writes:
> I admit that the problem at NSI is larger by three orders of
> magnitude, but essentially the same sort of scripts could be run. If
> such scripts were in place at NSI, such failures, which have occurred
> multiple times, would never have happened.
>
> Humans CANNOT be trusted with this sort of thing. Humans are
> fallible. You can't have humans involved in this sort of release
> process.

And who wrote the QA scripts you describe?  Complex systems have
complex failure modes. Yes, there are clearly steps that can be
taken to minimize the problems, but anybody who claims that building
"robustness" into complex systems is anything other than "hard"
should spend some time reading the RISKS archives
(http://www.CSL.sri.com/risksinfo.html).