[4966] in testers


Re: erratic host lookup failures

daemon@ATHENA.MIT.EDU (Christopher D. Beland)
Tue Jul 10 05:34:07 2001

Message-Id: <200107100934.FAA06100@Press-Your-Luck.mit.edu>
To: Garry Zacheiss <zacheiss@MIT.EDU>
cc: testers@MIT.EDU
In-reply-to: The events that comprise the history of the universe.
Date: Tue, 10 Jul 2001 05:34:03 -0400
From: "Christopher D. Beland" <beland@MIT.EDU>


> This looks suspiciously like the failure documented in Sun Alert
> 26283, titled "Host Name Lookups Might Fail When Using DNS".  The
> problem description does match what you describe (affects only
> Solaris 8, happens about 1% of the time).


Well, I am also experiencing the problem under Linux. 

Actually, it just happened to me; I think it does indeed have
something to do with timeouts, and maybe also weird server
configurations or conditions.  Here's what happened...


Netscape spewed a "could not find host or domain" error at me when I
attempted to access a file on www.kuro5hin.org.  I played around a
bit, and I noticed that most of the name servers for that domain time
out under (say) nslookup without giving the desired result.
In fact, there's only one name server that actually seems to work, and
it takes some amount of querying to find it.  Included below are some
select results from a lot of testing, following a lookup starting from
a gTLD server.
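
For anyone who wants to repeat the per-server check without nslookup or dig, here is a minimal sketch in Python.  It is not what was run for this message.  The function names build_query and probe, the query id, and the 5 second timeout are made up for illustration.  It hand-builds a single A query, sends it over UDP to one name server address, and reports how long the reply took, or None if nothing came back.  It does not parse the reply or check that the reply id matches.

```python
import socket
import struct
import time


def build_query(name, query_id=0x1234):
    """Build a minimal DNS query for an A record (type 1, class IN)."""
    # Header: id, flags (only "recursion desired" set), 1 question,
    # 0 answer / authority / additional records.
    header = struct.pack("!HHHHHH", query_id, 0x0100, 1, 0, 0, 0)
    # Each label is sent as a length byte followed by the label text;
    # a zero byte terminates the name.
    labels = b"".join(
        bytes([len(label)]) + label.encode("ascii")
        for label in name.rstrip(".").split(".")
    )
    question = labels + b"\x00" + struct.pack("!HH", 1, 1)
    return header + question


def probe(server_ip, name, timeout=5.0):
    """Send one UDP query; return elapsed seconds, or None on timeout/error."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        start = time.monotonic()
        try:
            sock.sendto(build_query(name), (server_ip, 53))
            sock.recvfrom(512)  # reply contents are ignored in this sketch
        except OSError:  # includes socket.timeout
            return None
        return time.monotonic() - start
```

Calling probe() once for each of the four name server addresses listed in the transcripts below, with "www.kuro5hin.org" as the name, would show which servers answer at all and how quickly.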

After I did all that, I flushed my DNS cache by
restarting athena-bind (which claims it is "Stopping Athena NTP" but
actually does the right thing).  I did

nslookup -d www.kuro5hin.org

five times.  The first time, it timed out after about 28 seconds and
didn't get the IP address.  The second time, it succeeded after 9
seconds.  Three more attempts produced a 9-second success and two
28-second timeouts.  (I restarted bind each time, obviously.)  I did 5
trials with "host www.kuro5hin.org" on PYL.  Sure enough, it took 5,
9, 32, 44, and 15 seconds, respectively, to get an answer.
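
The repeated timing trials could be scripted along these lines.  This is only a sketch: time_lookups and its resolver parameter are hypothetical names, and it goes through the system resolver via socket.gethostbyname rather than nslookup or host.  It does not flush any cache, so the local name server would still need to be restarted by hand between trials to reproduce the conditions described above.

```python
import socket
import time


def time_lookups(name, trials=5, resolver=socket.gethostbyname):
    """Resolve name several times; return a list of (seconds, address or None)."""
    results = []
    for _ in range(trials):
        start = time.monotonic()
        try:
            address = resolver(name)
        except OSError:  # socket.gaierror and socket.herror are subclasses
            address = None
        results.append((time.monotonic() - start, address))
    return results
```

Each entry pairs the wall-clock time of one attempt with the address returned, or None when the lookup failed, which makes a mix of slow successes and timeouts easy to spot.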

I didn't notice any different results on whack-a-mole, which is a 9.0
Irix box, though I wasn't nearly as thorough there.

On an oooold Athena 8.2 SIPB-Linux box, I got four 9 second returns
and one 6 second return using the command "host www.kuro5hin.org".  So
maybe something kooky is going on.


---

beland@Press-Your-Luck: nslookup -d www.kuro5hin.org paranoia.kuro5hin.org 
;; res_querydomain(paranoia.kuro5hin.org, <Nil>, 1, 1)
;; res_query(paranoia.kuro5hin.org, 1, 1)
;; res_mkquery(0, paranoia.kuro5hin.org, 1, 1)
;; res_send()
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 59142
;; flags: rd; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
;;	paranoia.kuro5hin.org, type = A, class = IN
;; Querying server (# 1) address = 127.0.0.1
;; got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 59142
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 4
;;	paranoia.kuro5hin.org, type = A, class = IN
paranoia.kuro5hin.org.	1d21h15m16s IN A  63.111.177.29
kuro5hin.org.		10m2s IN NS	dukat.thock.com.
kuro5hin.org.		10m2s IN NS	kira.thock.com.
kuro5hin.org.		10m2s IN NS	ns.intes.net.
kuro5hin.org.		10m2s IN NS	paranoia.kuro5hin.org.
dukat.thock.com.	1d23h38m11s IN A  142.165.167.48
kira.thock.com.		1d16h50m10s IN A  142.165.167.47
ns.intes.net.		1d23h39m29s IN A  63.111.177.5
paranoia.kuro5hin.org.	1d21h15m16s IN A  63.111.177.29
;; res_mkquery(0, 29.177.111.63.in-addr.arpa, 1, 12)
timeout
timeout
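
The last res_mkquery line in that trace is a type 12 (PTR) query for the server's own address, written in reversed in-addr.arpa form.  As a small illustration, Python's standard ipaddress module produces the same name.  The helper name reverse_name is made up here.

```python
import ipaddress


def reverse_name(address):
    """Return the in-addr.arpa name used for a reverse (PTR) lookup."""
    return ipaddress.ip_address(address).reverse_pointer


print(reverse_name("63.111.177.29"))  # prints 29.177.111.63.in-addr.arpa
```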


The situation is the same under Irix...

beland@whack-a-mole: nslookup -d www.kuro5hin.org paranoia.kuro5hin.org 
;; res_querydomain(paranoia.kuro5hin.org, <Nil>, 1, 1)
;; res_query(paranoia.kuro5hin.org, 1, 1)
;; res_mkquery(0, paranoia.kuro5hin.org, 1, 1)
;; res_send()
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 1276
;; flags: rd; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
;;	paranoia.kuro5hin.org, type = A, class = IN
;; Querying server (# 1) address = 127.0.0.1
;; got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 1276
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 4
;;	paranoia.kuro5hin.org, type = A, class = IN
paranoia.kuro5hin.org.	1d14h58m34s IN A  63.111.177.29
kuro5hin.org.		23m55s IN NS	dukat.thock.com.
kuro5hin.org.		23m55s IN NS	kira.thock.com.
kuro5hin.org.		23m55s IN NS	ns.intes.net.
kuro5hin.org.		23m55s IN NS	paranoia.kuro5hin.org.
dukat.thock.com.	1d23h51m23s IN A  142.165.167.48
kira.thock.com.		1d19h11m11s IN A  142.165.167.47
ns.intes.net.		1d23h52m50s IN A  63.111.177.5
paranoia.kuro5hin.org.	1d14h58m34s IN A  63.111.177.29
;; res_mkquery(0, 29.177.111.63.in-addr.arpa, 1, 12)
timeout
timeout



beland@Press-Your-Luck: dig @dukat.thock.com www.kuro5hin.org T_ANY C_ANY 

; <<>> DiG 8.1 <<>> @dukat.thock.com www.kuro5hin.org T_ANY C_ANY 
; (1 server found)
;; res options: init recurs defnam dnsrch
[hangs]



(5:02 ~) beland@Press-Your-Luck: dig @kira.thock.com www.kuro5hin.org T_ANY C_ANY

; <<>> DiG 8.1 <<>> @kira.thock.com www.kuro5hin.org T_ANY C_ANY 
; (1 server found)
;; res options: init recurs defnam dnsrch
[hangs]



(5:03 ~) beland@Press-Your-Luck: dig @kira.thock.com www.kuro5hin.org

; <<>> DiG 8.1 <<>> @kira.thock.com www.kuro5hin.org 
; (1 server found)
;; res options: init recurs defnam dnsrch
;; got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 10
;; flags: qr aa rd; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 3
;; QUERY SECTION:
;;	www.kuro5hin.org, type = A, class = IN

;; ANSWER SECTION:
www.kuro5hin.org.	1H IN CNAME	kuro5hin.org.

;; AUTHORITY SECTION:
kuro5hin.org.		1H IN NS	dukat.thock.com.
kuro5hin.org.		1H IN NS	kira.thock.com.
kuro5hin.org.		1H IN NS	ns.intes.net.
kuro5hin.org.		1H IN NS	paranoia.kuro5hin.org.

;; ADDITIONAL SECTION:
dukat.thock.com.	10M IN A	142.165.167.48
kira.thock.com.		10M IN A	142.165.167.47
paranoia.kuro5hin.org.	1H IN A		63.111.177.29

;; Total query time: 138 msec
;; FROM: Press-Your-Luck.mit.edu to SERVER: kira.thock.com  142.165.167.47
;; WHEN: Tue Jul 10 05:03:43 2001
;; MSG SIZE  sent: 34  rcvd: 193



beland@Press-Your-Luck: dig @paranoia.kuro5hin.org www.kuro5hin.org

; <<>> DiG 8.1 <<>> @paranoia.kuro5hin.org www.kuro5hin.org 
; (1 server found)
;; res options: init recurs defnam dnsrch
[hangs]

----paranoia.kuro5hin.org PING Statistics----
223 packets transmitted, 0 packets received, 100.0% packet loss
----dukat.thock.com PING Statistics----
4 packets transmitted, 4 packets received, 0.0% packet loss
round-trip min/avg/max = 163.429/163.985/164.587 ms


But:

beland@Press-Your-Luck: nslookup -d www.kuro5hin.org paranoia.kuro5hin.org
;; res_querydomain(paranoia.kuro5hin.org, <Nil>, 1, 1)
;; res_query(paranoia.kuro5hin.org, 1, 1)
;; res_mkquery(0, paranoia.kuro5hin.org, 1, 1)
;; res_send()
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 7709
;; flags: rd; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
;;	paranoia.kuro5hin.org, type = A, class = IN
;; Querying server (# 1) address = 127.0.0.1
;; got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 7709
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 4
;;	paranoia.kuro5hin.org, type = A, class = IN
paranoia.kuro5hin.org.	1d12h39m55s IN A  63.111.177.29
kuro5hin.org.		2m4s IN NS	dukat.thock.com.
kuro5hin.org.		2m4s IN NS	kira.thock.com.
kuro5hin.org.		2m4s IN NS	ns.intes.net.
kuro5hin.org.		2m4s IN NS	paranoia.kuro5hin.org.
dukat.thock.com.	1d23h24m52s IN A  142.165.167.48
kira.thock.com.		1d16h38m41s IN A  142.165.167.47
ns.intes.net.		1d23h30m47s IN A  63.111.177.5
paranoia.kuro5hin.org.	1d12h39m55s IN A  63.111.177.29
;; res_mkquery(0, 29.177.111.63.in-addr.arpa, 1, 12)
timeout


Finally, this command returns an instant answer:

beland@Press-Your-Luck: nslookup -d www.kuro5hin.org ns.intes.net 

;; res_querydomain(ns.intes.net, <Nil>, 1, 1)
;; res_query(ns.intes.net, 1, 1)
;; res_mkquery(0, ns.intes.net, 1, 1)
;; res_send()
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 43144
;; flags: rd; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0
;;	ns.intes.net, type = A, class = IN
;; Querying server (# 1) address = 127.0.0.1
;; got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 43144
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 3, ADDITIONAL: 3
;;	ns.intes.net, type = A, class = IN
ns.intes.net.		1d23h35m22s IN A  63.111.177.5
intes.NET.		1d23h35m22s IN NS  NS2.intes.NET.
intes.NET.		1d23h35m22s IN NS  NS1.intes.NET.
intes.NET.		1d23h35m22s IN NS  NS3.intes.NET.
NS2.intes.NET.		1d23h35m22s IN A  151.201.22.73
NS1.intes.NET.		1d23h35m22s IN A  63.111.177.4
NS3.intes.NET.		1d23h35m22s IN A  63.111.177.6
;; res_mkquery(0, 5.177.111.63.in-addr.arpa, 1, 12)
------------
Got answer:
    HEADER:
	opcode = QUERY, id = 43145, rcode = NOERROR
	header flags:  response, auth. answer, want recursion, recursion avail.
	questions = 1,  answers = 1,  authority records = 3,  additional = 3

    QUESTIONS:
	5.177.111.63.in-addr.arpa, type = PTR, class = IN
    ANSWERS:
    ->  5.177.111.63.in-addr.arpa
	name = mail-hub.intes.net
	ttl = 86400 (1D)
    AUTHORITY RECORDS:
    ->  177.111.63.in-addr.arpa
	nameserver = ns1.intes.net
	ttl = 86400 (1D)
    ->  177.111.63.in-addr.arpa
	nameserver = ns2.intes.net
	ttl = 86400 (1D)
    ->  177.111.63.in-addr.arpa
	nameserver = ns3.intes.net
	ttl = 86400 (1D)
    ADDITIONAL RECORDS:
    ->  ns1.intes.net
	internet address = 63.111.177.4
	ttl = 86400 (1D)
    ->  ns2.intes.net
	internet address = 151.201.22.73
	ttl = 86400 (1D)
    ->  ns3.intes.net
	internet address = 63.111.177.6
	ttl = 86400 (1D)

------------
Server:  mail-hub.intes.net
Address:  63.111.177.5

;; res_mkquery(0, www.kuro5hin.org, 1, 1)
------------
Got answer:
    HEADER:
	opcode = QUERY, id = 43146, rcode = NOERROR
	header flags:  response, want recursion, recursion avail.
	questions = 1,  answers = 2,  authority records = 4,  additional = 4

    QUESTIONS:
	www.kuro5hin.org, type = A, class = IN
    ANSWERS:
    ->  www.kuro5hin.org
	canonical name = kuro5hin.org
	ttl = 3293 (54m53s)
    ->  kuro5hin.org
	internet address = 209.208.150.45
	ttl = 1192 (19m52s)
    AUTHORITY RECORDS:
    ->  kuro5hin.org
	nameserver = dukat.thock.com
	ttl = 1192 (19m52s)
    ->  kuro5hin.org
	nameserver = kira.thock.com
	ttl = 1192 (19m52s)
    ->  kuro5hin.org
	nameserver = ns.intes.net
	ttl = 1192 (19m52s)
    ->  kuro5hin.org
	nameserver = paranoia.kuro5hin.org
	ttl = 1192 (19m52s)
    ADDITIONAL RECORDS:
    ->  dukat.thock.com
	internet address = 142.165.167.48
	ttl = 162024 (1d21h24s)
    ->  kira.thock.com
	internet address = 142.165.167.47
	ttl = 162024 (1d21h24s)
    ->  ns.intes.net
	internet address = 63.111.177.5
	ttl = 86400 (1D)
    ->  paranoia.kuro5hin.org
	internet address = 63.111.177.29
	ttl = 153920 (1d18h45m20s)

------------
Non-authoritative answer:
Name:    kuro5hin.org
Address:  209.208.150.45
Aliases:  www.kuro5hin.org


::whew::

-Beland

===============================================================
Christopher Beland - http://web.mit.edu/beland/www/contact.html
MIT STS/Course 6 (EECS)   -   MIT Athena User Interface Project              
The Talk of MIT   **   http://web.mit.edu/beland/talk/talk.html
===============================================================
