[161404] in North American Network Operators' Group

home help back first fref pref prev next nref lref last post

Re: Bing/MSN/Microsoft contact

daemon@ATHENA.MIT.EDU (Jay Farrell)
Tue Mar 12 13:30:13 2013

In-Reply-To: <CAA5Ek4fRO06W4g+RWk=qvfR=UyXesf0HU1HC5ECtfGPaEeTvgg@mail.gmail.com>
From: Jay Farrell <jayfar@jayfar.com>
Date: Tue, 12 Mar 2013 13:29:40 -0400
To: Blair Trosper <blair.trosper@gmail.com>
Cc: nanog@nanog.org
Errors-To: nanog-bounces+nanog.discuss=bloom-picayune.mit.edu@nanog.org

This might help:

http://www.bing.com/webmaster/help/how-to-report-an-issue-with-bingbot-25c1=
9802

 How do I Report an Issue With Bingbot?

Bingbot is the name of the crawler used by Bing to crawl or =93spider=94 th=
e
web. It is Bingbot's job to find new and updated pages on websites across
the Internet, so that they can be processed for indexation. When crawling a
website, Bingbot looks at robots.txt for special instructions from the
website owner. Bingbot honors robots.txt directives, including the *
crawl-delay:* setting, and, in the absence of a crawl-delay, respects the
input from Webmasters in the Crawl Control Feature.

Generally, Bingbot does a good job at determining how frequent it should
visit pages on your site, taking robots.txt and Crawl Control rules and
hints into consideration. We call this =93Crawl Politeness=94. However, the=
re
still may be cases were you feel Bingbot is not polite enough and is
visiting your pages more than works for you (overcrawling).
 REPORTING OVERCRAWLING

Here=92s the steps you can follow if you think Bingbot is overcrawling your
site or not observing robots.txt rules:

   1. Verify that the bot traffic you are seeing is in fact from a valid
   Bingbot server. You can do this by not only looking at the User-Agent
   string (which can be easily spoofed by anyone) but also at the IP addres=
s
   and using the *Verify Bingbot
tool<http://www.bing.com/toolbox/verify-bingbot>
   * to get a verdict
   2. Once you have verified that this involves genuine Bingbot traffic,
   you can reduce crawler traffic as follows (if you haven=92t done so alre=
ady):
   3. Lower the speed of crawl during busy hours using the Crawl Control
   feature
   4. If that=92s not sufficient, add a crawl-delay: directive to your
   robots.txt: Bing supports whole number values ranging from 1 =96 30. Eac=
h
   number maps to the number of seconds we need to wait between requests, s=
o 1
   means a maximum of 1 request per 1 second =96 which is slow, but still
   adequate for smaller sites. 30 is extremely slow and means we are allowe=
d
   only 1 request per 30 seconds.
   5. If you have followed step 1 and 2 and the issue is still present, you
   can contact *Bing Webmaster
Support<http://go.microsoft.com/fwlink/p/?linkid=3D261881>
   *. Fill out the required fields and in the =93*What type of problem do y=
ou
   have?=94 *dropdown, select =93*Under-Crawling or Over-Crawling inquiry*=
=94 and
   describe the problem you are seeing. You can expect a reply within 24-48
   hours. When you report over-crawling issues, the support team will ask y=
ou
   to provide server log samples that show the Bingbot activity over a cert=
ain
   period of time in a next step, so make sure to have those ready.


On Tue, Mar 12, 2013 at 11:57 AM, Blair Trosper <blair.trosper@gmail.com>wr=
ote:

> If possible, I need someone from Microsoft/Bing (a la the MSN and Bing
> crawler bots) to contact me off list.
>
> Several IPs going back to AS8075 (with the user agent MSN and "bingbot")
> are basically attacking several IPs on my network with hundreds of reques=
ts
> per second.
>
> Thanks,
> Blair
>

home help back first fref pref prev next nref lref last post