[19346] in Privacy_Forum

[ PRIVACY Forum ] Controlling AI scraping

daemon@ATHENA.MIT.EDU (PRIVACY Forum mailing list)
Thu Sep 26 12:10:36 2024

Date: Thu, 26 Sep 2024 09:00:24 -0700
To: privacy-dist@vortex.com
Message-ID: <mailman.365.1727366425.1854.privacy@vortex.com>
From: PRIVACY Forum mailing list <privacy@vortex.com>
Reply-To: PRIVACY Forum mailing list <privacy@vortex.com>


Controlling AI scraping

Cloudflare's plan to give its users ways to block and/or monetize AI
scraping is interesting, but of course there are ethical and other
reasons to avoid using Cloudflare, since they continue to support some
of the most disreputable sites on the Net.

This does, however, suggest the concept of an open source mechanism that
provides the same sorts of features broadly (e.g., in conjunction with
Apache servers) to any site, anywhere. This could be paired with a
system that keeps sites updated about discovered source IP addresses of
AI scrapers that are not adhering to robots.txt directives. Sidenote:
Google announced an effort to expand robots.txt to better deal with AI
scraping issues, a concept I had suggested earlier. I signed up for
this, but have not heard another word about it since the earliest days.
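
As a rough illustration of how the two pieces above might fit together,
here is a minimal Python sketch, not a real implementation. It assumes a
shared feed of reported scraper networks (REPORTED_NETWORKS is a
hypothetical list, filled with documentation-only addresses) and a
made-up crawler name, ExampleAIBot. The decide() helper is likewise
hypothetical; a real deployment would hook something like it into the
web server rather than run it standalone.

```python
import ipaddress
from urllib.robotparser import RobotFileParser

# Hypothetical shared feed of networks reported as ignoring robots.txt.
# These are RFC 5737 documentation ranges, not real scraper addresses.
REPORTED_NETWORKS = ["192.0.2.0/24", "198.51.100.17/32"]

# Example site policy: one (made-up) AI crawler is excluded entirely.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def load_networks(entries):
    """Turn CIDR strings from the feed into network objects."""
    return [ipaddress.ip_network(entry) for entry in entries]


def load_robots(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def decide(ip, user_agent, path, networks, robots):
    """Return "block" or "allow" for a single incoming request."""
    addr = ipaddress.ip_address(ip)
    # 1. Source address appears in the shared list of reported scrapers.
    if any(addr in net for net in networks):
        return "block"
    # 2. The self-identified agent is requesting a path that robots.txt
    #    tells it not to fetch, i.e. it is not adhering to the directives.
    if not robots.can_fetch(user_agent, path):
        return "block"
    return "allow"


networks = load_networks(REPORTED_NETWORKS)
robots = load_robots(ROBOTS_TXT)

print(decide("192.0.2.55", "Mozilla/5.0", "/", networks, robots))
print(decide("203.0.113.9", "ExampleAIBot/1.0", "/page", networks, robots))
print(decide("203.0.113.9", "Mozilla/5.0", "/page", networks, robots))
```

Note that the second check only catches scrapers that identify
themselves honestly; ones that disguise their user agent are exactly
why a shared list of discovered source addresses would be needed.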

Time to get serious about controlling AI scraping. -L

 - - -
--Lauren--
Lauren Weinstein 
lauren@vortex.com (https://www.vortex.com/lauren)
Lauren's Blog: https://lauren.vortex.com
Mastodon: https://mastodon.laurenweinstein.org/@lauren
Founder: Network Neutrality Squad: https://www.nnsquad.org
         PRIVACY Forum: https://www.vortex.com/privacy-info
Co-Founder: People For Internet Responsibility
_______________________________________________
privacy mailing list
https://lists.vortex.com/mailman/listinfo/privacy
