| home | help | back | first | fref | pref | prev | next | nref | lref | last | post |
Date: Thu, 26 Sep 2024 09:00:24 -0700
To: privacy-dist@vortex.com
Content-Disposition: inline
MIME-Version: 1.0
Message-ID: <mailman.365.1727366425.1854.privacy@vortex.com>
From: PRIVACY Forum mailing list <privacy@vortex.com>
Reply-To: PRIVACY Forum mailing list <privacy@vortex.com>
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset="us-ascii"; Format="flowed"
Errors-To: privacy-bounces+privacy-forum=mit.edu@vortex.com
Controlling AI scraping
Cloudflare's plan to give its users ways to block and/or monetize AI
scraping is interesting, but of course there are ethical and other
reasons to avoid using Cloudflare, since they continue to support some
of the most disreputable sites on the Net.
This does however suggest the concept of an open source mechanism to
provide the same sorts of features broadly (e.g., in conjunction with
Apache servers) to any sites, anywhere. This could be paired with a
system to keep sites updated about discovered source IP addresses of AI
scrapers that are not adhering to robots.txt directives. Sidenote:
Google announced an effort to expand robots.txt to better deal with AI
scraping issues, a concept I had already earlier suggested. I signed up
for this, but never heard another word about it since the earliest days.
Time to get serious about controlling AI scraping. -L
- - -
--Lauren--
Lauren Weinstein
lauren@vortex.com (https://www.vortex.com/lauren)
Lauren's Blog: https://lauren.vortex.com
Mastodon: https://mastodon.laurenweinstein.org/@lauren
Founder: Network Neutrality Squad: https://www.nnsquad.org
PRIVACY Forum: https://www.vortex.com/privacy-info
Co-Founder: People For Internet Responsibility
_______________________________________________
privacy mailing list
https://lists.vortex.com/mailman/listinfo/privacy
| home | help | back | first | fref | pref | prev | next | nref | lref | last | post |