Is Meta Scraping the Fediverse for AI?

sean.tilley

Fediverse News
August 11, 2025

A new report from Dropsite News makes the claim that Meta is allegedly scraping a large amount of independent sites for content to train their AI. What’s worse is that this scraping operation appears to completely disregard robots.txt, a control list used to tell crawlers, search engines, and bots which parts of a site should be accessed, and which parts should be avoided. It’s worth mentioning that the efficacy of such lists depend on the consuming software to honor this, and not every piece of software does.

Meta Denies All Wrongdoing

Andy Stone, a communications representative for Meta, has gone

Continue reading on We Distribute...