How crawlers impact the operations of the Wikimedia projects

misk@sopuli.xyz · 1 day ago

TheImpressiveX@lemm.ee · 1 day ago

Why didn’t they just download the data dumps? Are they stupid?

LostXOR@fedia.io · 23 hours ago

Yes. Yes, they are.

I imagine they’re just using some generic web scraper for everything, and not taking any time at all to see if the sites they’re scraping have an easier way to access the data.

misk@sopuli.xyz · 1 day ago

They’re used to AI being an extremely inefficient use of resources.