Reddit says Microsoft’s Bing, Anthropic, and Perplexity have scraped its data without permission. “It has been a real pain in the ass to block these companies.”

  • UnderpantsWeevil@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    An absolutely prodigious back catalog of high quality images, interviews, and explainers. A treasure trove of historical content that’s been heavily indexed and participant-weighted for relevancy. And the bulk of it predates the infestation of AI, so its valuable just as sampling data for further iterative development of ChatGPT and other LLMs.