cross-posted from: https://lemmy.dbzer0.com/post/41302017

https://societyofauthors.org/2025/04/01/soa-day-of-action-following-allegations-of-metas-mass-theft-of-authors-work/

The SoA is organising a day of protest against Meta following revelations of pirated books being used to train their large language models

On Thursday 20 March, The Atlantic broke the story of how Meta has used the Library Genesis (LIbGen) dataset, which is full of pirated material, to develop their AI systems.

The revelations detailed by The Atlantic come against the background of the recent government consultation into Artificial Intelligence (AI) and copyright and the #MakeItFair campaign which sees the UK creative industries fighting back against the proposed changes to copyright law, which would favour multinational tech companies, but irremediably damage the creative industries.

    • vegantomato@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      1 month ago

      Commercial/state-enforced AI crawlers overburdening services and forcing admins to increase cost and time spent dealing with these DDoS attacks is much closer to theft than the piracy itself. Piracy doesn’t make people lose money, AI crawlers do.

      If I host a website for the general public, I’m not paying money for 200 foreign AI crawlers to consume most of the bandwidth and CPU and leave legit users, whom I created the website for, with scraps. Even Wikipedia is feeling it.

      Many AI crawlers are immoral for other reasons as well, especially when we are talking about companies (Meta, Google) or states (CCP) doing it who are known for corporating with intelligence/defense or are engaged in human rights abuses.

      Turning this discussion to be about piracy is imo a distraction.