cantankerous_cashew@lemmy.world to Technology@lemmy.world · English · 4 months ago

Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal

www.wired.com

  • cross-posted to:
  • technology@lemmy.world
One of the most important AI copyright legal battles just took a major turn.
  • rumba@lemmy.zip · English · ↑ 88 · 4 months ago

    The notorious piracy database in question is Library Genesis.

    Cached article:

    https://web.archive.org/web/20250110075821/https://www.wired.com/story/new-documents-unredacted-meta-copyright-ai-lawsuit/

    • CriticalMiss@lemmy.world · English · ↑ 15 · 4 months ago

      Earlier reports suggested they trained it on books from Bibliotik.

      What changed?

      • halcyoncmdr@lemmy.world · English · ↑ 25 · 4 months ago

        Probably just both honestly.

        • rumba@lemmy.zip · English · ↑ 1 · 4 months ago

          In for a penny, in for a pound.

      • BetaDoggo_@lemmy.world · English · ↑ 3 · 4 months ago

        The LLaMA-1 paper acknowledged the use of the books dataset. Libgen isn’t mentioned in any of the papers, so this is new info.
