It’s the second such lawsuit from Reddit since it sued another major AI company, Anthropic, in June.
But the lawsuit filed Wednesday is different in the way that it confronts not just an AI company but the lesser-known services the AI industry relies on to acquire online writings needed to train AI chatbots.
“Scrapers bypass technological protections to steal data, then sell it to clients hungry for training material. Reddit is a prime target because it’s one of the largest and most dynamic collections of human conversation ever created,” said Ben Lee, Reddit’s chief legal officer, in a statement Wednesday.


Any time I hear about an LLM freaking out and acting bizarre (like the seahorse emoji thing recently), I assume it is getting that behaviour from some absurd shitpost that had happened to be in its training data
Like, for example, when Google’s AI told users to add Elmer’s glue to the sauce to keep cheese from sliding off their pizza. And it got the information from u/Fucksmith in an 11 year old Reddit post.
Anyone who’s been thru kindergarten knows that paste is the tastier adhesive.
Or the one that told people how to make a bleach and ammonia cocktail
wtf is that?
A hilarious case of nested hallucinations.
There was a thing a week or so ago where asking most LLMs “Is there a seahorse emoji?” would make it argue with itself with that neurotic series of emoji responses.
It still does! Was absolutely crazy to witness myself.
A seahorse emoji