- AI Weekly
- Posts
- Perplexity just got SUED
Perplexity just got SUED
The Gold standard for AI news
AI keeps coming up at work, but you still don't get it?
That's exactly why 1M+ professionals working at Google, Meta, and OpenAI read Superhuman AI daily.
Here's what you get:
Daily AI news that matters for your career - Filtered from 1000s of sources so you know what affects your industry.
Step-by-step tutorials you can use immediately - Real prompts and workflows that solve actual business problems.
New AI tools tested and reviewed - We try everything to deliver tools that drive real results.
All in just 3 minutes a day
Reddit Caught an AI Company Stealing—And It's About to Get Expensive
Yesterday, Reddit sued Perplexity AI for what it's calling "data laundering." The same day, OpenAI announced a $15 billion data center in Wisconsin. These stories are connected in a way that reveals the entire AI economy's dirty secret.
Let's start with the heist.
The Sting Operation
Reddit set a trap. They created a test post that only Google's crawler could access—nowhere else on the internet. Within hours, Perplexity's AI was spitting out that exact content in response to queries.
"The only way that Perplexity could have obtained that Reddit content is if it and/or its co-defendants scraped Google [search results] for that Reddit content," the lawsuit states. Caught red-handed.
Here's the kicker: Reddit had already sent Perplexity a cease-and-desist letter in May 2024. Perplexity responded claiming they'd respect Reddit's robots.txt file and weren't using Reddit for training. What happened next? Citations to Reddit on Perplexity's platform increased 40-fold.
That's not compliance. That's a middle finger.
Data Laundering: The New Crime
Reddit isn't just suing Perplexity. They're going after the entire supply chain—three data-scraping companies that form what Reddit calls the "data laundering" economy:
Oxylabs: A Lithuanian scraping operation
AWMProxy: What Reddit describes as a "former Russian botnet"
SerpApi: A Texas startup that scrapes Google search results in real-time
The scheme works like this: These companies can't scrape Reddit directly because Reddit blocks them. So they scrape Google search results instead, extracting Reddit content that Google has indexed. Then they sell that "laundered" data to AI companies like Perplexity.
Reddit's chief legal officer Ben Lee puts it perfectly: They're "would-be bank robbers" who, unable to access the vault, break into the armored truck instead.
Why This Actually Matters
Perplexity is valued at $20 billion. They raised $200 million in September 2025 at that valuation, up from $500 million at the start of 2024. Their annual recurring revenue hit $150 million—quadrupling in a year.
And according to Reddit's lawsuit, "Perplexity's business model is effectively to take Reddit's content from Google search results, feed them into a third party's LLM, and call it a new product."
That $20 billion valuation? It's built on stolen content.
Meanwhile, the companies who play by the rules are paying massive fees. Google signed a $60 million per year deal with Reddit in February 2024. OpenAI's paying an estimated $70 million. These licensing agreements make up 10% of Reddit's revenue.
So you've got two tiers: the companies paying tens of millions annually for legitimate access, and the ones scraping it for free while their valuations explode.
The AI Data Crisis
Here's what's actually happening in AI right now: Every company is locked in an arms race for quality training data. And they're running out.
Reddit is valuable because it's authentic human conversation—real people with genuine opinions, organized by topic, ranked by humans rather than algorithms. That's gold for AI training. Reddit went public at a $6.4 billion valuation specifically because of this data goldmine.
But Perplexity isn't alone in getting sued. The list of their legal troubles is growing:
Dow Jones and NYP Holdings sued them in October 2024 for scraping The Wall Street Journal and New York Post
The New York Times sent a cease-and-desist in October 2024
The BBC threatened legal action in June 2025
Forbes and Wired have accused them of plagiarism
A Copyleaks analysis found Perplexity paraphrased 48% of sample articles. In one case, they plagiarized 7% of content directly.
And Reddit already sued Anthropic (maker of Claude) in June 2025 for the same thing. That case has a hearing scheduled for January 2026.
The Infrastructure Irony
Here's where it gets interesting. The same day Reddit filed this lawsuit, OpenAI announced the Lighthouse campus in Wisconsin—a $15 billion data center providing close to a gigawatt of AI capacity, scheduled for completion in 2028.
This is part of the Stargate Project: a $500 billion, four-year plan to build 10 gigawatts of AI data center capacity. For context, that's enough to power 7.5 million homes. OpenAI will run over 2 million chips across this infrastructure.
Think about the asymmetry here. OpenAI and its partners are willing to drop $500 billion on physical infrastructure. They'll build entire campuses, create thousands of jobs, develop zero-emission energy sources.
But paying content creators for their data? That's where companies suddenly get creative with "data laundering" schemes and scrapers disguised as legitimate indexing.
What Perplexity Says
Perplexity's defense is predictable: "We will always fight vigorously for users' rights to freely and fairly access public knowledge." They claim they're not scraping for training but "indexing web pages and surfacing factual content" with citations.
They argue "no single organization owns the copyright over facts."
Technically true. But completely missing the point.
Reddit isn't claiming copyright over facts. They're claiming that circumventing their technical protections, ignoring their robots.txt file, and building a $20 billion business on their content without compensation is theft—regardless of whether individual facts are copyrightable.
The Real Question
The Reddit lawsuit is asking something fundamental: If you're willing to spend $500 billion on infrastructure, why aren't you willing to pay for the data that makes that infrastructure valuable?
Reddit spent "tens of millions of dollars" on anti-scraping technology. They've established clear licensing terms. They've built legitimate relationships with major AI companies. And Perplexity allegedly looked at all that and decided the scrapers and bots were a better deal.
That's not innovation. That's just theft with a fancy valuation.
Major News Outlet Coverage
Reuters – Reddit sues Perplexity for scraping data to train AI system
https://www.reuters.com/world/reddit-sues-perplexity-scraping-data-train-ai-system-2025-10-22/Associated Press (AP News) – Reddit sues Perplexity, others for user comment scraping
https://apnews.com/article/reddit-perplexity-ai-copyright-scraping-lawsuit-3ad8968550dd7e11bcd285a74fb6e2ffBloomberg – Reddit Sues Perplexity, Others Over Alleged Data Scraping
https://www.bloomberg.com/news/articles/2025-10-22/reddit-sues-perplexity-others-over-alleged-data-scrapingThe New York Times – Reddit Accuses ‘Data Scraper’ Companies of Theft
https://www.nytimes.com/2025/10/22/technology/reddit-data-scrapers-perplexity-theft.htmlPBS NewsHour – Reddit sues AI company over alleged ‘industrial‑scale’ scraping of its users’ comments
https://www.pbs.org/newshour/nation/reddit-sues-ai-company-over-alleged-industrial-scale-scraping-of-its-users-comments
Business and Tech Publications
Business Insider – Reddit lawsuit accuses Perplexity AI firms of stealing data from Google results
https://www.businessinsider.com/reddit-lawsuit-perplexity-ai-firms-data-scrapers-scraping-google-2025-10Financial Post – Reddit sues AI company Perplexity and others for ‘industrial‑scale’ scraping of user comments
https://business.financialpost.com/pmn/reddit-sues-ai-company-perplexity-and-others-for-industrial-scale-scraping-of-user-commentsYahoo News Canada – Reddit Sues Perplexity, Other AI Companies for Scraping User Comments
https://ca.news.yahoo.com/reddit-sues-perplexity-other-ai-230650694.htmlSearch Engine Land – Reddit sues Perplexity, SerpApi over scraping Google
https://searchengineland.com/reddit-sues-perplexity-serpapi-scraping-google-463681Axios – Reddit sues Perplexity and data scraping firms
https://www.axios.com/2025/10/22/reddit-suing-perplexity-data-scraping
Additional and Regional Coverage
ABC News (Technology Wire) – Reddit sues over ‘industrial‑scale’ scraping of user comments
https://abcnews.go.com/Technology/wireStory/reddit-sues-ai-company-perplexity-industrial-scale-scraping-126768844Halifax City News – Reddit sues AI company Perplexity and others for ‘industrial‑scale’ scraping of user comments
https://halifax.citynews.ca/2025/10/22/reddit-sues-ai-company-perplexity-and-others-for-industrial-scale-scraping-of-user-commentsA.V. Club – Reddit files lawsuit against Perplexity AI
https://www.avclub.com/reddit-lawsuit-perplexity-data-scrapersCTV News (Associated Press syndication) – Reddit sues AI company Perplexity and others for ‘industrial‑scale’ scraping of user comments
https://www.ctvnews.ca/business/article/reddit-sues-ai-company-perplexity-and-others-for-industrial-scale-scraping-of-user-comments/
These URLs collectively document the key filings, allegations, and legal framing of Reddit Inc. v. Perplexity AI Inc. et al., filed October 22, 2025, in the Southern District of New York.
Reply