{"id":4974,"date":"2025-10-23T13:48:57","date_gmt":"2025-10-23T13:48:57","guid":{"rendered":"https:\/\/www.stanventures.com\/news\/?p=4974"},"modified":"2025-10-30T11:08:27","modified_gmt":"2025-10-30T11:08:27","slug":"reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content","status":"publish","type":"post","link":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/","title":{"rendered":"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content"},"content":{"rendered":"<p><b>Reddit has sued Perplexity AI and three data-scraping companies, accusing them of stealing millions of user posts to train artificial intelligence models. The lawsuit, filed October 22, 2025, in New York federal court, could set a major precedent for how human conversations online are treated in the age of AI.<\/b><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-4975\" src=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png\" alt=\"Reddit Sues Perplexity AI and Data Scrapers \" width=\"1536\" height=\"1024\" srcset=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png 1536w, https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM-300x200.png 300w, https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM-1024x683.png 1024w\" sizes=\"auto, (max-width: 1536px) 100vw, 1536px\" \/><\/p>\n<p><em><strong>Update:<\/strong> Perplexity AI has <a href=\"https:\/\/www.reddit.com\/r\/perplexity_ai\/comments\/1odpofv\/our_response_to_reddits_lawsuit\/\">responded<\/a> to Reddit\u2019s lawsuit, saying it does not train its models on Reddit data. The company claims it only summarizes Reddit discussions with citations, similar to how users share links. 
Reddit maintains that Perplexity and its scraping partners bypassed protections to access content, citing a fortyfold increase in Reddit references after a cease-and-desist notice.<\/em><\/p>\n<p>Reddit built its reputation as the internet\u2019s massive discussion board, a place where millions of people trade advice, share experiences, and argue about everything from cooking to coding. That same content has now become the focus of a major legal battle.<\/p>\n<p>This week, Reddit filed a <a href=\"https:\/\/fingfx.thomsonreuters.com\/gfx\/legaldocs\/xmpjezjawvr\/REDDIT%20PERPLEXITY%20LAWSUIT%20complaint.pdf\">lawsuit<\/a> against Perplexity AI and three companies (Oxylabs from Lithuania, AWMProxy from Russia, and SerpApi from Texas) accused of supplying it with scraped data.<\/p>\n<p>The lawsuit claims these companies secretly copied Reddit\u2019s user-generated posts through automated systems and sold that information to help train AI products without paying for it.<\/p>\n<p>The complaint describes the process as \u201cindustrial-scale data laundering.\u201d<\/p>\n<p>Reddit says the companies pretended to be ordinary web users, bypassed protective barriers, and gathered vast amounts of its data through Google\u2019s search results. The suit asks the court for damages and an injunction to stop the defendants from using Reddit\u2019s data again.<\/p>\n<p>Ben Lee, Reddit\u2019s chief legal officer, said the race to build powerful AI systems has pushed some developers to ignore boundaries. \u201cThere\u2019s enormous pressure to find high-quality human content,\u201d he said. 
\u201cThat pressure has created an underground market that thrives on stolen data.\u201d<\/p>\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-transparent ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\"><\/p>\n<span class=\"ez-toc-title-toggle\"><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#the-hidden-economy-of-data-scraping\" >The Hidden Economy of Data Scraping<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#perplexity-pushes-back\" >Perplexity Pushes Back<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#when-the-webs-%e2%80%9csharing-culture%e2%80%9d-meets-the-ai-gold-rush\" >When the Web\u2019s \u201cSharing Culture\u201d Meets the AI Gold Rush<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#what-makes-reddits-data-so-valuable\" >What Makes Reddit\u2019s Data So Valuable<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#the-unsettled-law-of-data-ownership\" >The Unsettled Law of Data Ownership<\/a><\/li><li class='ez-toc-page-1 
ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#what-reddit-users-should-take-from-this\" >What Reddit Users Should Take From This<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#what-to-watch-next\" >What to Watch Next<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#practical-advice\" >Practical Advice<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#key-takeaways\" >Key Takeaways<\/a><\/li><\/ul><\/nav><\/div>\n<h2><span class=\"ez-toc-section\" id=\"the-hidden-economy-of-data-scraping\"><\/span><b>The Hidden Economy of Data Scraping<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Scraping has existed for decades. It helped search engines like Google organize the web in the early days of the internet.<\/p>\n<p>Over time, a smaller group of companies started scraping Google itself, using the results to sell marketing insights or improve how other sites performed in search rankings. Then came the explosion of artificial intelligence.<\/p>\n<p>Suddenly, the kind of data Reddit holds became gold.<\/p>\n<p>AI systems need human language to learn how people think and talk. Companies that could supply that data discovered a lucrative business.<\/p>\n<p>SerpApi was one of them. Based in Austin, Texas, it built tools that scraped Google\u2019s results at a massive scale. 
Others followed, including Oxylabs and AWMProxy.<\/p>\n<p>According to Reddit\u2019s lawsuit, these firms shifted their focus to AI clients once tools like ChatGPT and Gemini made natural language data valuable. Their data packages allegedly included scraped Reddit posts that could be resold to companies like Perplexity.<\/p>\n<p>Reddit argues that this activity went far beyond what\u2019s acceptable.<\/p>\n<p>The company has banned scraping for years and now <a href=\"https:\/\/www.stanventures.com\/news\/reddit-wants-more-from-its-ai-deals-with-google-and-openai-4426\/\">charges for data access<\/a> through licensing deals.<\/p>\n<p>Google and OpenAI are among those who agreed to pay. Perplexity, Reddit says, did not.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"perplexity-pushes-back\"><\/span><b>Perplexity Pushes Back<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Perplexity rejected Reddit\u2019s accusations.<\/p>\n<p>It said the company had not been served with the lawsuit and that its methods were ethical.<\/p>\n<p>\u201cOur approach remains principled and responsible as we provide factual answers with accurate AI,\u201d Perplexity said in a public statement. \u201cWe will not tolerate threats against openness and the public interest.\u201d<\/p>\n<p>SerpApi\u2019s response was defiant as well. It claimed to have received no formal notice from Reddit and promised to fight the allegations in court.<\/p>\n<p>Denas Grybauskas from Oxylabs argued that no one should be allowed to claim ownership of public information. AWMProxy offered no comment.<\/p>\n<p>Reddit\u2019s legal team says it can prove the scraping happened.<\/p>\n<p>In its filing, the company describes setting a trap, a hidden Reddit post that could only be found by Google\u2019s crawler. Within hours, the post appeared in Perplexity search results. 
To Reddit, that was the smoking gun.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"when-the-webs-%e2%80%9csharing-culture%e2%80%9d-meets-the-ai-gold-rush\"><\/span><b>When the Web\u2019s \u201cSharing Culture\u201d Meets the AI Gold Rush<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>For much of the internet\u2019s history, scraping was considered part of the bargain. Websites got visibility, while search engines organized information and sent readers back. That balance has eroded.<\/p>\n<p>AI systems often take content without returning traffic or credit. They generate answers directly, leaving publishers and creators out of the loop.<\/p>\n<p>Doug Leeds, co-founder of the nonprofit <a href=\"https:\/\/www.stanventures.com\/news\/really-simple-licensing-rsl-makes-ai-firms-pay-for-content-4329\/\">Really Simple Licensing<\/a>, has watched that shift unfold. He said what once looked like a mutually beneficial system has become something else entirely. \u201cIt used to work because everyone involved made money somehow,\u201d he explained. \u201cNow, AI tools are consuming content without giving anything back.\u201d<\/p>\n<p>Media companies and publishers have started drawing their own lines.<\/p>\n<p>The <a href=\"https:\/\/www.nytimes.com\/2023\/12\/27\/business\/media\/new-york-times-open-ai-microsoft-lawsuit.html\">New York Times has sued OpenAI<\/a> and Microsoft for using its reporting to train models. Major book publishers, including Simon &amp; Schuster, have launched similar cases. Reddit\u2019s lawsuit joins that growing list, signaling that online communities are no longer willing to give away their data for free.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"what-makes-reddits-data-so-valuable\"><\/span><b>What Makes Reddit\u2019s Data So Valuable<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>More than 416 million people use Reddit each week. 
Its content spans nearly every human interest imaginable, from niche hobbies to personal struggles to global news. Those authentic exchanges are what make Reddit data so appealing to AI developers. It captures how real people communicate, argue, and ask questions.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-4976\" src=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/reddit-daily-active-users-960x730-1.webp\" alt=\"Reddit Daily Active Users \" width=\"960\" height=\"730\" srcset=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/reddit-daily-active-users-960x730-1.webp 960w, https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/reddit-daily-active-users-960x730-1-300x228.webp 300w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/><\/p>\n<p>Reddit began charging for access to its data in 2023. It now earns revenue through licensing deals with major tech firms, using those funds to support its operations and protect user content.<\/p>\n<p>But the company says it also spends tens of millions every year to stop unauthorized scraping.<\/p>\n<p>The lawsuit paints Perplexity as one of the worst offenders.<\/p>\n<p>It accuses the company of claiming compliance with robots.txt, a standard file that tells web crawlers what they can access, while continuing to scrape content in violation of Reddit\u2019s terms.<\/p>\n<p>After Reddit sent a cease-and-desist notice in May 2024, citations to Reddit content on Perplexity \u201crose fortyfold,\u201d according to the complaint.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"the-unsettled-law-of-data-ownership\"><\/span><b>The Unsettled Law of Data Ownership<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>No court has yet drawn a clear boundary around how public web data can be used to train AI. 
Some judges have leaned toward allowing scraping if the information is publicly visible.<\/p>\n<p>Others have sided with content owners, arguing that copying and repurposing data at scale crosses into infringement.<\/p>\n<p>That uncertainty makes Reddit\u2019s case especially significant. A win could strengthen the ability of platforms and publishers to control their data and demand payment. A loss might encourage more aggressive scraping, reinforcing the idea that \u201cpublicly available\u201d equals \u201cfree to use.\u201d<\/p>\n<p>The defendants are spread across multiple countries, which complicates enforcement even further. Still, Reddit has signaled it intends to pursue the case to the end, saying it has a duty to protect its users\u2019 contributions.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"what-reddit-users-should-take-from-this\"><\/span><b>What Reddit Users Should Take From This<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>People who post on Reddit may not think much about who reads their comments. But those comments help train AI systems that generate profit. The debate is about fairness. Should everyday conversations be treated as a free training ground for commercial products?<\/p>\n<p>Reddit\u2019s position is that licensing deals allow both sides to benefit. The company gets paid for its data, and partners receive high-quality content with clear permissions. Unauthorized scraping, it says, erases that balance and disrespects the time and creativity of its community.<\/p>\n<p>Critics counter that Reddit\u2019s move toward tighter control goes against its roots as an open platform. They see this as part of a broader trend of corporatizing the internet, where everything is fenced off and monetized. 
That tension between openness and ownership isn\u2019t going away.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"what-to-watch-next\"><\/span><b>What to Watch Next<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The case will likely move slowly through the courts, but its impact could arrive quickly.<\/p>\n<p>AI companies, social platforms, and regulators are watching closely. However the judge rules, this lawsuit will influence how others handle data collection and licensing.<\/p>\n<p>Reddit\u2019s leadership has made its stance clear. \u201cWe support innovation,\u201d a spokesperson said, \u201cbut respect for creators and communities isn\u2019t optional.\u201d<\/p>\n<p>If nothing else, the case has forced a public reckoning. The casual posts that people make every day now have measurable value in the AI economy. What happens to that value is what this lawsuit will decide.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"practical-advice\"><\/span><b>Practical Advice<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>To protect your content and navigate potential legal challenges, consider the following steps:<\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Review your website\u2019s <a href=\"https:\/\/www.stanventures.com\/blog\/robots-txt-guide\/\">robots.txt<\/a> settings to manage which crawlers can access your pages.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Monitor server traffic for automated scraping behavior.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Consider offering licensed data access if your content attracts commercial interest.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Keep records of scraping incidents; they may become evidence in future disputes.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Educate contributors or users about how their posts might be reused by third parties.<\/li>\n<\/ol>\n<h2><span class=\"ez-toc-section\" 
id=\"key-takeaways\"><\/span><b>Key Takeaways<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Reddit is suing Perplexity AI and three data-scraping firms for allegedly copying and reselling its user content.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">The case raises unresolved questions about who owns public online data used for AI training.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Perplexity and others deny wrongdoing, arguing that public data shouldn\u2019t be restricted.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">A court ruling could define new limits on how AI companies collect and use information.<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\">Reddit\u2019s users and creators are at the center, as their conversations have become valuable digital assets.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Reddit has sued Perplexity AI and three data-scraping companies, accusing them of stealing millions of user posts to train artificial intelligence models. The lawsuit, filed October 22, 2025, in New York federal court, could set a major precedent for how human conversations online are treated in the age of AI. 
Update: Perplexity AI has responded [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":4975,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-4974","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-seo"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Reddit Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures<\/title>\n<meta name=\"description\" content=\"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without permission.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures\" \/>\n<meta property=\"og:description\" content=\"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without permission.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/\" \/>\n<meta property=\"og:site_name\" content=\"Stan Ventures\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/StanVentures\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-10-23T13:48:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-30T11:08:27+00:00\" \/>\n<meta property=\"og:image\" 
content=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM-1024x683.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1024\" \/>\n\t<meta property=\"og:image:height\" content=\"683\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Zulekha\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@stanventures\" \/>\n<meta name=\"twitter:site\" content=\"@stanventures\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Zulekha\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/\"},\"author\":{\"name\":\"Zulekha\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/person\\\/fa7eddd27331b508c39dfd5ec581c0d1\"},\"headline\":\"Reddit Sues Perplexity AI and Data Scrapers for Stealing 
Content\",\"datePublished\":\"2025-10-23T13:48:57+00:00\",\"dateModified\":\"2025-10-30T11:08:27+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/\"},\"wordCount\":1531,\"publisher\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/10\\\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png\",\"articleSection\":[\"SEO\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/\",\"name\":\"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/10\\\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png\",\"datePublished\":\"2025-10-23T13:48:57+00:00\",\"dateModified\":\"2025-10-30T11:08:27+00:00\",\"description\":\"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without 
permission.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/10\\\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png\",\"contentUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/10\\\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png\",\"width\":1536,\"height\":1024,\"caption\":\"Reddit Sues Perplexity AI and Data Scrapers\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#website\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\",\"name\":\"Stan 
Ventures\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\",\"name\":\"Stan Ventures\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Stan-Ventures.webp\",\"contentUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Stan-Ventures.webp\",\"width\":2001,\"height\":801,\"caption\":\"Stan Ventures\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/StanVentures\\\/\",\"https:\\\/\\\/x.com\\\/stanventures\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/person\\\/fa7eddd27331b508c39dfd5ec581c0d1\",\"name\":\"Zulekha\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g\",\"caption\":\"Zulekha\"},\"description\":\"Zulekha is an 
emerging leader in the content marketing industry from India. She began her career in 2019 as a freelancer and, with over five years of experience, has made a significant impact in content writing. Recognized for her innovative approaches, deep knowledge of SEO, and exceptional storytelling skills, she continues to set new standards in the field. Her keen interest in news and current events, which started during an internship with The New Indian Express, further enriches her content. As an author and continuous learner, she has transformed numerous websites and digital marketing companies with customized content writing and marketing strategies.\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/author\\\/zulekha871_4\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures","description":"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without permission.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/","og_locale":"en_US","og_type":"article","og_title":"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures","og_description":"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without permission.","og_url":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/","og_site_name":"Stan 
Ventures","article_publisher":"https:\/\/www.facebook.com\/StanVentures\/","article_published_time":"2025-10-23T13:48:57+00:00","article_modified_time":"2025-10-30T11:08:27+00:00","og_image":[{"width":1024,"height":683,"url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM-1024x683.png","type":"image\/png"}],"author":"Zulekha","twitter_card":"summary_large_image","twitter_creator":"@stanventures","twitter_site":"@stanventures","twitter_misc":{"Written by":"Zulekha","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#article","isPartOf":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/"},"author":{"name":"Zulekha","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/person\/fa7eddd27331b508c39dfd5ec581c0d1"},"headline":"Reddit Sues Perplexity AI and Data Scrapers for Stealing Content","datePublished":"2025-10-23T13:48:57+00:00","dateModified":"2025-10-30T11:08:27+00:00","mainEntityOfPage":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/"},"wordCount":1531,"publisher":{"@id":"https:\/\/www.stanventures.com\/news\/#organization"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#primaryimage"},"thumbnailUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png","articleSection":["SEO"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/","url":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/","name":"Reddit 
Sues Perplexity AI and Data Scrapers for Stealing Content - Stan Ventures","isPartOf":{"@id":"https:\/\/www.stanventures.com\/news\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#primaryimage"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#primaryimage"},"thumbnailUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png","datePublished":"2025-10-23T13:48:57+00:00","dateModified":"2025-10-30T11:08:27+00:00","description":"Reddit sues Perplexity AI, Oxylabs, SerpApi, and AWMProxy, accusing them of scraping user posts for AI training without permission.","breadcrumb":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#primaryimage","url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png","contentUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/10\/ChatGPT-Image-Oct-23-2025-01_47_27-PM.png","width":1536,"height":1024,"caption":"Reddit Sues Perplexity AI and Data Scrapers"},{"@type":"BreadcrumbList","@id":"https:\/\/www.stanventures.com\/news\/reddit-sues-perplexity-ai-and-data-scrapers-for-stealing-content-4974\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.stanventures.com\/news\/"},{"@type":"ListItem","position":2,"name":"Reddit Sues Perplexity AI and Data Scrapers for 
Stealing Content"}]},{"@type":"WebSite","@id":"https:\/\/www.stanventures.com\/news\/#website","url":"https:\/\/www.stanventures.com\/news\/","name":"Stan Ventures","description":"","publisher":{"@id":"https:\/\/www.stanventures.com\/news\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.stanventures.com\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.stanventures.com\/news\/#organization","name":"Stan Ventures","url":"https:\/\/www.stanventures.com\/news\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/logo\/image\/","url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2024\/06\/Stan-Ventures.webp","contentUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2024\/06\/Stan-Ventures.webp","width":2001,"height":801,"caption":"Stan Ventures"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/StanVentures\/","https:\/\/x.com\/stanventures"]},{"@type":"Person","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/person\/fa7eddd27331b508c39dfd5ec581c0d1","name":"Zulekha","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5e254e5ddd7005852ee1623919a9ab39bced859841448a57960e7f8b855fdd52?s=96&d=mm&r=g","caption":"Zulekha"},"description":"Zulekha is an emerging leader in the content marketing industry from India. 
She began her career in 2019 as a freelancer and, with over five years of experience, has made a significant impact in content writing. Recognized for her innovative approaches, deep knowledge of SEO, and exceptional storytelling skills, she continues to set new standards in the field. Her keen interest in news and current events, which started during an internship with The New Indian Express, further enriches her content. As an author and continuous learner, she has transformed numerous websites and digital marketing companies with customized content writing and marketing strategies.","url":"https:\/\/www.stanventures.com\/news\/author\/zulekha871_4\/"}]}},"_links":{"self":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/4974","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/comments?post=4974"}],"version-history":[{"count":5,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/4974\/revisions"}],"predecessor-version":[{"id":5000,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/4974\/revisions\/5000"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/media\/4975"}],"wp:attachment":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/media?parent=4974"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/categories?post=4974"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/tags?post=4974"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}