{"id":3986,"date":"2025-08-13T05:15:23","date_gmt":"2025-08-13T05:15:23","guid":{"rendered":"https:\/\/www.stanventures.com\/news\/?p=3986"},"modified":"2025-10-29T07:14:55","modified_gmt":"2025-10-29T07:14:55","slug":"reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns","status":"publish","type":"post","link":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/","title":{"rendered":"Reddit Blocks Internet Archive\u2019s Wayback Machine Over AI Scraping Concerns"},"content":{"rendered":"<p>Reddit has just made a decisive move and it is one that will reshape how the platform\u2019s history is preserved online.\u00a0<\/p>\n<p>Starting 12 August 2025, Reddit will block the Internet Archive\u2019s Wayback Machine from indexing most of its content, citing concerns that AI companies have been scraping archived Reddit pages to train their models.<\/p>\n<p>That is right, the Wayback Machine, a tool that has been quietly archiving billions of web pages for decades, will now only be able to see Reddit\u2019s homepage.\u00a0<\/p>\n<p>No post detail pages, no comment threads and no user profiles. The rest of the Reddit universe? 
Off-limits.<\/p>\n<p>This is about control over data, the fight against unauthorized AI training and the changing rules of what \u201cpublic internet\u201d really means.<\/p>\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_83 counter-hierarchy ez-toc-counter ez-toc-transparent ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\"><\/p>\n<span class=\"ez-toc-title-toggle\"><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#why-reddit-is-taking-this-step-now\" >Why Reddit Is Taking This Step Now<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#what-changes-are-actually-happening\" >What Changes Are Actually Happening?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#this-isnt-reddits-first-data-access-crackdown\" >This Isn\u2019t Reddit\u2019s First Data Access Crackdown<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#the-internet-archives-perspective\" >The Internet Archive\u2019s Perspective<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" 
href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#the-bigger-picture-ai-scraping-and-platform-control\" >The Bigger Picture: AI Scraping and Platform Control<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#examples-of-how-this-could-impact-the-web\" >Examples of How This Could Impact the Web<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#could-this-start-a-chain-reaction\" >Could This Start a Chain Reaction?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#where-this-leaves-users-and-the-open-web\" >Where This Leaves Users and the Open Web<\/a><\/li><\/ul><\/nav><\/div>\n<h2><span class=\"ez-toc-section\" id=\"why-reddit-is-taking-this-step-now\"><\/span><b>Why Reddit Is Taking This Step Now<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Reddit\u2019s spokesperson Tim Rathschmidt explained the reasoning in a<a href=\"https:\/\/www.theverge.com\/news\/757538\/reddit-internet-archive-wayback-machine-block-limit\"> statement to The Verge<\/a>:<\/p>\n<p>\u201cInternet Archive provides a service to the open web, but we\u2019ve been made aware of instances where AI companies violate platform policies, including ours and scrape data from the Wayback Machine.\u201d<\/p>\n<p>The key here is <b>policy violation<\/b>.\u00a0<\/p>\n<p>Reddit is not saying that archiving in itself is bad.\u00a0<\/p>\n<p>In fact, the company acknowledges the Wayback Machine\u2019s 
value as a historical resource.\u00a0<\/p>\n<p>But Reddit says that, until the Internet Archive can better defend its site from AI scrapers and ensure compliance with things like user privacy and the deletion of removed content, the platform is limiting access \u201cto protect redditors.\u201d<\/p>\n<h2><span class=\"ez-toc-section\" id=\"what-changes-are-actually-happening\"><\/span><b>What Changes Are Actually Happening?<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The block is not a total shutdown, but it is close.<\/p>\n<p><b>Before:<\/b> The Wayback Machine could crawl Reddit\u2019s post pages, comments and user profiles, meaning you could look back at discussions from years ago even if the original post was deleted.<\/p>\n<p><b>After:<\/b> It will only be able to index the Reddit.com homepage. Practically speaking, that means the Archive will only capture snapshots of trending posts and headlines from a given day, not the full conversations or individual user contributions behind them.<\/p>\n<p>The new limits will \u201cramp up\u201d starting today and, according to Rathschmidt, Reddit informed the Internet Archive before they took effect.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"this-isnt-reddits-first-data-access-crackdown\"><\/span><b>This Isn\u2019t Reddit\u2019s First Data Access Crackdown<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>If this feels familiar, it is because Reddit has been tightening its control over data for a while, and AI companies have been at the center of that story.<\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b><a href=\"https:\/\/en.wikipedia.org\/wiki\/Reddit_API_controversy\">2023 API Protests<\/a>:<\/b> Reddit announced controversial API changes that priced out many third-party app developers, leading to mass subreddit blackouts in protest. Reddit\u2019s defense? 
Too many companies were using its API to train AI models without permission.\n<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/www.stanventures.com\/news\/google-favors-forums-reddit-178\/\"><b>Google Deal:<\/b><\/a> In early 2024, Reddit struck a deal with Google, granting it paid access to Reddit data for both Search and AI training.\n<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>AI Partnerships and Lawsuits:<\/b> Reddit later struck a similar deal with OpenAI, but in June 2025, it sued Anthropic, alleging that the company continued scraping Reddit even after promising to stop.\n<\/li>\n<\/ul>\n<p>Seen in that light, the Wayback Machine block is part of a broader <b>pay-to-play approach<\/b> Reddit is adopting with data access. If companies want to use Reddit for AI, they are going to have to cut a check.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"the-internet-archives-perspective\"><\/span><b>The Internet Archive\u2019s Perspective<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The Internet Archive has not responded with outrage, at least not publicly. 
Mark Graham, director of the Wayback Machine, told The Verge:<\/p>\n<p>\u201cWe have a longstanding relationship with Reddit and continue to have ongoing discussions about this matter.\u201d<\/p>\n<p>It\u2019s a diplomatic response, but I cannot help wondering how much this changes the Archive\u2019s mission.\u00a0<\/p>\n<p>The Internet Archive exists to preserve the open web, but what happens when major sites like Reddit start redefining \u201copen\u201d to exclude anything AI companies could use?<\/p>\n<h2><span class=\"ez-toc-section\" id=\"the-bigger-picture-ai-scraping-and-platform-control\"><\/span><b>The Bigger Picture: AI Scraping and Platform Control<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>This move highlights a growing trend: big platforms no longer see public web content as \u201cfree\u201d for anyone to collect and repurpose.<\/p>\n<p>In the past, crawling public web pages was the norm: search engines did it, archives did it, and researchers relied on it.\u00a0<\/p>\n<p>But AI training has changed the stakes. Training a <a href=\"https:\/\/www.stanventures.com\/news\/ai-traffic-surges-527-in-2025-seo-strategies-face-a-radical-rewrite-3836\/\">large language model (LLM)<\/a> requires massive datasets, and platforms like Reddit are treasure troves of human conversation, opinions and cultural moments.<\/p>\n<p>The problem? AI companies can scrape that content once and use it forever without compensating the source.\u00a0<\/p>\n<p>From Reddit\u2019s point of view, that is both a loss of control and a missed revenue opportunity.<\/p>\n<p><b>Privacy and Authenticity Concerns<\/b><\/p>\n<p>There is also the privacy angle.\u00a0<\/p>\n<p>When a post is deleted on Reddit, users often expect it to disappear completely. 
But the Wayback Machine\u2019s snapshots can preserve it, sometimes indefinitely.<\/p>\n<p>From Reddit\u2019s perspective, limiting the Archive\u2019s access helps enforce those expectations.\u00a0<\/p>\n<p>And from a user trust standpoint, that makes sense. Imagine venting about a personal crisis on Reddit, deleting the post later, and finding that it still lives in the Archive years down the line, now available for an AI model to study.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"examples-of-how-this-could-impact-the-web\"><\/span><b>Examples of How This Could Impact the Web<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Researchers:<\/b> Academics studying internet culture have long used Reddit\u2019s archived pages to track trends and analyze public discourse. With this block, their historical datasets could shrink dramatically.\n<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Journalists:<\/b> Reporters who use the Wayback Machine to verify deleted Reddit posts in breaking news situations will lose that tool for post-level verification.\n<\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Casual Users:<\/b> People who revisit old Reddit threads for nostalgia or reference will find fewer preserved discussions.<\/li>\n<\/ul>\n<p>The irony?\u00a0<\/p>\n<p>AI companies with the resources to pay for direct Reddit access, like Google and OpenAI, will still be able to train on the data. The restriction primarily impacts free archival access, not corporate AI partnerships.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"could-this-start-a-chain-reaction\"><\/span><b>Could This Start a Chain Reaction?<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>It is worth asking: will other major platforms follow Reddit\u2019s lead?<\/p>\n<p>We have already seen X limit API access, LinkedIn sue data scrapers and news publishers strike licensing deals with AI firms. 
If Reddit\u2019s strategy of charging AI companies while tightening free archival access works, others might adopt the same model.<\/p>\n<p>That could fundamentally change the nature of digital preservation. The Wayback Machine thrives on openness.\u00a0<\/p>\n<p>If site after site blocks it in the name of AI control, the historical record of the internet could become increasingly fragmented.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"where-this-leaves-users-and-the-open-web\"><\/span><b>Where This Leaves Users and the Open Web<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>From a \u201clet\u2019s see\u201d perspective, I think this is a defining moment.<\/p>\n<p>On one hand, Reddit\u2019s move makes sense as a way to protect user privacy, stop unauthorized AI training, and control how data is monetized.\u00a0<\/p>\n<p>On the other hand, it chips away at the principle that the internet\u2019s public spaces should be preserved for future generations.<\/p>\n<p>The tension between platform control and open access is not going away.\u00a0<\/p>\n<p>And as AI companies push harder for more training data, these battles will likely become more common.<\/p>\n<p>For now, the reality is simple: If you want to see an old Reddit post, you will need to hope it is still live on Reddit, because the Wayback Machine probably won\u2019t have it.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Reddit has just made a decisive move and it is one that will reshape how the platform\u2019s history is preserved online.\u00a0 Starting 12 August 2025, Reddit will block the Internet Archive\u2019s Wayback Machine from indexing most of its content, citing concerns that AI companies have been scraping archived Reddit pages to train their models. 
That [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3988,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-3986","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-seo"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.6 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Reddit Blocks Wayback Machine Over AI Scraping<\/title>\n<meta name=\"description\" content=\"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Reddit Blocks Wayback Machine Over AI Scraping\" \/>\n<meta property=\"og:description\" content=\"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/\" \/>\n<meta property=\"og:site_name\" content=\"Stan Ventures\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/StanVentures\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-08-13T05:15:23+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-29T07:14:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif\" 
\/>\n\t<meta property=\"og:image:width\" content=\"1600\" \/>\n\t<meta property=\"og:image:height\" content=\"900\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Dileep Thekkethil\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@dthekkethil\" \/>\n<meta name=\"twitter:site\" content=\"@stanventures\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Dileep Thekkethil\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/\"},\"author\":{\"name\":\"Dileep Thekkethil\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/person\\\/87d00ff18daf9650e7c925ae4bf86efb\"},\"headline\":\"Reddit Blocks Internet Archive\u2019s Wayback Machine Over AI Scraping 
Concerns\",\"datePublished\":\"2025-08-13T05:15:23+00:00\",\"dateModified\":\"2025-10-29T07:14:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/\"},\"wordCount\":1169,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/08\\\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif\",\"articleSection\":[\"SEO\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/\",\"name\":\"Reddit Blocks Wayback Machine Over AI 
Scraping\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/08\\\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif\",\"datePublished\":\"2025-08-13T05:15:23+00:00\",\"dateModified\":\"2025-10-29T07:14:55+00:00\",\"description\":\"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/08\\\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif\",\"contentUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2025\\\/08\\\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif\",\"width\":1600,\"height\":900,\"caption\":\"Reddit Blocks Internet Archive\u2019s Wayback 
Machine\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Reddit Blocks Internet Archive\u2019s Wayback Machine Over AI Scraping Concerns\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#website\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\",\"name\":\"Stan Ventures\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#organization\",\"name\":\"Stan Ventures\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Stan-Ventures.webp\",\"contentUrl\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Stan-Ventures.webp\",\"width\":2001,\"height\":801,\"caption\":\"Stan 
Ventures\"},\"image\":{\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/StanVentures\\\/\",\"https:\\\/\\\/x.com\\\/stanventures\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/#\\\/schema\\\/person\\\/87d00ff18daf9650e7c925ae4bf86efb\",\"name\":\"Dileep Thekkethil\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g\",\"caption\":\"Dileep Thekkethil\"},\"description\":\"Dileep Thekkethil is the Director of Marketing at Stan Ventures, where he applies over 15 years of SEO and digital marketing expertise to drive growth and authority. A former journalist with six years of experience, he combines strategic storytelling with technical know-how to help brands navigate the shift toward AI-driven search and generative engines. Dileep is a strong advocate for Google\u2019s EEAT standards, regularly sharing real-world use cases and scenarios to demystify complex marketing trends. He is an avid gardener of tropical fruits, a motor enthusiast, and a dedicated caretaker of his pair of cockatiels.\",\"sameAs\":[\"https:\\\/\\\/stanventures.com\\\/news\",\"https:\\\/\\\/www.linkedin.com\\\/in\\\/dileep-pradeep-3705aa53\\\/\",\"https:\\\/\\\/x.com\\\/dthekkethil\"],\"url\":\"https:\\\/\\\/www.stanventures.com\\\/news\\\/author\\\/admin_7mxgn8tx\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Reddit Blocks Wayback Machine Over AI Scraping","description":"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/","og_locale":"en_US","og_type":"article","og_title":"Reddit Blocks Wayback Machine Over AI Scraping","og_description":"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.","og_url":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/","og_site_name":"Stan Ventures","article_publisher":"https:\/\/www.facebook.com\/StanVentures\/","article_published_time":"2025-08-13T05:15:23+00:00","article_modified_time":"2025-10-29T07:14:55+00:00","og_image":[{"width":1600,"height":900,"url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif","type":"image\/jpeg"}],"author":"Dileep Thekkethil","twitter_card":"summary_large_image","twitter_creator":"@dthekkethil","twitter_site":"@stanventures","twitter_misc":{"Written by":"Dileep Thekkethil","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#article","isPartOf":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/"},"author":{"name":"Dileep Thekkethil","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/person\/87d00ff18daf9650e7c925ae4bf86efb"},"headline":"Reddit Blocks Internet Archive\u2019s Wayback Machine Over AI Scraping Concerns","datePublished":"2025-08-13T05:15:23+00:00","dateModified":"2025-10-29T07:14:55+00:00","mainEntityOfPage":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/"},"wordCount":1169,"commentCount":0,"publisher":{"@id":"https:\/\/www.stanventures.com\/news\/#organization"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#primaryimage"},"thumbnailUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif","articleSection":["SEO"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/","url":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/","name":"Reddit Blocks Wayback Machine Over AI 
Scraping","isPartOf":{"@id":"https:\/\/www.stanventures.com\/news\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#primaryimage"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#primaryimage"},"thumbnailUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif","datePublished":"2025-08-13T05:15:23+00:00","dateModified":"2025-10-29T07:14:55+00:00","description":"Reddit limits Wayback Machine access to stop AI scraping, blocking post archives while keeping the homepage open.","breadcrumb":{"@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#primaryimage","url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif","contentUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2025\/08\/Reddit-Blocks-Internet-Archives-Wayback-Machine.avif","width":1600,"height":900,"caption":"Reddit Blocks Internet Archive\u2019s Wayback Machine"},{"@type":"BreadcrumbList","@id":"https:\/\/www.stanventures.com\/news\/reddit-blocks-internet-archives-wayback-machine-over-ai-scraping-concerns-3986\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.stanventures.com\/news\/"},{"@type":"ListItem","position":2,"name":"Reddit Blocks Internet Archive\u2019s 
Wayback Machine Over AI Scraping Concerns"}]},{"@type":"WebSite","@id":"https:\/\/www.stanventures.com\/news\/#website","url":"https:\/\/www.stanventures.com\/news\/","name":"Stan Ventures","description":"","publisher":{"@id":"https:\/\/www.stanventures.com\/news\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.stanventures.com\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.stanventures.com\/news\/#organization","name":"Stan Ventures","url":"https:\/\/www.stanventures.com\/news\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/logo\/image\/","url":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2024\/06\/Stan-Ventures.webp","contentUrl":"https:\/\/www.stanventures.com\/news\/wp-content\/uploads\/2024\/06\/Stan-Ventures.webp","width":2001,"height":801,"caption":"Stan Ventures"},"image":{"@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/StanVentures\/","https:\/\/x.com\/stanventures"]},{"@type":"Person","@id":"https:\/\/www.stanventures.com\/news\/#\/schema\/person\/87d00ff18daf9650e7c925ae4bf86efb","name":"Dileep Thekkethil","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/911bd385b9da54d4a69f19f536a6419e576244371bd6e7d96f06c583dd402fa9?s=96&d=mm&r=g","caption":"Dileep Thekkethil"},"description":"Dileep Thekkethil is the Director of Marketing at Stan Ventures, where he applies over 15 years of SEO and 
digital marketing expertise to drive growth and authority. A former journalist with six years of experience, he combines strategic storytelling with technical know-how to help brands navigate the shift toward AI-driven search and generative engines. Dileep is a strong advocate for Google\u2019s EEAT standards, regularly sharing real-world use cases and scenarios to demystify complex marketing trends. He is an avid gardener of tropical fruits, a motor enthusiast, and a dedicated caretaker of his pair of cockatiels.","sameAs":["https:\/\/stanventures.com\/news","https:\/\/www.linkedin.com\/in\/dileep-pradeep-3705aa53\/","https:\/\/x.com\/dthekkethil"],"url":"https:\/\/www.stanventures.com\/news\/author\/admin_7mxgn8tx\/"}]}},"_links":{"self":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/3986","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/comments?post=3986"}],"version-history":[{"count":1,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/3986\/revisions"}],"predecessor-version":[{"id":3989,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/posts\/3986\/revisions\/3989"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/media\/3988"}],"wp:attachment":[{"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/media?parent=3986"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/categories?post=3986"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.stanventures.com\/news\/wp-json\/wp\/v2\/tags?post=3986"}],"curies":[{"name"
:"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}