In a detailed review of 1.7 million web pages across 18 major websites, SEO platform Indexing Insight has found that the vast majority of de-indexed pages, 88%, are being removed not due to bugs or crawl issues, but because of issues with content quality.
That’s right. Google is actively forgetting pages it deems unworthy, regardless of how much traffic or revenue they generate.

The Numbers Don’t Lie
Indexing Insight, using Google’s own URL Inspection API, analyzed sites of all sizes from niche ecommerce stores to high-authority marketplaces and content platforms.
The outcome was the same across the board, that is, most pages missing from the search were removed for quality reasons.
That includes:
- Pages already submitted via XML sitemaps
- Content flagged as essential by site owners
- URLs tracked daily for indexing performance
These weren’t forgotten back pages or archives, but revenue-generating, high-priority assets. And yet, Google still removed them from its index.
Who’s Losing the Most Ground?
Not all websites are affected equally. Some are bleeding more visibility than others:
- Marketplace and listing platforms had the worst index coverage, often below 70%.
- Ecommerce websites didn’t fare much better, with many struggling to reach the 90% threshold considered healthy.
- News publishers emerged as the winners, showing 97% index coverage, thanks to fresh content and structured updates.
Even more troubling is that brand size offers no protection. When Moz Brand Authority scores were factored in, both small and large brands reported the same issue: critical pages were still getting wiped out.
This runs counter to long-held beliefs of SEO. Authority, it turns out, can’t save you from Google’s new standards.
The Quality Filter: Google’s New Line in the Sand
So what’s actually causing these removals? According to Indexing Insight’s classification:
- “Actively removed” pages are those that were once indexed but have since been kicked out.
- “Forgotten” pages never made it in and are flagged as “Crawled – currently not indexed” or worse, “URL is unknown to Google.”
These outcomes reflect Google’s increasingly selective approach to indexing. The engine is applying stricter judgments about whether content adds original value.
Product pages with reused descriptions? Gone. Listing pages with thin content? Forgotten. Even dynamic landing pages created for ad campaigns aren’t safe if they lack distinctiveness.
And the problem is often invisible. Google Search Console often buries these issues under vague warnings or fails to report them altogether. Many site owners are flying blind.
What Changed? Look at the March Core Update
The study highlights an important catalyst, that is, Google’s March 2025 core update.
This update changed how Google treats content at the index level. Pages that once held firm in the search ecosystem are now disappearing entirely—not dropping in rank, but vanishing from results.
You can’t optimize for visibility without being indexed first. And while algorithm updates always emphasize “content quality,” the emerging pattern shows that indexing itself is now part of the penalty.
This forces site owners to rethink how they evaluate the success of SEO. Rankings and traffic alone aren’t enough; index presence is now a metric that can’t be ignored.
Here’s What You Can Do—Immediately
If Google is wiping out your most important content, you need to respond fast and strategically.
Here’s where to start:
1. Audit Index Presence, Not Just Rankings
Use indexing intelligence tools to track which pages are visible in Google and which are slipping away.
2. Clean Up Your XML Sitemaps
Remove low-value URLs. Every page in your sitemap should be unique, useful, and worthy of inclusion.
3. Prioritize High-Impact Content
Don’t spread your resources thin. Focus on pages that drive traffic, conversions, or contain original insight.
4. Strengthen Page Value
Enhance content with data, expert input, FAQs, or rich media. Make every page undeniably worth indexing.
5. Track Core Update Timing
Align traffic drops and index shifts with Google’s algorithm updates to pinpoint the real cause.
What This Signals for the Future of SEO
This change represents a structural evolution of how Google surfaces content. Indexing is no longer guaranteed. It’s earned.
Forget everything you thought about crawl budget or technical SEO as the main roadblocks. Today, the single most important factor is whether your content lives up to Google’s unspoken but increasingly strict definition of quality.
And this won’t stop with product listings or low-authority domains. As generative AI floods the web with near-duplicate pages and lazy content, Google is building a wall, allowing only high-quality, original, and human-first pages to pass through.
If you’re not actively defending your content’s place in Google’s index, you’re already losing.
Key Takeaways
- 88% of de-indexed pages were removed due to quality, not technical errors.
- Big brands are equally affected. Authority alone offers no protection.
- Marketplace and ecommerce websites have the worst indexing performance.
- Google’s March core update contributed to mass removals of important content.
Indexing is no longer automatic—Google is actively curating what it includes.
Dileep Thekkethil
AuthorDileep Thekkethil is the Director of Marketing at Stan Ventures and an SEMRush certified SEO expert. With over a decade of experience in digital marketing, Dileep has played a pivotal role in helping global brands and agencies enhance their online visibility. His work has been featured in leading industry platforms such as MarketingProfs, Search Engine Roundtable, and CMSWire, and his expert insights have been cited in Google Videos. Known for turning complex SEO strategies into actionable solutions, Dileep continues to be a trusted authority in the SEO community, sharing knowledge that drives meaningful results.