crawlmouse

How Deep Is Too Deep? Crawl Depth Limits and How to Fix Buried Pages

·8 min read

You published a page weeks ago. It’s genuinely good — useful, well-written, exactly what someone searching would want. And it’s getting nothing. No rankings, barely any crawls, as if Google never noticed it exists. Before you blame the content, ask a simpler question: how many clicks does it take to reach that page from your homepage? If the answer is “more than I’d like,” you’ve probably found the problem.

That number — clicks from the homepage — is the page’s crawl depth (also called click depth). This guide answers the question everyone actually has once they understand it: not “what is crawl depth,” but how deep is too deep — and exactly what to do about the pages that have slipped too far.

What counts as “too deep”

Here’s the honest version, because most articles oversimplify it. There is no universal number Google publishes, and depth is not a direct ranking penalty. A page doesn’t get demoted for being at depth 5. What depth really is, is a discovery cost: the deeper a page sits, the less often it gets crawled, the less internal authority reaches it, and the longer it takes for changes to be noticed. Those costs compound until a page is effectively invisible — not penalized, just never found.

So the threshold depends on your site’s size, and the realistic guidance looks like this:

  • Small sites (under ~1,000 pages): almost everything should sit within 3 clicks of the homepage. If your blog or small store has pages at depth 5+, that’s a structural problem you can fix, not a constraint you have to accept.
  • Medium sites (1k–50k pages): aim for your important pages at depth 2–3 and the bulk of content within 4 clicks. Some depth is unavoidable; uncontrolled depth isn’t.
  • Large sites (50k+ pages): a flat three-click architecture is impossible — you’d need an unusable menu. Keep your money pages and pillar content at depth 1–2, accept that long-tail pages live deeper, and use internal linking to keep depth 4–6 from sliding into depth 10+.

The pattern across all three: depth should track importance. The pages you care about most should be the shallowest. The danger zone isn’t “any page past depth 3” — it’s an important page that has quietly drifted to depth 7.

Why “more than three clicks” became the warning sign

The famous “three-click rule” isn’t a law, and there’s nothing magic about the number three. But it persists for a sound reason: real data on visitor behavior and crawl frequency both fall off a cliff with depth. Roughly speaking, each click down loses a large share of the visitors who’d have continued — and search engines mirror that, crawling pages near the homepage far more often than pages buried deep. Three clicks is just the point where, on most sites, the drop-off starts to bite. Treat it as a smoke alarm, not a building code.

The traps that bury pages without you noticing

Depth problems are rarely deliberate. They’re side effects of structures that seemed reasonable at the time:

  • Pagination chains. If older content is only reachable through “page 2, page 3, … page 40” of an archive, the posts on page 40 are effectively at depth 40. Blogs and stores bury enormous amounts of good content this way.
  • Hubs that don’t link down. A category page that exists but links to nothing useful strands everything beneath it. The hub is shallow; its children are orphaned in practice.
  • Filters and parameters. Faceted navigation can spawn endless deep URLs that soak up crawl attention meant for your real pages.
  • Links that need JavaScript to appear. This one is increasingly important and widely missed. If a link only shows up after a script runs — a “Load more” button, a client-rendered menu — then a crawler that doesn’t execute that script never sees the link, and the page behind it is, to that crawler, at infinite depth.

The depth most people never check: what AI crawlers see

Here’s a blind spot worth its own section. Google can render JavaScript, but the new wave of crawlers feeding AI answers — the bots behind ChatGPT, Claude, and Perplexity — largely don’t execute JavaScript. They read your raw HTML and follow the links that are actually in it. So a page that’s three clicks deep in your fancy JS-driven navigation might be unreachable to an AI crawler — effectively orphaned in the exact place a growing share of discovery now happens.

That means “how deep is my page” now has two answers: how deep it is for a JavaScript-rendering crawler like Googlebot, and how deep it is in the raw, static HTML that AI crawlers (and, notably, the view Google indexes first) actually read. The static answer is the stricter one, and it’s the one almost no one measures.

How to find your deep pages (you can’t eyeball this)

Depth is a property of your whole link graph, not of any single page, so you can’t guess it by looking at a URL — a page at example.com/blog/2021/03/old-post can be one click from home if your homepage links to it, while a short URL can be buried six clicks deep. You need a crawl that starts at the homepage and records the shortest click-path to every page.

That’s exactly what Crawlmouse does: it crawls your live site the way a search engine would, computes the click depth of every page it finds, and rolls depth into your structure grade — so you can see, in under two minutes, which important pages have drifted too deep. Because it reads the static HTML, the depth it reports is the strict, AI-crawler-honest version, not a flattering one. Pair it with an orphan-page check — orphans are simply the extreme case, pages at infinite depth — and you’ll have the full reachability picture.

How to pull a buried page back up

The fix for depth is never “rewrite the URL.” Depth is measured in links, so you fix it by changing what links to what. In rough order of impact:

  1. Link to it from a shallow, relevant page. A single contextual link from your homepage, a hub page, or a popular article can lift a page several levels in one move. This is the cheapest, highest-leverage fix you have.
  2. Strengthen your hub and category pages so they actually link down to the pages beneath them — and link up to those hubs from related content.
  3. Tame pagination. Surface key older content through hubs, “related posts” modules, or an HTML sitemap, instead of relying on a 40-page archive chain.
  4. Make sure the links are in the HTML, not just the JavaScript — so every crawler, including the AI ones, can actually follow them.
  5. For a genuinely urgent page, add a temporary homepage link (a “New” or “Featured” section). It moves the page to depth 1 instantly and gets it re-crawled in hours rather than weeks.

Resist the temptation to fix depth by dumping every URL into your main navigation. That makes everything shallow and nothing important — it spreads your authority thin across hundreds of links instead of concentrating it where it counts. Good architecture is shallow where it matters and clearly organized everywhere else.

The fastest way to know where you stand

You don’t need to map this by hand. Crawl your site, sort the pages by depth, and look at what’s sitting past the threshold for your size — then pull the important ones up with a link or two and re-crawl to confirm. It’s one of the few SEO changes whose effect you can see in your site’s structure the moment you make it, long before the rankings catch up.

If a page you care about is buried, the work that made it is already done. All that’s missing is a path to it — and that’s a fix you can ship today.

Keep reading