Blog

Why Google Analytics Can't See AI Visibility

Google Analytics is useful for human sessions, but it cannot show which pages AI systems fetch, revisit, cite, skip, or leave crawled but not cited.

Google Analytics can tell you what people did on your site. It cannot tell you which AI crawler requests happened before a person ever arrived.

That distinction matters more than it used to.

AI-mediated discovery often starts before a browser session exists. A crawler fetches a page. An answer engine reads it. A user sees a synthesized answer, comparison, or recommendation. Sometimes they click. Often they do not.

GA4 is still useful. It is just looking at a different layer.

The JavaScript boundary

Google Analytics 4 (GA4) uses a simple tracking method:

  1. A visitor loads your page
  2. Their browser downloads and executes the GA4 JavaScript snippet
  3. The script sends tracking data to Google's servers
  4. You see the visit in your dashboard

This works for humans using browsers. But most AI crawler requests are different.

They do not create a normal GA4 browser session.

How AI Bots Access Your Content

When GPTBot (OpenAI's crawler), ClaudeBot (Anthropic's crawler), or PerplexityBot visits your site, here's what happens:

  1. The bot makes an HTTP request to your server
  2. Your server returns the HTML content
  3. The bot parses the HTML and extracts text
  4. The crawler request does not execute GA4 like a human browser session

This is not a GA4 bug. Most AI crawler requests are optimized for speed and extraction, so they do not behave like full browser sessions.

So GA4 usually cannot show the crawler fetch itself.

It also cannot answer the more important operational question: did that fetch turn into reuse?

What GA4 cannot tell you

Which important pages AI systems fetch

GA4 reports sessions, events, paths, sources, and conversions for browser traffic. It does not report that GPTBot, ClaudeBot, PerplexityBot, or another AI crawler requested your comparison page, documentation page, pricing page, or category page.

That means a page can become important to AI systems without ever producing a clean analytics signal.

Which pages get reused

A fetch is not the same as a citation, recommendation, or answer mention.

This is where most analytics setups stop too early. They can answer "did a person visit this page?" They cannot answer:

  • Did AI systems fetch the page?
  • Did the page appear in an answer?
  • Did the page get cited or summarized?
  • Did the page get skipped after a rewrite?
  • Did the page move into crawled but not cited?

Those are page-level AI visibility questions, not web analytics questions.

Which AI clicks are attributable

When AI products send human visitors, attribution can still be messy. Some visits arrive with useful referrer data. Some land in direct or unclassified buckets. Some happen after the user has already made a shortlist inside an AI interface.

GA4 can help you analyze the session once it exists. It cannot reconstruct the upstream AI interaction from that session alone.

What server-side evidence adds

The missing layer is server-side evidence. You need to observe requests before JavaScript matters.

How Server-Side Detection Works

  1. Request arrives at your server (or edge worker)
  2. Headers analyzed: User-agent, ASN, IP patterns
  3. Bot identified: GPTBot, ClaudeBot, PerplexityBot, etc.
  4. Data logged: Platform, timestamp, URL, request type, page state
  5. Human visitors continue normally

This happens in milliseconds with zero impact on user experience.

What You Can Detect

With server-side monitoring, you can see:

  • Which AI platforms fetch important pages: ChatGPT, Claude, Perplexity, Gemini, and other AI systems where detectable
  • Visit frequency and patterns: Understand crawl behavior
  • Page-level changes: See when fetches, citations, or referrals change after edits
  • Crawled-but-not-cited states: Find pages that are accessible but not reused
  • Content gaps: Discover requests for missing or outdated URLs

For a concrete look at how different platforms actually behave once you split them by intent, see what 30 days of AI bot traffic on two real domains actually looks like — the same Claude vs ChatGPT crawler mix flips depending on whether a site is content-driven or technical.

What This Means for GEO (Generative Engine Optimization)

GEO and answer engine optimization can get vague quickly. The practical version is simpler: make important pages easier for AI systems to interpret, compare, cite, and trust.

But you can't optimize what you can't measure.

If you're trying to increase your presence in ChatGPT, Claude, or Perplexity answers, you need to know:

  1. Are these platforms even accessing your content?
  2. Which pages do they visit most?
  3. Which pages appear in answers, citations, or referrals?
  4. Which important pages are crawled but not cited?
  5. Did a content change improve or hurt reuse?

GA4 cannot answer these questions from browser-session data alone. Server-side detection fills in the missing request layer.

This is the basis for AI visibility attribution: connecting what AI systems say with backend evidence of which pages they request, revisit, skip, or leave crawled but not cited.

The Path Forward

Don't Abandon GA4

GA4 is still valuable for understanding human behavior:

  • Session duration and engagement
  • Conversion funnels
  • Audience demographics
  • User flow analysis

Add page-level AI visibility

Use server-side evidence alongside GA4 to:

  • Track which AI systems fetch important pages
  • Separate access from reuse
  • Watch page state changes after content updates
  • Prioritize pages that affect pipeline, revenue, or evaluation

Reconcile the layers

With both data sources, you can:

  • Report human behavior from GA4
  • Report AI fetch and reuse patterns from server-side data
  • Make informed GEO decisions
  • Decide which pages deserve content architecture work first

Getting Started

SeeLLM starts with the pages that matter most.

  1. Run the free AI Visibility Score to check whether a page is readable and accessible.

  2. Install site-wide monitoring when you need ongoing evidence of which AI systems fetch, revisit, skip, or reuse your pages.

  3. Focus first on decision-stage pages: comparisons, category pages, pricing, documentation, and high-intent editorial pages.

For the full page-level measurement model, read Crawled, Cited, or Ignored? A Practical Framework for Measuring AI Visibility.


Want the quick baseline? Start with the free AI Visibility Score.

Questions? Email us at [email protected]

Continue reading

More from the field notes

All posts

From reading to action

See which pages AI systems can actually use.

Start with the free AI Visibility Score. When you need page-level evidence, move from static checks to monitoring the pages that matter.