Methodology

How SeeLLM classifies AI traffic

We separate every request into one of eleven categories using user-agent fingerprints, ASN matching, and request shape. This page documents the rules we apply and the known limitations of each.

Last updated: 2026-05-05

Known limitations

Referer stripping undercounts AI referrals

Mobile ChatGPT, in-app browsers, Arc, Brave, and several AI products strip the Referer header on outbound clicks. AI referral counts are conservative — they reflect what we can see, not the full traffic. Treat referral rankings between AI sources as directional.

User-agent matching is the primary signal

We rely on declared user-agents for bot classification. Some scrapers and small AI projects use Python-requests or browser-shaped UAs without identifying themselves. Those land in HTTP client or Browser, not AI training. ASN matching is used as a secondary signal where available.

AI assistant vs. AI training is a fuzzy line

Some bots (e.g., Perplexity) operate in both modes — training crawls and live-user fetches — sometimes with the same UA. We classify by UA token where vendors differentiate (GPTBot vs. ChatGPT-User), and conservatively otherwise.

Self-identifying is not the same as truthful

Any client can claim to be GPTBot. We do not verify ownership for every request. For high-stakes use cases, we recommend cross-checking with reverse DNS or vendor-published IP ranges.

Categories evolve

New AI products and crawlers appear monthly. The signature lists above are point-in-time and updated as new bots are observed at scale.

How we collect

Edge-side classification via Cloudflare Worker on customer domains
Cloudflare Logpush ingestion for customers preferring no install
Server-log upload for one-off audits
All classification happens server-side. There is no client-side script and no cross-site tracking.

See it on your site

Run a free Score on any URL to check AI readiness, or install the edge worker to start collecting the same classification data on your own domain.

Run a free Score Set up the worker

How SeeLLM classifies AI traffic

Categories

AI training crawler

AI assistant

AI referral

AI coding agent

Search engine

SEO tool

Social preview

Monitoring

HTTP client

Scanner

Browser