Blocking AI Platforms
How to evaluate and implement AI platform blocking decisions using SeeLLM's what-if analysis.
Before You Block
Blocking an AI platform is a significant decision. Before acting, use SeeLLM's what-if analysis to understand the impact.
Using the What-If Tool
Via MCP, ask your AI assistant:
"What would happen if I blocked GPTBot from my site?"
Or via the API:
curl "https://api.seellm.com/api/analytics/whatif-block?domain=example.com&platform=gptbot&range=30d" \
-H "Authorization: Bearer sk_live_..."
The analysis shows:
- How many visits you'd lose
- Estimated value impact
- Whether the platform drives referral traffic (real users) or just crawls
When to Consider Blocking
| Reason | Recommended Action |
|---|---|
| Platform only crawls, never refers users | Consider blocking — low value, high resource cost |
| Platform drives referral traffic | Keep it — those are real users discovering your content |
| Excessive crawl rate causing performance issues | Rate-limit rather than fully block |
| Legal / licensing concerns | Block with robots.txt and verify compliance |
How to Block
Via robots.txt
The standard way to block AI crawlers:
# Block OpenAI's training crawler
User-agent: GPTBot
Disallow: /
# Block Anthropic's crawler
User-agent: ClaudeBot
Disallow: /
# Allow ChatGPT to fetch pages for users (referrals)
User-agent: ChatGPT-User
Allow: /
Note the distinction: GPTBot crawls for training, while ChatGPT-User fetches pages to answer user questions. You may want to block one but allow the other.
Via Cloudflare Rules
For more granular control, use Cloudflare WAF rules to block by user agent, IP range, or ASN.
Monitoring After Blocking
After implementing a block, monitor your SeeLLM dashboard to verify:
- The blocked platform's visits drop to zero
- No unexpected impact on other traffic
- Referral traffic from that platform's users still works (if you only blocked the crawler)