Blog

We Failed Our Own AI-Readiness Check

SeeLLM scored 100/A on bot access and 67/100 on agent readiness because our H1 started 8,388 HTML characters into the page.

We've spent the last few months building seellm.com — server-side observation for AI bot traffic, plus a free AI Search Visibility Score that audits any URL for how AI search engines and assistants will see it.

On May 6, 2026, we ran it on ourselves.

The result

Overall score: 100 / A.

  • All 20 major AI bots allowed in robots.txt (GPTBot, ClaudeBot, Bytespider, PerplexityBot, Google-Extended, Applebot-Extended, Meta-ExternalAgent, Amazonbot, DuckAssistBot, Mistral-User, xAI, CCBot, YouBot, and the rest) ✅
  • robots.txt served, sitemap declared, sitemap valid ✅
  • Canonical host resolves, no restrictive X-Robots-Tag header ✅
  • Spec-conformant llms.txt and llms-full.txt published ✅

So far so good. Then agentReadiness came back needs_fixes (67 / 100):

CheckResult
Homepage HTML is fetchable✅ pass
Machine-readable summary available (llms.txt)✅ pass
Initial page within agent-friendly token budget (2,625 tokens)✅ pass
Useful answer appears near the topfail
Content has extractable structure (lists/tables)🟡 warn
Core content not hidden behind JavaScript🟡 warn (41 script tags)

The failing check, verbatim from our own scanner: "Primary heading starts after 8,388 HTML characters."

When ChatGPT-User or Claude-SearchBot fetches our homepage on behalf of a real human prompt, it has to scroll past 8 KB of nav and decorative HTML before reaching the H1 that explains what we do.

The cobbler's children, etc. We're a startup whose entire pitch is "make sure AI search can actually read your site," and our own homepage fails on the most basic agent-readability check.

Why this happens (and probably to you too)

This isn't a SeeLLM-specific problem. We see it across most modern landing pages we audit. The pattern:

  • Hero section optimized for human eyeballs. A big animated background, a logo, a top nav, sometimes a banner, sometimes social proof — all rendered before the H1.
  • Marketing-design defaults push the H1 down. A typical SaaS landing page hierarchy is: nav → hero visual → headline. The headline is visually first, but in the HTML source it's often the third or fourth major block.
  • JavaScript-heavy frameworks compound the problem. React/Next.js apps ship a lot of script tags before the readable content. Agents that don't execute JS — and many still don't, especially during training — see scaffolding, not content.

The result: when an AI assistant has a 2,000–4,000-token budget per page and needs to find your direct answer fast, it gives up before reaching it. Or worse, it summarizes your nav.

What "agent readiness" actually checks

Our /agent-readiness diagnostic looks at a page from an agent's perspective, not a browser's. The six checks we run on the initial server-rendered HTML:

  1. HTML is fetchable — does the URL return readable HTML at all?
  2. Machine-readable summary available — is there a llms.txt pointing to a structured summary?
  3. Token budget — is the initial response within an agent-friendly token count (~5,000 tokens or less for the part agents read)?
  4. Answer depth — does the H1 + a short answer paragraph appear early in the HTML, or is it buried after long navigation and decoration?
  5. Extractable structure — are there lists, tables, and clear headings that make the answer easy to lift?
  6. JavaScript rendering risk — does the readable text exist server-side, or is it injected after hydration?

A page that passes all six is parseable by an AI assistant in a single fetch with high confidence. A page that fails one or more either gets misread, paraphrased instead of cited, or skipped entirely.

The full classification rules are on our methodology page.

The fix (shipped)

The fix was mundane: move the H1 + a short answer paragraph above the hero animation in the server-rendered HTML. The hero stays for human visitors; the H1 just shifts up in the source.

We shipped it. The H1 now starts around character 1,200 of the HTML — down from 8,388. The hero animation, nav, and decorative scaffolding still render after, so nothing changed for human visitors. Run /agent-readiness on us today and the "useful answer near top" check passes.

The harder lesson: agent readiness is a distinct design constraint from human readability, and most landing pages today are designed only for humans. If you care about AI search as a distribution channel, the same hero that converts human visitors may be the thing keeping you out of ChatGPT and Claude answers.

See your own score

It's a free single-URL audit, no signup, ~30 seconds:

Try it on your own homepage. If your H1 is more than ~5,000 HTML characters deep, you're probably losing AI surface area without realizing it. If your score is high and you're seeing real ChatGPT/Claude referrals show up in your analytics, we'd love to hear what's on your page that the rest of the industry is getting wrong.

Related reading:

Continue reading

More from the field notes

All posts

From reading to action

See which pages AI systems can actually use.

Start with the free AI Visibility Score. When you need page-level evidence, move from static checks to monitoring the pages that matter.