robots.txt
1 point
pass / fail
Pass rule: The site exposes a reachable, parseable robots.txt file at the domain root.
Agents and crawlers use this file as the first crawl-policy signal.
AgentScan uses a content-readiness profile focused on observable public HTTP behavior. Each passing check contributes one point. Neutral informational checks are excluded from the denominator when they appear in upstream data.
1 point
pass / fail
Pass rule: The site exposes a reachable, parseable robots.txt file at the domain root.
Agents and crawlers use this file as the first crawl-policy signal.
1 point
pass / fail
Pass rule: A sitemap is discoverable directly or through robots.txt and returns valid XML.
Sitemaps give agents a canonical inventory of public URLs.
1 point
pass / fail
Pass rule: The homepage response advertises useful machine-readable references through RFC 8288 Link headers.
Headers let agents find indexes, feeds, API catalogs, and related resources without page scraping.
1 point
pass / fail
Pass rule: A request with Accept: text/markdown receives a useful markdown representation.
Markdown gives agents a cleaner parsing path than visual HTML when the site can provide it.
1 point
pass / fail
Pass rule: robots.txt declares explicit behavior for major AI crawlers or clearly relies on wildcard rules.
Explicit bot policy reduces ambiguity around discovery, search, and on-demand fetching.
1 point
pass / fail
Pass rule: robots.txt includes Content-Signal preferences such as ai-train, search, and ai-input.
Content-Signal separates search visibility from model-training and answer-input preferences.
Score percentage equals passed scored checks divided by total scored checks. The public content profile normally has six scored checks, so 6/6 is 100%, 4/6 is 67%, and 2/6 is 33%.
Scan execution is powered by the IsItAgentReady scan API. AgentScan adds implementation tools, prompts, examples, and workflow guidance around those results.
Review the AgentScan demo report with evidence for each passing check.