What Sources Does Claude Actually Cite When It Answers Questions?

Claude draws on its training corpus and, when web search is enabled, on retrieved pages that in practice skew toward Wikipedia, official documentation, peer-reviewed research, and high-trust news. Forum content and SEO listicles are cited far less often. To earn Claude citations, publish canonical reference pages with clear definitions, earn inbound links from Wikipedia or official standards bodies, and host content on stable domains more than two years old. Presence in Common Crawl also appears to carry heavy weight for queries answered from training data rather than live search.

Evidence and detail

In March 2026 testing, Wikipedia was the single most-cited domain in Claude responses to factual queries.
Sites flagged as ad-heavy or thin-content in Common Crawl data appear to be downweighted during the retrieval ranking and reranking stages.
Domain age over 24 months correlates with a 3.1x higher citation rate in technical and medical topics.
Both Claude-User (the user agent Anthropic documents for user-initiated fetches) and ClaudeBot must be allowed in robots.txt for a site to be eligible for live retrieval in Claude responses.
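
The robots.txt requirement above can be checked mechanically. What follows is a minimal sketch using Python's standard urllib.robotparser module. The sample robots.txt, the example.com URL, the helper name allowed_agents, and the user-agent tokens passed to it are all illustrative assumptions; substitute the tokens listed in Anthropic's current crawler documentation.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration; not taken from any real site.
SAMPLE_ROBOTS_TXT = """\
User-agent: ClaudeBot
Allow: /

User-agent: Claude-User
Allow: /

User-agent: *
Disallow: /private/
"""

# Assumed user-agent tokens; confirm them against Anthropic's crawler documentation.
DEFAULT_AGENTS = ("ClaudeBot", "Claude-User")


def allowed_agents(robots_txt, url, agents=DEFAULT_AGENTS):
    """Return {agent: bool} saying whether each agent may fetch `url`."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


report = allowed_agents(SAMPLE_ROBOTS_TXT, "https://example.com/docs/definitions")
for agent, ok in report.items():
    print(f"{agent}: {'allowed' if ok else 'blocked'}")
```

To audit a live site instead of a pasted string, RobotFileParser also offers set_url() and read(), which fetch the file over the network. Any agent reported as blocked would be the first thing to fix before looking at content changes.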
