CONNECT WITH US

Tech

97% Of LLMS.txt Files Got No Requests, Ahrefs Data Shows via @sejournal, @MattGSouthern

SearchEngineJournal - Tech logo

Published on

97% Of LLMS.txt Files Got No Requests, Ahrefs Data Shows via @sejournal, @MattGSouthern

  1. SEJ
  2.  ⋅ 
  3. Generative AI

97% Of LLMS.txt Files Got No Requests, Ahrefs Data Shows

  • 97% of llms.txt files got zero requests in May 2026.
  • AI retrieval bots accounted for 1.1% of llms.txt requests.
  • 12% of requests came from tools auditing or scanning the files.

Ahrefs analyzed 137K domains and found 97% of llms.txt files received zero requests. AI retrieval bots accounted for 1% of total requests.

Ahrefs analyzed logs from 137,000 domains and found 97% of llms.txt files got zero requests. No bots, no humans.

The analysis used Ahrefs data to identify user agents fetching files. Around 28% of 137,000 domains publish an llms.txt file, but since Ahrefs’ customers are more technical, the actual adoption on the broader web is likely lower.

Of roughly 38,000 domains with valid files, only about 1,100 received any traffic.

Of files with requests, 96% came from bots, mostly non-AI. AI retrieval bots linked to ChatGPT and Perplexity made up 1%.

Who Fetches llms.txt Files

SEO audit tools had 21% requests, then unidentified bots (14%), web crawlers like Googlebot (13%), and tech profiling tools like BuiltWith (11%).

AI bots, across four categories, made up 19% of requests. AI is the largest segment, but the breakdown differs from most llms.txt advocates’ expectations.

Coding agents sent 10% of requests, training crawlers 5%, assistants 2%. Claude-Code and GPTBot were the top individual bots.

Slackbot alone fetched llms.txt files more often than PerplexityBot did.

The Industry Studying Itself

The report found 12% of requests from tools that audit, scan, or study llms.txt files rather than use them.

GEO and AEO readiness tools sent 5% of requests; dedicated scanners and validators sent 3%, more than AI retrieval bots and assistants combined. Research bots sent 2%, with the largest research crawler identifying as a prompt injection survey.

An ecosystem has developed around scoring and cataloging a file format before a significant audience appears.

No AI Bot Looks For Files That Don’t Exist

Requests for /llms.txt paths with 404 errors drew no AI traffic. Humans hitting those 404s seem to be people typing the URL into browsers, likely checking competitors.

The Chrome Lighthouse llms.txt audit, which reignited the llms.txt debate in May, generated about 22 requests across the dataset, roughly 1 in 1,000.

Why This Matters

The data lines up with what Google’s John Mueller has said about llms.txt for over a year. Lily Ray pressed Mueller on the gap between Google Search’s dismissal and Chrome’s Lighthouse audit. He said llms.txt is “not done for search” and called it a “temporary crutch, perhaps to save some tokens” for AI coding tools.

The data shows the file’s audience is coding agents and training crawlers, not AI search and retrieval bots that would generate citations.

We reported on the split between Google Search and Lighthouse documentation in May. SE Ranking’s earlier analysis of 300,000 domains showed no connection between having llms.txt and AI citation frequency. Ahrefs’ data points to one possible reason: the bots most directly tied to live AI retrieval barely requested these files in May.

Looking Ahead

The prompt injection finding is worth watching. Ahrefs found a crawler studying llms.txt as a prompt injection risk, since agents trust ingested content. Sites auto-generating these files via CMS should review their content.

Every figure in this report is a ceiling. Ahrefs measured requests, not whether bots acted on what they fetched.


Featured Image: sdecoret/Shutterstock

Category News Generative AI
SEJ STAFF Matt G. Southern Senior News Writer at Search Engine Journal

See short video versions of news stories on YouTube and TikTok. Matt G. Southern is the Senior News Writer at ...



Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It's possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Google Preferred Source