We use essential cookies to make our site work. With your consent, we may also use non-essential cookies to improve user experience and analyze website traffic. By clicking “Accept,” you agree to our website's cookie use as described in our Cookie Policy. You can change your cookie settings at any time by clicking “Preferences.”
    Your brand has an AI profile — whether you know it or not. Claim yours →
    Research

    How 5 AI Platforms Actually Crawl Brand Data: What 111,286 Requests Taught Us

    We operate one of the largest AI brand directories on the internet. This week, AI model crawlers accounted for 17.5% of all traffic. Here's exactly what each platform is doing.

    111,286

    Total requests/week

    19,454

    AI crawler requests

    17.5%

    AI share of traffic

    5,829

    Brands in directory

    OpenAI

    10,816 requests (55.6%)
    CrawlerRequestsWhat It Does
    GPTBot/1.38,159Training/indexing crawler — builds ChatGPT's parametric 'memory'
    OAI-SearchBot/1.31,250Real-time search crawler — fetches data when ChatGPT searches the web
    ChatGPT-User/1.0515Live browsing — a human asked ChatGPT to 'look up' a specific page
    OAI-SearchBot/1.0441Older search variant, still active
    GPTBot/1.010Legacy version, phasing out

    OpenAI operates three distinct crawling modes, and your brand needs to serve accurate data to all three.

    GPTBot builds parametric knowledge — the 'memory' that ChatGPT uses when it doesn't search the web. If your brand information is wrong on pages GPTBot crawls, ChatGPT will confidently state wrong information even when it's not searching.

    OAI-SearchBot retrieves real-time data. When someone asks ChatGPT a question and it hits the search button, this crawler fetches the results. This is your opportunity for accuracy — real-time retrieval can override stale parametric knowledge.

    ChatGPT-User fires when a human asks ChatGPT to 'look up' or 'visit' a specific URL. 515 times this week, someone asked ChatGPT to go check a brand profile on our directory. That's a direct signal of buyer behavior.

    Anthropic (Claude)

    4,669 requests (24.0%)
    CrawlerRequestsWhat It Does
    ClaudeBot/1.04,235Main indexing crawler
    Claude-User303Live browsing — someone asked Claude to check a page
    Claude-SearchBot/1.099Search-specific crawler
    Claude-User/1.032Authenticated browsing variant

    Anthropic's crawling volume is about 43% of OpenAI's — which aligns with our BAI data showing that Claude tends to have less comprehensive brand knowledge than ChatGPT.

    The ClaudeBot crawler does the heavy lifting, but the Claude-User traffic (335 requests) is the most interesting signal: real people are asking Claude about brands, and Claude is visiting our directory to answer them.

    Amazon

    4,366 requests (22.4%)
    CrawlerRequestsWhat It Does
    Amzn-SearchBot/0.12,885Amazon's AI search crawler
    Amazonbot/0.11,481General web crawler

    Amazon runs two separate bots with nearly equal presence. This matters because Amazon's AI assistants — Alexa and Rufus — use this data for product and brand recommendations.

    If you sell to enterprises, you might think Amazon doesn't matter. But Rufus is increasingly used for B2B product research, and your brand data feeds into that recommendation engine.

    Perplexity

    1,699 requests (8.7%)
    CrawlerRequestsWhat It Does
    PerplexityBot/1.01,699Search-and-answer crawler

    Perplexity is the purest signal in this data. Every single Perplexity crawl is in direct service of answering a real user query. When PerplexityBot visits a brand profile, it's because someone asked Perplexity a question about that brand or category.

    1,699 times this week, someone asked Perplexity a question that led to our directory. These are the highest-intent requests because each one maps to a real human asking a real question.

    ByteDance

    15 requests (0.1%)
    CrawlerRequestsWhat It Does
    Bytespider15TikTok/Doubao crawler

    Minimal presence today, but worth watching. ByteDance's Bytespider is the crawler behind TikTok's AI features and their Doubao assistant.

    15 requests is noise, but it's noise that suggests TikTok's AI is starting to index brand data. If your audience skews younger or if TikTok is a channel for your market, this will matter within 12 months.

    The Bigger Picture

    Zooming out from AI crawlers, here's the full traffic landscape for a brand directory serving 5,829 profiles:

    Traffic SourceRequests/WeekKey Crawlers
    AI model crawlers19,454GPTBot, ClaudeBot, PerplexityBot, Amazonbot
    SEO tool crawlers10,936SemrushBot, MJ12bot, SERankingBot, AhrefsBot
    Search engine crawlers6,730Googlebot (4,530), YandexBot (1,403), bingbot (768)
    Human traffic (estimated)~13,000Direct, referral, organic

    The ratio that matters:

    AI crawlers are hitting our directory at nearly 3× the rate of search engine crawlers. AI models are consuming brand data faster and more aggressively than Google. This is the clearest infrastructure-level signal that AI is becoming a primary consumer of brand information — not a secondary channel.

    How Crawl Frequency Correlates with AI Accuracy

    An obvious question: do brands that get crawled more frequently have higher BAI scores?

    From our directory data: yes, but with important caveats. Brands with strong structured data and consistent authoritative sources tend to get crawled more frequently — AI models learn that these pages are reliable and revisit them more often. It's a virtuous cycle: accuracy drives crawl frequency, which drives more up-to-date parametric knowledge, which drives accuracy.

    The inverse is also true. Phantom brands (BAI 0–19) tend to have the lowest crawl frequencies. AI models never formed a strong entity representation, so they don't prioritize re-crawling those pages.

    The actionable insight: if you can get your page right once — accurate structured data, correct industry classification, consistent messaging — the AI crawlers will return often enough to keep it current. The hard part is the initial correction. The maintenance is largely self-sustaining.

    Related

    This is the first in our monthly "State of AI Brand Crawling" series. Read the full March 2026 report →

    Data source: Cloudflare server logs from Optimly's AI Brand Directory infrastructure, week of March 22–28, 2026. All request counts are approximate and based on user-agent classification. Some crawler variants may be underrepresented if they use non-standard user-agent strings.