AI crawlers, search bots, and training agents.
A compact reference of the crawler identities DataFast checks: what they are for, how they identify themselves, and where their verification sources live.
58 crawlers found
AI answers
User-triggered / 16 crawlers
OpenAI
ChatGPT-User
User-triggered fetches when ChatGPT opens a page to answer a person.
Anthropic
Claude-User
User-triggered fetches when Claude opens a page to answer a person.
Perplexity
Perplexity-User
User-triggered fetches for Perplexity answers and source citations.
Google-Agent
Google user-triggered fetcher used by AI or product experiences.
Google-NotebookLM
Google fetcher associated with NotebookLM-style answer workflows.
Google-Read-Aloud
Google fetcher associated with read-aloud and assistant experiences.
GoogleAgent
Google user-triggered fetcher used by AI or product experiences.
Mistral
MistralAI-User
MistralAI-User is tracked as a user-triggered fetcher from Mistral.
Microsoft
Copilot
User-triggered Microsoft/Copilot fetches for AI answers.
Amazon
Amzn-User
User-triggered Amazon fetches for fresh answers in products such as Alexa.
DuckDuckGo
DuckAssistBot
DuckDuckGo real-time crawler for AI-assisted answers with citations.
xAI
xAI-SearchBot
xAI-SearchBot is tracked as a user-triggered fetcher from xAI.
xAI
Grok-DeepSearch
Grok-DeepSearch is tracked as a user-triggered fetcher from xAI.
Meta
meta-externalfetcher
Meta fetcher used when a person requests or shares a specific URL.
Moonshot AI
Kimi-User
Kimi-User is tracked as a user-triggered fetcher from Moonshot AI.
Alibaba
Qwen-User
Qwen-User is tracked as a user-triggered fetcher from Alibaba.
Search indexes
Discovery / 15 crawlers
OpenAI
OAI-SearchBot
OpenAI crawler used to surface sites in ChatGPT search results.
Anthropic
Claude-SearchBot
Anthropic crawler used to discover and refresh content for Claude search.
Perplexity
PerplexityBot
Perplexity crawler used to keep its answer and search index fresh.
Google-InspectionTool
Google-InspectionTool is tracked as a search or answer-index crawler from Google.
GoogleOther
GoogleOther is tracked as a search or answer-index crawler from Google.
Googlebot
Classic Google crawler used for Search indexing and discovery.
Mistral
MistralAI-Index
MistralAI-Index is tracked as a search or answer-index crawler from Mistral.
Microsoft
Bingbot
Microsoft crawler used for Bing Search indexing.
Microsoft
msnbot
Legacy Microsoft crawler used for search indexing.
Apple
Applebot
Apple crawler used for Apple search and assistant surfaces.
Amazon
Amzn-SearchBot
Amazon crawler used to make content eligible for Amazon search experiences.
Moonshot AI
Kimi-SearchBot
Kimi-SearchBot is tracked as a search or answer-index crawler from Moonshot AI.
ByteDance
TikTokSpider
TikTokSpider is tracked as a search or answer-index crawler from ByteDance.
Baidu
Baiduspider
Baiduspider is tracked as a search or answer-index crawler from Baidu.
You.com
YouBot
YouBot is tracked as a search or answer-index crawler from You.com.
Training crawlers
Model data / 15 crawlers
OpenAI
GPTBot
OpenAI crawler for collecting public content that may improve future models.
Anthropic
ClaudeBot
Anthropic crawler for public content that may be used to improve Claude.
Apple
Applebot-Extended
Robots token controlling whether Apple may use crawled content for AI training.
Amazon
Amazonbot
Amazon crawler used to improve Amazon products and services.
Meta
meta-externalagent
Meta crawler for indexing or improving Meta products and AI systems.
Moonshot AI
KimiBot
KimiBot is tracked as a public-content crawler from Moonshot AI.
ByteDance
Bytespider
Bytespider is tracked as a public-content crawler from ByteDance.
Baidu
ERNIEBot
ERNIEBot is tracked as a public-content crawler from Baidu.
Alibaba
QwenBot
QwenBot is tracked as a public-content crawler from Alibaba.
Zhipu AI
ChatGLM-Spider
ChatGLM-Spider is tracked as a public-content crawler from Zhipu AI.
DeepSeek
DeepSeekBot
DeepSeekBot is tracked as a public-content crawler from DeepSeek.
Cohere
cohere-ai
cohere-ai is tracked as a public-content crawler from Cohere.
Cohere
cohere-training-data-crawler
cohere-training-data-crawler is tracked as a public-content crawler from Cohere.
Allen AI
AI2Bot
Allen Institute crawler used to find documents for AI research systems.
Common Crawl
CCBot
Common Crawl crawler that builds open web crawl datasets.
Other AI bots
AI crawler / 12 crawlers
OpenAI
OAI-AdsBot
OpenAI crawler associated with ad or landing-page fetch workflows.
Google-CloudVertexBot
Google-CloudVertexBot is tracked as other ai bots from Google.
xAI
GrokBot
GrokBot is tracked as other ai bots from xAI.
xAI
xAI-Bot
xAI-Bot is tracked as other ai bots from xAI.
xAI
xAI-Grok
xAI-Grok is tracked as other ai bots from xAI.
xAI
xAI-Web-Crawler
xAI-Web-Crawler is tracked as other ai bots from xAI.
xAI
Grok
Grok is tracked as other ai bots from xAI.
Meta
FacebookBot
FacebookBot is tracked as other ai bots from Meta.
ByteDance
Doubaobot
Doubaobot is tracked as other ai bots from ByteDance.
Baidu
YiyanBot
YiyanBot is tracked as other ai bots from Baidu.
Alibaba
TongyiBot
TongyiBot is tracked as other ai bots from Alibaba.
Alibaba
AliyunBot
AliyunBot is tracked as other ai bots from Alibaba.
Discover when bots and AI assistants crawl your website.
Install the server-side tracker to see which AI assistants, search engines, and training crawlers request your pages. It helps you understand what bots are trying to find, including pages that return 404.