Skip to main content

Step 1: Create an API Key

Create an API key in Promptwatch. Go to Settings → API Keys in your Promptwatch dashboard.

Step 2: Send HTTP Request Logs

Send your HTTP request logs to https://logs.promptwatch.com/event in the following format: Request:
Method: POST
URL: https://logs.promptwatch.com/event
Content-Type: application/json
Headers: X-API-Key: YOUR_API_KEY
Body: [see below]
You can batch multiple events together in a single request by sending a JSON array. The maximum request size is 50MB. All properties need to be present for each event. query_string and referrer can be null. timestamp should be an ISO 8601 string. Example body:
[
  {
    "timestamp": "2025-10-01T12:00:00.000Z",
    "status_code": 200,
    "request_method": "GET",
    "request_path": "/",
    "query_string": "search=test",
    "content_type": "text/html",
    "client_ip": "192.168.0.10",
    "hostname": "example.com",
    "user_agent": "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; GPTBot/1.1; +https://openai.com/gptbot)",
    "referrer": "https://www.google.com"
  }
]

Optional: Filter on known AI Crawler user agents

Promptwatch identifies AI crawlers automatically and only stores AI crawler visits, all other traffic is discarded. You can safely send all your server logs without worrying about non-crawler data being retained. If you prefer to only forward AI crawler traffic, you can use the user agents listed below to pre-filter on your side. Keep in mind that you’ll need to maintain this list yourself as new crawlers emerge.
NameUser AgentFull User AgentDescription
GPT BotGPTBotMozilla/5.0 (compatible; GPTBot/1.0; +https://openai.com/gptbot)Used to crawl content for training OpenAI’s generative AI foundation models.
SearchBotOAI-SearchBotMozilla/5.0 (compatible; OAI-SearchBot/1.0; +https://openai.com/searchbot)Used by ChatGPT search to surface websites in search results.
ChatGPT CitationsChatGPT-UserMozilla/5.0 (compatible; ChatGPT-User/1.0; +https://openai.com/bot)Used for user actions in ChatGPT when visiting web pages.
NameUser AgentFull User AgentDescription
Claude BotClaudeBotMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; ClaudeBot/1.0; +claudebot@anthropic.com)Used to crawl content for training Anthropic’s generative AI models.
Claude CitationsClaude-UserMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko); compatible; Claude-User/1.0; +claudebot@anthropic.com)When individuals ask questions to Claude or use Claude Code, it may access websites using a Claude-User agent.
Claude Search BotClaude-SearchBotMozilla/5.0 (compatible; claude-search-bot/1.0; +http://www.anthropic.com/bot.html)Navigates the web to improve search result quality for users.
Claude Webclaude-webMozilla/5.0 (compatible; claude-web/1.0; +http://www.anthropic.com/bot.html)Targeted crawler for recent web content, feeding the Claude browser agent with updated site data.
Anthropic AIanthropic-aiMozilla/5.0 (compatible; anthropic-ai/1.0; +http://www.anthropic.com/bot.html)Primary Anthropic crawler that collects broad web data for Claude model development.
NameUser AgentFull User AgentDescription
GeminiGoogle-ExtendedMozilla/5.0 (compatible; Google-Extended/1.0; +http://www.google.com/bot.html)Controls whether content can be used for training Gemini AI models.
Google Mobile AgentGoogle-AgentMozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/W.X.Y.Z Mobile Safari/537.36 (compatible; Google-Agent; +https://developers.google.com/crawling/docs/crawlers-fetchers/google-agent)Used by Google AI agents to autonomously browse the web and complete tasks on behalf of users (mobile).
Google Desktop AgentGoogle-AgentMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Google-Agent; +https://developers.google.com/crawling/docs/crawlers-fetchers/google-agent) Chrome/W.X.Y.Z Safari/537.36Used by Google AI agents to autonomously browse the web and complete tasks on behalf of users (desktop).
NameUser AgentFull User AgentDescription
Perplexity BotPerplexityBotMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; PerplexityBot/1.0; +https://perplexity.ai/perplexitybot)Used to surface and link websites in Perplexity search results.
Perplexity CitationsPerplexity-UserMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Perplexity-User/1.0; +https://perplexity.ai/perplexity-user)Used for user actions in Perplexity when visiting web pages to answer questions.
NameUser AgentFull User AgentDescription
Cohere AIcohere-aiMozilla/5.0 (compatible; cohere-ai/1.0; +http://www.cohere.ai/bot.html)Collects textual data for Cohere’s language models, helping refine large-scale text generation.
NameUser AgentFull User AgentDescription
Mistral AI CitationsMistralAI-UserMozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; MistralAI-User/1.0; +https://docs.mistral.ai/robots)Used for user actions in LeChat when visiting web pages to answer questions.
NameUser AgentFull User AgentDescription
DeepSeekDeepSeekBotMozilla/5.0 (compatible; DeepSeekBot/1.0; +http://www.deepseek.com/bot.html)Used to crawl content for training DeepSeek’s generative AI models.
NameUser AgentFull User AgentDescription
Grok BotGrokBotGrokBot/1.0 (+https://x.ai)Used for training Grok AI.
Grok SearchxAI-GrokxAI-Grok/1.0 (+https://grok.com)Used for Grok’s search capabilities.
Grok Deep SearchGrok-DeepSearchGrok-DeepSearch/1.0 (+https://x.ai)Used for Grok’s advanced search capabilities.

Step 3: Finish

Once implemented on your end, view your crawler logs in the Promptwatch dashboard.