About
Twitterbot is X's (formerly Twitter) web crawler used to fetch link previews, Open Graph metadata, and media for cards shown in posts. It also contributes to X's broader data collection infrastructure. Since Elon Musk's acquisition of Twitter, concerns have grown about X using crawled data to train Grok, X's in-house AI assistant. Many publishers block Twitterbot to limit exposure to X's AI training pipeline.
Purpose
Link card previews, Open Graph metadata fetching, and AI training data collection for Grok
User Agent String
Twitterbot
How to Control in robots.txt
🚫 Block Twitterbot
User-agent: Twitterbot Disallow: /
✅ Allow Twitterbot
User-agent: Twitterbot Allow: /
⚠️ Twitterbot has been observed ignoring robots.txt directives. You may need server-level blocking (e.g., firewall rules or user-agent filtering) to effectively prevent access.
Is Twitterbot crawling your site?
Run a free scan to check if X (formerly Twitter)'s crawler is accessing your website.
Check if Twitterbot is crawling YOUR site →