How to Block Gemini-Deep-Research: Google's AI Research Crawler
Gemini-Deep-Research crawls your site when a Gemini Advanced subscriber runs a Deep Research query. It reads dozens of pages to compile multi-page research reports. Not the training crawler — that's Google-Extended.
Updated March 2026
Google's AI Crawler Family
Google operates multiple AI-specific crawlers. Each serves a different purpose and can be blocked independently:
GooglebotMain search indexer. Never block this unless you want to disappear from Google Search.Google-ExtendedAI training crawler. Collects data to train and improve Gemini models.Gemini-Deep-ResearchDeep Research retrieval. Fetches pages for multi-page research reports in Gemini Advanced.Google-NotebookLMNotebookLM retrieval. Fetches pages when users add URLs as sources in NotebookLM.What Is Deep Research?
Deep Research is a feature available to Gemini Advanced subscribers ($19.99/month). When a user submits a complex research question, Gemini autonomously browses the web for several minutes — reading dozens of pages, following relevant links, and synthesizing findings into a structured multi-page report with citations.
Common use cases include competitive analysis, academic literature reviews, market studies, and detailed topic investigations. The output is a comprehensive report visible only to the user who requested it.
Gemini-Deep-Research is the crawler that powers this feature. It fetches pages on demand during a research session, reading content in real-time to gather information. It is not a bulk training data harvester.
How to Block Gemini-Deep-Research
Add this to your robots.txt:
User-agent: Gemini-Deep-Research Disallow: /
To block all Google AI crawlers while keeping your search rankings:
# Keep Google Search indexing User-agent: Googlebot Allow: / # Block AI training User-agent: Google-Extended Disallow: / # Block Deep Research User-agent: Gemini-Deep-Research Disallow: / # Block NotebookLM User-agent: Google-NotebookLM Disallow: /
Your Google Search rankings are safe
Blocking Gemini-Deep-Research, Google-Extended, or Google-NotebookLM has zero effect on your Google Search rankings. Only Googlebot controls search indexing.
Who Should Block Gemini-Deep-Research?
What Blocking Does (and Doesn't) Do
- • Your pages being read during Deep Research sessions
- • Your content appearing in Deep Research reports
- • Gemini Advanced users sourcing your site for research
- • Google-Extended training crawls (block separately)
- • NotebookLM crawls (block Google-NotebookLM separately)
- • Google Search rankings (completely unaffected)
- • Gemini's existing knowledge from training data
Frequently Asked Questions
How is Deep Research different from a regular Gemini query?
A regular Gemini query uses the model's training data to answer. Deep Research actively browses the web in real-time — reading dozens of pages over several minutes — to compile a comprehensive report. The Gemini-Deep-Research crawler is what performs this live browsing.
Does Deep Research crawl my entire site?
Not systematically. It reads specific pages relevant to the user's research query, following links where relevant. It's not an index builder — it's a targeted reader. But for a single research session, it may read multiple pages from your site if they're all relevant to the query.
Is Deep Research content used for model training?
Google has not stated that content retrieved by Gemini-Deep-Research feeds back into Gemini model training. That's Google-Extended's purpose. However, Google's internal data practices are not fully transparent, so publishers who want certainty should block both crawlers.
Can I block Deep Research for specific paths only?
Yes. Instead of Disallow: / you can target specific paths: Disallow: /research/ or Disallow: /premium/ — allowing public content to be used in research reports while protecting gated content.
Related Guides
Is your site protected from AI bots?
Run a free scan to check your robots.txt, meta tags, and overall AI readiness score.