Skip to content
GoogleRespects robots.txtAI Research

How to Block Gemini-Deep-Research: Google's AI Research Crawler

Gemini-Deep-Research crawls your site when a Gemini Advanced subscriber runs a Deep Research query. It reads dozens of pages to compile multi-page research reports. Not the training crawler — that's Google-Extended.

Updated March 2026

Google's AI Crawler Family

Google operates multiple AI-specific crawlers. Each serves a different purpose and can be blocked independently:

GooglebotMain search indexer. Never block this unless you want to disappear from Google Search.
Google-ExtendedAI training crawler. Collects data to train and improve Gemini models.
Gemini-Deep-ResearchDeep Research retrieval. Fetches pages for multi-page research reports in Gemini Advanced.
Google-NotebookLMNotebookLM retrieval. Fetches pages when users add URLs as sources in NotebookLM.

What Is Deep Research?

Deep Research is a feature available to Gemini Advanced subscribers ($19.99/month). When a user submits a complex research question, Gemini autonomously browses the web for several minutes — reading dozens of pages, following relevant links, and synthesizing findings into a structured multi-page report with citations.

Common use cases include competitive analysis, academic literature reviews, market studies, and detailed topic investigations. The output is a comprehensive report visible only to the user who requested it.

Gemini-Deep-Research is the crawler that powers this feature. It fetches pages on demand during a research session, reading content in real-time to gather information. It is not a bulk training data harvester.

How to Block Gemini-Deep-Research

Add this to your robots.txt:

robots.txtBlock Deep Research only
User-agent: Gemini-Deep-Research
Disallow: /

To block all Google AI crawlers while keeping your search rankings:

robots.txtBlock all Google AI activity
# Keep Google Search indexing
User-agent: Googlebot
Allow: /

# Block AI training
User-agent: Google-Extended
Disallow: /

# Block Deep Research
User-agent: Gemini-Deep-Research
Disallow: /

# Block NotebookLM
User-agent: Google-NotebookLM
Disallow: /

Your Google Search rankings are safe

Blocking Gemini-Deep-Research, Google-Extended, or Google-NotebookLM has zero effect on your Google Search rankings. Only Googlebot controls search indexing.

Who Should Block Gemini-Deep-Research?

📊
Premium research publishers
If you sell research reports, market analysis, or proprietary data, Deep Research can compile your findings into a free report for the user — bypassing your paywall.
🎯
Competitive intelligence sites
Deep Research is specifically designed for competitive analysis. If your content is a competitive moat, it's a prime target for Deep Research queries.
⚖️
Legal or compliance-sensitive content
Regulated content that shouldn't be processed by external AI systems. Deep Research compiles content across sources, potentially combining sensitive information in new ways.

What Blocking Does (and Doesn't) Do

What it stops
  • • Your pages being read during Deep Research sessions
  • • Your content appearing in Deep Research reports
  • • Gemini Advanced users sourcing your site for research
What it doesn't stop
  • • Google-Extended training crawls (block separately)
  • • NotebookLM crawls (block Google-NotebookLM separately)
  • • Google Search rankings (completely unaffected)
  • • Gemini's existing knowledge from training data

Frequently Asked Questions

How is Deep Research different from a regular Gemini query?

A regular Gemini query uses the model's training data to answer. Deep Research actively browses the web in real-time — reading dozens of pages over several minutes — to compile a comprehensive report. The Gemini-Deep-Research crawler is what performs this live browsing.

Does Deep Research crawl my entire site?

Not systematically. It reads specific pages relevant to the user's research query, following links where relevant. It's not an index builder — it's a targeted reader. But for a single research session, it may read multiple pages from your site if they're all relevant to the query.

Is Deep Research content used for model training?

Google has not stated that content retrieved by Gemini-Deep-Research feeds back into Gemini model training. That's Google-Extended's purpose. However, Google's internal data practices are not fully transparent, so publishers who want certainty should block both crawlers.

Can I block Deep Research for specific paths only?

Yes. Instead of Disallow: / you can target specific paths: Disallow: /research/ or Disallow: /premium/ — allowing public content to be used in research reports while protecting gated content.

Related Guides

Is your site protected from AI bots?

Run a free scan to check your robots.txt, meta tags, and overall AI readiness score.

Related Guides