Google-Extended

Last updated 1 hour ago.

CompliantAI Bot

What is Google-Extended?

About

Google-Extended is a standalone product token that web publishers can use to manage whether content Google crawls from their sites may be used for training future generations of Gemini models that power Gemini Apps and Vertex AI API for Gemini and for grounding (providing content from the Google Search index to the model at prompt time to improve factuality and relevancy) in Gemini Apps and Grounding with Google Search on Vertex AI. Google-Extended does not impact a site's inclusion in Google Search nor is it used as a ranking signal in Google Search. NOTE: Google-Extended doesn't have a separate HTTP request user agent string. Crawling is done with existing Google user agent strings; the robots.txt user-agent token is used in a control capacity.

Operator

Google

See how often Google-Extended visits your website by setting up Spyglasses analytics. Set up tracking

Expected Behavior

AI model trainers systematically crawl websites to collect data for training and improving AI models. These bots read and analyze web content to understand language patterns, gather factual information, and build knowledge that will be incorporated into AI systems. The data they collect becomes part of the training dataset used to teach AI models how to understand and generate human-like text.

Should I Block Google-Extended?

The decision to block AI model trainers depends on how your business uses the content on your site. If you create original creative work, proprietary research, or paid content that gives you a competitive advantage, you should consider blocking these bots to protect your intellectual property. However, if your content helps potential customers discover your products or services, allowing AI models to access it can help you reach new audiences when people ask AI assistants for recommendations.

For detailed guidance on when to block AI model trainers, including considerations for different types of businesses and content, read our comprehensive guide.

Learn more about blocking AI model trainers

Recommended Solution

Instead of manually managing robots.txt rules, use Spyglasses to automatically detect and manage Google-Extended traffic with real-time analytics and flexible blocking rules.

Get Automated Bot Management

Manage Google-Extended Traffic with Spyglasses

Get real-time alerts when bots visit your site, automatically generate robots.txt rules, and integrate bot traffic data with your existing analytics tools.

Start Free Trial