Google-Extended
Google's crawler specifically used for training Generative AI models, separate from traditional Googlebot.
What is Google-Extended?
Google-Extended is Google's specialized crawler for collecting data to train its Generative AI models, including Gemini and Bard. Unlike Googlebot (which powers Google Search rankings), Google-Extended data is used exclusively for AI training. Site owners can block Google-Extended in robots.txt without affecting their traditional SEO rankings. This gives publishers control: allow Google Search indexing while blocking AI training, or allow both for maximum visibility.
How Qwairy Makes This Actionable
Qwairy tracks Google-Extended crawler activity separately from regular Googlebot visits. Monitor how frequently Google's AI training crawler accesses your content and which pages it prioritizes for generative AI data collection.
Frequently Asked Questions
Related Terms
AI Crawler
Indexing robot used by AI companies to collect data intended to train or feed their models.
robots.txt
Text file placed at the root of a website to indicate to indexing robots which pages to explore or avoid.
Gemini
Family of language models developed by Google, integrated into its search and AI products.