Web Traffic
Good Bot Traffic
14%
Source: Imperva 2025 Bad Bot Report As of: 2025-04

14% good bots: Googlebot, Bingbot, monitors, AI training crawlers that respect robots.txt.

What it measures

Good bots operate within accepted web norms — they identify themselves accurately, respect robots.txt, observe crawl-delay directives, and provide reciprocal value to site owners. At 14% of all web traffic they include:

Why humans should care

Good bots are the invisible infrastructure of the open web. Without Googlebot your content doesn't exist in search results. Without uptime monitors, outages go undetected for hours. The 14% figure understates their economic importance — one Googlebot visit can drive thousands of subsequent human visits.

AI training crawlers: contested category

AI training crawlers (GPTBot, ClaudeBot, Google-Extended) are classified as good bots when they identify themselves and respect robots.txt. But many publishers block them, arguing that training use doesn't provide the referral reciprocity that search indexing does. The distinction is increasingly contested legally and economically.

What happens next

The good bot share is being squeezed: AI training crawlers blur the boundary between good and bad by consuming content without providing referral reciprocity. As more publishers block AI crawlers via robots.txt, the definition of 'good bot' will be legally and economically contested — especially as crawler compensation models begin to emerge.

Pros — Benefits

Cons — Risks

What to watch for

What you can do

  • Verify your robots.txt explicitly allows Googlebot and Bingbot
  • Check Google Search Console for crawl errors and crawl budget waste
  • Decide your policy on AI training crawlers and encode it explicitly in robots.txt
  • Whitelist known good bot IP ranges in your WAF to prevent false-positive blocking
  • Monitor crawl budget in Search Console; excessive bad bot traffic wastes it
  • Set up uptime monitoring if you don't have it — 5-minute checks minimum
  • Develop industry standards for AI crawler reciprocity and compensation
  • Support Web Monetization proposals for crawler-based content compensation
  • Fund research on sustainable web crawling economics

Data & methodology

Source
Imperva 2025 Bad Bot Report
Classification
Good bots identified by verifying claimed user-agent against known legitimate bot IP ranges
Update cadence
Annual — April 2025 report
Dashboard anchor
Live stat on dashboard

Related stats