Cake day: July 16th, 2025

  • ell1e@leminal.space to Technology@lemmy.world: I was wrong about robots.txt
    53 minutes ago

    You look up what Googlebot does. No AI.

    The page seems written to suggest that, but it doesn’t explicitly say the other bots can’t feed into some other sort of AI training. It would be in Google’s interest to mislead users here.

    Edit: I found a quote where it says Googlebot does both in one: “Google-Extended doesn’t have a separate HTTP request user agent string. Crawling is done with existing Google user agent […]” and I guess Cloudflare doesn’t trust Google to abide by the access controls. That seems sensible to me.
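
To make the quoted point concrete, here is a minimal sketch of what "no separate user agent string" means in practice. It assumes a made-up robots.txt and uses example.com as a placeholder URL; neither comes from Google's or Cloudflare's pages. Python's standard `urllib.robotparser` stands in for a compliant crawler that matches rules by token. Google-Extended only exists as a token inside robots.txt, so the site cannot tell the two purposes apart at request time and has to trust the crawler to apply the rule itself.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: allow search crawling, opt out of Google-Extended.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /

User-agent: Googlebot
Allow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Placeholder URL, purely for illustration.
url = "https://example.com/page"

# A compliant parser treats the two tokens as separate rule groups.
googlebot_allowed = parser.can_fetch("Googlebot", url)
extended_allowed = parser.can_fetch("Google-Extended", url)

print("Googlebot allowed:", googlebot_allowed)
print("Google-Extended allowed:", extended_allowed)
```

This only shows how the rules are read, not whether any crawler honors them. Since the HTTP request would carry the same existing Google user agent either way, a server-side block cannot enforce the Google-Extended rule on its own.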