Skip to content

Commit 1b15d10

Browse files
committed
robots.txt excludes known AI crawlers
1 parent 78daf97 commit 1b15d10

File tree

1 file changed

+44
-0
lines changed

1 file changed

+44
-0
lines changed

public/robots.txt

Lines changed: 44 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,46 @@
1+
# List of AI crawlers from https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt
2+
User-agent: AI2Bot
3+
User-agent: Ai2Bot-Dolma
4+
User-agent: Amazonbot
5+
User-agent: anthropic-ai
6+
User-agent: Applebot
7+
User-agent: Applebot-Extended
8+
User-agent: Bytespider
9+
User-agent: CCBot
10+
User-agent: ChatGPT-User
11+
User-agent: Claude-Web
12+
User-agent: ClaudeBot
13+
User-agent: cohere-ai
14+
User-agent: Diffbot
15+
User-agent: DuckAssistBot
16+
User-agent: FacebookBot
17+
User-agent: facebookexternalhit
18+
User-agent: FriendlyCrawler
19+
User-agent: Google-Extended
20+
User-agent: GoogleOther
21+
User-agent: GoogleOther-Image
22+
User-agent: GoogleOther-Video
23+
User-agent: GPTBot
24+
User-agent: iaskspider/2.0
25+
User-agent: ICC-Crawler
26+
User-agent: ImagesiftBot
27+
User-agent: img2dataset
28+
User-agent: ISSCyberRiskCrawler
29+
User-agent: Kangaroo Bot
30+
User-agent: Meta-ExternalAgent
31+
User-agent: Meta-ExternalFetcher
32+
User-agent: OAI-SearchBot
33+
User-agent: omgili
34+
User-agent: omgilibot
35+
User-agent: PerplexityBot
36+
User-agent: PetalBot
37+
User-agent: Scrapy
38+
User-agent: Sidetrade indexer bot
39+
User-agent: Timpibot
40+
User-agent: VelenPublicWebCrawler
41+
User-agent: Webzio-Extended
42+
User-agent: YouBot
43+
Disallow: /
44+
145
User-agent: *
246
Allow: /

0 commit comments

Comments
 (0)