-
-
Notifications
You must be signed in to change notification settings - Fork 346
docs: update llms.txt list #386
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
The latest updates on your projects. Learn more about Vercel for GitHub. 1 Skipped Deployment
|
WalkthroughUpdated Changes
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~3 minutes Pre-merge checks and finishing touches✅ Passed checks (3 passed)
✨ Finishing touches🧪 Generate unit tests (beta)
Comment |
451860a to
a822a35
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
🧹 Nitpick comments (1)
README.md (1)
912-912: Move French entry to “International” or translate description to English
Replace the French description with an English one (or relocate under an International section):-  **[Plombier sur Lyon](https://chapuisplomberie.fr/)** - Vous avez une urgence ? Besoin d'un dépannage ? Votre plombier sur Lyon disponible pour devis gratuit avec intervention au plus rapide après votre appel. <sub>[llms.txt](https://chapuisplomberie.fr/llms.txt)</sub> +  **[Plombier sur Lyon](https://chapuisplomberie.fr/)** - Emergency plumbing services in Lyon with free quotes and fast response. <sub>[llms.txt](https://chapuisplomberie.fr/llms.txt)</sub>
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (1)
README.md(4 hunks)
🧰 Additional context used
📓 Path-based instructions (1)
{**/README.md,docs/**}
📄 CodeRabbit inference engine (.cursor/rules/shared.mdc)
Include proper documentation
Files:
README.md
🔇 Additional comments (3)
README.md (3)
776-776: LGTM: “Clouve” entry added under Infrastructure & Cloud; both llms.txt and llms-full.txt endpoints return HTTP 200.
659-659: LGTM: llms.txt URL returns 200 OK Verified that https://xmcp.dev/llms.txt responds with HTTP 200; ready to merge.
46-46: Neutralize Digital Inning entry in README.md
Verified llms endpoints return HTTP 200/302; updated marketing language to a factual directory-style description.--  **[Digital Inning](https://digitalinning.com/)** - Discover top-tier digital marketing services in India with Digital Inning, the leading agency based in Rajkot. Elevate your online presence today. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub> +-  **[Digital Inning](https://digitalinning.com/)** - Digital marketing agency based in Rajkot, India. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub>
4620093 to
84b17ae
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
🧹 Nitpick comments (5)
README.md (5)
46-46: Trim promotional copy for neutrality.Keep descriptions neutral and concise to match the repo’s style.
- -  **[Digital Inning](https://digitalinning.com/)** - Discover top-tier digital marketing services in India with Digital Inning, the leading agency based in Rajkot. Elevate your online presence today. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub> + -  **[Digital Inning](https://digitalinning.com/)** - Digital marketing agency based in Rajkot, India. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub>
245-248: “business operations” isn’t part of the documented category taxonomy.Either (a) add “business operations” to the Categories section, or (b) move Hinto AI under an existing category (e.g., “automation workflow” or “ai ml”) to stay consistent.
916-917: Consider moving non‑English listing to “international”.This French-language service might be better placed under the International section for consistency.
953-954: Misclassification: this is an app, not a personal/portfolio site.Suggest moving “Astroline - astrology app” from “personal” to “other” (or another suitable section) to align with the submission guidelines.
959-961: Clarify display name and neutralize tone.Use a clearer name referencing the brand and keep copy neutral.
- -  **[All-in](https://brandefense.io)** - Brandefense platform is a proactive digital risk protection solution for organizations. Our AI-driven technology constantly scans the dark, deep, and surface. <sub>[llms.txt](https://brandefense.io/llms.txt)</sub> + -  **[Brandefense — All‑in](https://brandefense.io)** - Digital risk protection platform. <sub>[llms.txt](https://brandefense.io/llms.txt)</sub>
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (1)
README.md(6 hunks)
🧰 Additional context used
🔍 Remote MCP Linear
Additional context from project issue tracker (relevant to PR #386)
- THE-1133 — "Add vercel llms.txt" — status: Done (references https://sdk.vercel.ai/llms.txt).
- THE-1134 — "Add upsun.com" — status: Todo (references https://devcenter.upsun.com/posts/llms-introduction/).
- THE-1126 — "Add link to the standard documentation for llms.txt" — status: Done.
- THE-1125 — "https://llmstxthub.com/llms.txt is returning a 500" — status: Done (past availability issue with the llms.txt endpoint).
- THE-1194 — "Add number of websites on the hompage list" — status: Backlog.
- THE-1112 — "Add email to subscribe does not work" — status: Done.
All items above were retrieved from the project's Linear issue list.
Suggested quick checks for this docs PR (based on the issues above)
- Verify each newly added external link (llms.txt or llms-full.txt) resolves and serves the expected llms.txt content (watch for availability/500 errors).
- Confirm entries with promotional text (e.g., agency descriptions) match repository contribution guidelines or expectations.
🔇 Additional comments (3)
README.md (3)
663-663: xmcp entry looks good.Formatting is consistent; link placement is correct.
780-781: Clouve entry looks good.Both llms.txt and llms-full.txt are provided and formatted consistently.
46-46: Quick link check — one redirect foundAll listed llms.txt endpoints returned HTTP 200 and text/plain except https://digitalinning.com/llms-full.txt — returned HTTP 302 (no Content-Type on HEAD). Update the README to point to the final target (follow the redirect) or confirm the redirect is intentional.
0f64290 to
70b7627
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (1)
README.md(9 hunks)
🧰 Additional context used
🔍 Remote MCP Linear
Summary of additional context relevant to PR #386 (docs: update llms.txt list)
- THE-1133 — "Add vercel llms.txt": target URL https://sdk.vercel.ai/llms.txt; status: Done.
- THE-1134 — "Add upsun.com": target URL https://devcenter.upsun.com/posts/llms-introduction/; status: Todo (not yet added).
- THE-1126 — "Add link to the standard documentation for llms.txt": status: Done. (Confirms standard docs link was added previously.)
- THE-1125 — "https://llmstxthub.com/llms.txt is returning a 500": status: Done (past 500 error resolved). Suggest verifying newly added links don't suffer similar availability issues.
- THE-1194 — "Add number of websites on the homepage list": status: Backlog (not relevant to this PR but indicates planned UX metric change).
- THE-1112 — "Add email to subscribe does not work": status: Done (subscription bug fixed previously).
Notes for reviewers (derived from above):
- Verify each newly added external llms.txt / llms-full.txt link resolves and serves expected content (watch for 500s). (See THE-1125, THE-1133, THE-1134.)
90fddf0 to
79ca287
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (1)
README.md(17 hunks)
🧰 Additional context used
🔍 Remote MCP Linear
Additional context for PR #386 (concise)
- THE-1133 — "Add vercel llms.txt": target URL https://sdk.vercel.ai/llms.txt — status: Done.
- THE-1134 — "Add upsun.com": target URL https://devcenter.upsun.com/posts/llms-introduction/ — status: Todo (not yet added).
- THE-1126 — "Add link to the standard documentation for llms.txt" — status: Done (standard docs link already added).
- THE-1125 — "https://llmstxthub.com/llms.txt is returning a 500" — was reported as a bug and is marked Done (incident resolved); reviewers should still verify newly added external links for availability/HTTP errors.
- THE-1194 — "Add number of websites on the homepage list" — Backlog (UX metric change; not part of this PR).
- THE-1112 — "Add email to subscribe does not work" — reported and marked Done (subscription bug fixed previously).
Recommended reviewer actions (derived from above):
- Verify each newly added llms.txt / llms-full.txt link resolves and serves expected llms.txt content (watch for 5xx/4xx).
| -  **[Digital Inning](https://digitalinning.com/)** - Discover top-tier digital marketing services in India with Digital Inning, the leading agency based in Rajkot. Elevate your online presence today. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub> | ||
| -  **[Maison Fervenza | Wedding](https://www.fervenza.fr/)** - Authentic luxury weddings in France with exclusive access to hidden châteaux, private museums & Michelin chefs. No templates, just your truth. <sub>[llms.txt](https://www.fervenza.fr/llms.txt) • [llms-full.txt](https://www.fervenza.fr/llms-full.txt)</sub> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fix Digital Inning’s llms-full.txt link
https://digitalinning.com/llms-full.txt still responds with HTTP 301 instead of returning the file directly, so the redirect issue called out in the earlier review remains unresolved. Please update the entry to point at the final URL that serves the llms-full.txt with an HTTP 200 response.
🤖 Prompt for AI Agents
In README.md around lines 46-47 the Digital Inning entry uses
https://digitalinning.com/llms-full.txt which currently responds with an HTTP
301 redirect; update that link to the final destination URL that returns HTTP
200 (replace the redirected URL in the markdown with the direct file URL),
verify the new URL returns 200 (e.g., curl -I) and update both the visible link
and the sublink reference so the README points directly to the non-redirecting
llms-full.txt.
f81c9ff to
cbe0b9d
Compare
cbe0b9d to
651dabf
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 3
♻️ Duplicate comments (1)
README.md (1)
46-47: Digital Inning llms-full.txt still redirects — point to final 200 URLReplace the redirected llms-full.txt link with the final destination that serves 200 directly. Keep llms.txt as-is.
Run to confirm redirect and get final URL:
#!/bin/bash set -euo pipefail curl -sS -I https://digitalinning.com/llms-full.txt | sed -n '1p;/^location:/Ip'
🧹 Nitpick comments (3)
README.md (3)
247-251: Align “business operations” with documented category schemaEither add “business operations” to the Categories/Guidelines sections or reclassify entries under existing categories to avoid drift.
979-979: Rename label to “Brandefense” for clarityUse the brand name rather than “All‑in” to improve searchability and consistency with favicon/domain.
- -  **[All-in](https://brandefense.io)** - Brandefense platform is a proactive digital risk protection solution for organizations. Our AI-driven technology constantly scans the dark, deep, and surface. <sub>[llms.txt](https://brandefense.io/llms.txt)</sub> + -  **[Brandefense](https://brandefense.io)** - Proactive digital risk protection platform. <sub>[llms.txt](https://brandefense.io/llms.txt)</sub>
254-256: Strip HTML/CSS artifacts from descriptions across multiple entriesThe codebase contains scraped HTML/CSS fragments embedded in entry descriptions. Clean these for consistency:
- Line 254 (AdGate Media): Remove
<meta charset="UTF-8">- Line 312 (agent.ai): Remove
visibility: hidden;- Line 317 (AKOOL): Remove
--bprogress-color: #8D66FF;- Line 338 (Arpeggi): Remove
<meta charset="UTF-8" />- Lines 789, 790, 857, 862: Additional
<meta>and<!DOCTYPE>tags to remove
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (1)
README.md(19 hunks)
🧰 Additional context used
🔍 Remote MCP Linear
Summary — additional repo/issue context relevant to reviewing PR #386:
- THE-1133: "Add vercel llms.txt" — status: Done; URL: https://sdk.vercel.ai/llms.txt.
- THE-1134: "Add upsun.com" — status: Todo; target URL: https://devcenter.upsun.com/posts/llms-introduction/ (not yet added).
- THE-1126: "Add link to the standard documentation for llms.txt" — status: Done (standard docs link already added).
- THE-1125: "https://llmstxthub.com/llms.txt is returning a 500" — status: Done (incident resolved); reviewers should still verify newly added external links for 4xx/5xx.
- THE-1112: "Add email to subscribe does not work" — status: Done (subscription flow fixed).
- THE-1194: "Add number of websites on the homepage list" — status: Backlog (not part of this PR).
Reviewer action suggested (based on above): verify that each newly added llms.txt / llms-full.txt link in README resolves and serves expected llms.txt content (check for 200 and correct plain-text format).
| -  **[Digital Inning](https://digitalinning.com/)** - Discover top-tier digital marketing services in India with Digital Inning, the leading agency based in Rajkot. Elevate your online presence today. <sub>[llms.txt](https://digitalinning.com/llms.txt) • [llms-full.txt](https://digitalinning.com/llms-full.txt)</sub> | ||
| -  **[Maison Fervenza | Wedding](https://www.fervenza.fr/)** - Authentic luxury weddings in France with exclusive access to hidden châteaux, private museums & Michelin chefs. No templates, just your truth. <sub>[llms.txt](https://www.fervenza.fr/llms.txt) • [llms-full.txt](https://www.fervenza.fr/llms-full.txt)</sub> | ||
| -  **[Upstaff](https://upstaff.com)** - Upstaff is a technology partner with expertise in AI, ML, Web3, Software, and Data Engineering. Hire faster, develop smarter with Upstaff. <sub>[llms.txt](https://upstaff.com/llms.txt)</sub> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🧩 Analysis chain
Quick link audit for newly added llms.txt endpoints
Validate status (200) and content-type (text/plain|text/markdown), and surface any 30x/40x/50x.
Also applies to: 249-251, 255-256, 278-278, 287-287, 341-341, 401-401, 674-674, 690-690, 747-747, 795-796, 806-806, 818-818, 830-830, 848-848, 913-913, 973-973, 979-979
🏁 Script executed:
#!/bin/bash
set -euo pipefail
check() {
local url="$1"
local out
out="$(curl -sS -o /dev/null -w 'HTTP %{http_code} CT %{content_type}\n' "$url" || true)"
printf '%-80s %s\n' "$url" "$out"
}
urls=(
# agency services
https://digitalinning.com/llms.txt
https://digitalinning.com/llms-full.txt
https://www.fervenza.fr/llms.txt
https://www.fervenza.fr/llms-full.txt
https://upstaff.com/llms.txt
# business operations
https://ca.auto-scrap-yard.com/llms.txt
https://hintoai.com/llms.txt
# content media
https://quodat.com/llms.txt
https://quodat.com/llms-full.txt
https://showswatched.com/llms.txt
https://showswatched.com/llms-full.txt
# data analytics
https://igexporter.com/llms.txt
https://sellersprite.ai/llms.txt
https://sellersprite.ai/llms-full.txt
# dev tools (sample changed)
https://docs.aspose.com/words/llms.txt
https://docs.aspose.com/words/llms-full.txt
https://configcat.com/docs/llms.txt
https://configcat.com/docs/llms-full.txt
https://xmcp.dev/llms.txt
# ecommerce retail
https://helloskin.hu/llms.txt
https://helloskin.hu/llms-full.txt
# finance fintech
https://www.jedox.com/llms.txt
https://unquoting.com/llms.txt
https://unquoting.com/llms-full.txt
# infrastructure cloud
https://www.clouve.com/llms.txt
https://www.clouve.com/llms-full.txt
https://payperfax.com/llms.txt
# international
https://meinbildungsurlaub.de/llms.txt
https://www.grupocfdeveloper.com/llms.txt
# marketing sales
https://adstxt.guru/llms.txt
https://adstxt.guru/llms-full.txt
# other/personal/security
https://kig.wiki/llms.txt
https://kig.wiki/llms-full.txt
https://astroline.today/llms.txt
https://brandefense.io/llms.txt
)
for u in "${urls[@]}"; do check "$u"; doneLength of output: 3871
Fix broken llms.txt endpoints in README.md
The following newly added endpoints returned errors instead of HTTP 200 responses:
https://digitalinning.com/llms-full.txt→ HTTP 302 redirecthttps://www.fervenza.fr/llms-full.txt→ HTTP 301 redirecthttps://showswatched.com/llms-full.txt→ HTTP 404https://helloskin.hu/llms-full.txt→ HTTP 404https://www.clouve.com/llms.txt→ HTTP 404https://www.clouve.com/llms-full.txt→ HTTP 404https://payperfax.com/llms.txt→ HTTP 404
Remove or correct these entries before merging to maintain link integrity across all listed instances (lines 46-48, 249-251, 255-256, 278, 287, 341, 401, 674, 690, 747, 795-796, 806, 818, 830, 848, 913, 973, 979).
🤖 Prompt for AI Agents
In README.md around lines 46 to 48 (and other referenced locations: 249-251,
255-256, 278, 287, 341, 401, 674, 690, 747, 795-796, 806, 818, 830, 848, 913,
973, 979), several llms.txt/llms-full.txt links return non-200 responses
(302/301 redirects or 404s); locate each listed URL
(digitalinning.com/llms-full.txt, www.fervenza.fr/llms-full.txt,
showswatched.com/llms-full.txt, helloskin.hu/llms-full.txt,
www.clouve.com/llms.txt, www.clouve.com/llms-full.txt, payperfax.com/llms.txt)
and either replace them with the correct working URL that returns HTTP 200,
update them to the canonical redirected target if appropriate, or remove the
entry entirely; after changes, run an HTTP check on all README llms links to
ensure each returns 200 and update the corresponding README lines accordingly
before merging.
| -  **[Unquoting Insurance Company](https://unquoting.com)** - Sells insurance to protect you and your assets in all 50 states. | ||
| Licensed insurance school offering pre-licensing education in the state of Ohio. <sub>[llms.txt](https://unquoting.com/llms.txt) • [llms-full.txt](https://unquoting.com/llms-full.txt)</sub> | ||
| -  **[VeChain](https://vechain.org)** - Discover VeChainThor blockchain: Sustainable, fast, scalable, secure, and EVM-compatible. Join us in leading the future of Web3. <sub>[llms.txt](https://docs.vechain.org/llms.txt)</sub> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fix broken bullet for “Unquoting Insurance Company”
The description spills to a new line, breaking the markdown list. Make it a single bullet line.
--  **[Unquoting Insurance Company](https://unquoting.com)** - Sells insurance to protect you and your assets in all 50 states.
-Licensed insurance school offering pre-licensing education in the state of Ohio. <sub>[llms.txt](https://unquoting.com/llms.txt) • [llms-full.txt](https://unquoting.com/llms-full.txt)</sub>
+-  **[Unquoting Insurance Company](https://unquoting.com)** - Sells insurance in all 50 states. Licensed insurance school offering pre‑licensing education in Ohio. <sub>[llms.txt](https://unquoting.com/llms.txt) • [llms-full.txt](https://unquoting.com/llms-full.txt)</sub>📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
| -  **[Unquoting Insurance Company](https://unquoting.com)** - Sells insurance to protect you and your assets in all 50 states. | |
| Licensed insurance school offering pre-licensing education in the state of Ohio. <sub>[llms.txt](https://unquoting.com/llms.txt) • [llms-full.txt](https://unquoting.com/llms-full.txt)</sub> | |
| -  **[VeChain](https://vechain.org)** - Discover VeChainThor blockchain: Sustainable, fast, scalable, secure, and EVM-compatible. Join us in leading the future of Web3. <sub>[llms.txt](https://docs.vechain.org/llms.txt)</sub> | |
| -  **[Unquoting Insurance Company](https://unquoting.com)** - Sells insurance in all 50 states. Licensed insurance school offering pre‑licensing education in Ohio. <sub>[llms.txt](https://unquoting.com/llms.txt) • [llms-full.txt](https://unquoting.com/llms-full.txt)</sub> | |
| -  **[VeChain](https://vechain.org)** - Discover VeChainThor blockchain: Sustainable, fast, scalable, secure, and EVM-compatible. Join us in leading the future of Web3. <sub>[llms.txt](https://docs.vechain.org/llms.txt)</sub> |
🤖 Prompt for AI Agents
In README.md around lines 770 to 772 the bullet for "Unquoting Insurance
Company" is broken because the description is split across lines; collapse the
broken lines so each list item is a single line (merge the description into the
same line as the bullet) and ensure the markdown list syntax remains "- [text] -
description" for that entry so it no longer spills to a new line.
| -  **[Clouve](https://www.clouve.com)** - Clouve connects businesses with enterprise-ready applications while empowering developers to monetize their solutions through our secure platform. <sub>[llms.txt](https://www.clouve.com/llms.txt) • [llms-full.txt](https://www.clouve.com/llms-full.txt)</sub> | ||
| -  **[Community Helm Charts](https://community-charts.github.io)** - Discover community-driven Helm charts for open-source tools lacking official support. Deploy apps effortlessly on Kubernetes. <sub>[llms.txt](https://community-charts.github.io/llms.txt) • [llms-full.txt](https://community-charts.github.io/llms-full.txt)</sub> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🧩 Analysis chain
Verify Clouve endpoints return 200 and text/ (not HTML)*
Confirm both llms.txt and llms-full.txt serve 200 with a text content-type.
🏁 Script executed:
#!/bin/bash
set -euo pipefail
for u in https://www.clouve.com/llms.txt https://www.clouve.com/llms-full.txt; do
echo "== $u"
curl -sS -o /dev/null -w 'HTTP %{http_code} CT %{content_type}\n' -L "$u"
doneLength of output: 148
Remove or correct Clouve endpoints—both URLs return 404
Both llms.txt and llms-full.txt endpoints at https://www.clouve.com/ return HTTP 404 with HTML content-type instead of serving text content. Either correct the URLs if they are wrong, or remove the Clouve entry from the README until these endpoints are functional.
🤖 Prompt for AI Agents
In README.md around lines 795-796 the Clouve entry points to
https://www.clouve.com/llms.txt and https://www.clouve.com/llms-full.txt which
return HTTP 404 and HTML instead of the expected text files; either update these
URLs to the correct working endpoints or remove the entire Clouve line (favicon,
link and sublinks) from the README until the endpoints are fixed; if you update
URLs verify they return 200 with text/plain and update the sublink targets
accordingly.
This PR updates the llms.txt list in README.md based on the current state of llms.txt in the repository.
Summary by CodeRabbit