This is an open list of web crawlers to block that are associated with AI companies and the training of LLMs. We encourage you to contribute to this list and implement it on your own site.
You can subscribe to updates via the releases feed: https://github.com/ai-robots-txt/ai.robots.txt/releases.atom
If you just want to pull a robots.txt file: https://raw.githubusercontent.com/ai-robots-txt/ai.robots.txt/refs/heads/main/robots.txt
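
As a minimal sketch of pulling that file from a script, the Python below downloads the robots.txt from the URL above and, optionally, appends it to rules you already have. The helper names `fetch_blocklist` and `merge_robots`, the 10-second timeout, and the local `robots.txt` output path are assumptions made for illustration; they are not part of this project.

```python
import urllib.request

# URL taken from the text above.
ROBOTS_URL = (
    "https://raw.githubusercontent.com/ai-robots-txt/ai.robots.txt/"
    "refs/heads/main/robots.txt"
)


def fetch_blocklist(url: str = ROBOTS_URL, timeout: float = 10.0) -> str:
    """Download the published robots.txt and return it as text.

    Hypothetical helper; the timeout value is an arbitrary choice.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


def merge_robots(existing: str, blocklist: str) -> str:
    """Append the block list to existing robots.txt rules.

    Hypothetical helper. The two parts are separated by a blank line so
    they stay in separate groups, and the result ends with a newline.
    """
    existing = existing.strip()
    blocklist = blocklist.strip()
    if not existing:
        return blocklist + "\n"
    return existing + "\n\n" + blocklist + "\n"


if __name__ == "__main__":
    # Assumed output path: a robots.txt in the current directory.
    # Replace the empty string with your own rules if you have any.
    merged = merge_robots("", fetch_blocklist())
    with open("robots.txt", "w", encoding="utf-8") as handle:
        handle.write(merged)
```

Keeping the download and the merge in separate functions means the merge step can be checked without network access, and the download only runs when the script is executed directly.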