Useful resources for using IPFS and building things on top of it.
A repository for monitoring attack vectors mentioned in the billion-dollar disinformation campaign to reelect the president in 2020. Includes some Python code for analyzing the data.
A topic-centric list of high-quality open datasets in public domains. By everyone, for everyone!
This website generates random JSON documents, suitable for use as test data or learning how to write and interface with various APIs.
Free datasets made available by Amazon. Stuff like an atlas of the Galactic Plane, NASA NEX data, the Human Microbiome Project, the Enron emails, Freebase, and the Marvel Universe's social graph. The Google Books Ngrams corpus is in here too, alongside the Westbury Lab USENET corpus.
An open-source tool for the visualization of extremely large datasets, like Twitter maps or email databases.
The datarefuge website. Probably as official as it's going to get. Has some useful definitions, at least. Also has a bunch of rescued datasets available for download.
A wiki of resources for people writing bots - actual bots to interact with, tools, tutorials, code, and datasets.
A social network and archive of public and open data for research and study. Encourages people to upload their own datasets for others to use. I use GitHub to authenticate.
Vast collections of data suitable for training and teaching AI/ML software.