Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer with a complete library, supporting the design of interactive information visualization applications for relational data that can be tailored to the problems he or she is addressing.
Comes with Python embedded to interact with the data.
Versions for multiple OSes are available. Might be worth grabbing the .appimage to save time.l
Generate test data, unique ids, passwords, vouchers or other randomized textual data very quickly using a template language. The template language is superficially similar to regular expressions but instead of defining how to match or capture strings, it defines how to generate randomized strings.
A curated list of awesome articles about falsehoods programmers make about things which are simply untrue.
A collective list of free APIs for use in software and web development.
Open source machine learning and data visualization for novice and expert. Interactive data analysis workflows with a large toolbox. Perform simple data analysis with clever data visualization. Explore statistical distributions, box plots and scatter plots, or dive deeper with decision trees, hierarchical clustering, heatmaps, MDS and linear projections. Interactive data exploration. Add-ons available.
Aleph is a tool for indexing large amounts of both documents (PDF, Word, HTML) and structured (CSV, XLS, SQL) data for easy browsing and search. It is built with investigative reporting as a primary use case. Aleph allows cross-referencing mentions of well-known entities (such as people and companies) against watchlists, e.g. from prior research or public datasets. Web-based search. Processing includes optical character recognition, language and encoding detection and named entity extraction. Load structured entity graph data from databases and CSV files. This allows navigation of complex datasets like companies registries, sanctions lists or procurement data.
The Living Library seeks to provide actionable knowledge on governance innovation. We identify for our core audience the “signal in the noise” by curating research, best practices, points of view, new tools, and developments. Spans topic areas from artificial intelligence, open data, and blockchain, to citizen science, open innovation, and civic technology. The platform has an international purview, with insights drawn from across the globe and relevant to a diversity of sectors. Research-based knowledge offerings. Appears to have some up-to-date data archived in some contexts.
A Python package that generates fake data for you. Use to populate databases, spreadsheets, or what have you with plausible looking data. Names, addresses, text... has a plugin system so you can install generators for other kinds of data. There is also a community collection of plugins. Localized. Can be used as a module or a stand-alone tool.
A Python module for formatting things in a human-friendly way. Parses and formats numbers, file sizes, pathnames, lengths of time. Includes timer functionality. Implements lists of options for user input. ANSI friendly.
How to load JSON into a SQLite database all in one go using Python.
SQLite has a JSON datatype, so it's possible to load JSON objects into columns. There still needs to be a unique key for each entry, though.
Convert CSV files into a SQLite database. Designed for use with Datasette. Requires Python 3.
Datasette provides an instant, read-only JSON API for any SQLite database, even those applications on your system maintain. It also provides tools for packaging the database up as a Docker container and deploying that container to hosting providers such as Zeit Now. Got CSV data? Use csvs-to-sqlite to convert them to SQLite. Runs its own webserver on localhost:8001 so you don't need to write code to use it, just point it at a .sqlite3 file.
Requires Python 3.
dataset provides a simple abstraction layer removes most direct SQL statements without the necessity for a full ORM model - essentially, databases can be treated like a JSON file, a hash table or a NoSQL datastore.
Schema is automatically generated and updated. If you want to, you can use SQLite's in-memory database functionality. Records are either created or updated automatically, depending on whether or not they exist already. Built on top of SQLalchemy so whatever it supports, dataset supports.
A collection of open source and commercial tools that aid in red team operations.
Documentation and Test Consoles for Over 500 Public APIs
Overview of one of Arthur Britto's Ripple accounts.
The GeoNames geographical database covers all countries and contains over eleven million placenames that are available for download free of charge.
Web services data API: http://www.geonames.org/export/ws-overview.html
Poke around in there, you'll find some interesting things, like earthquakes, geolocated datasets (eg, nearby Wikipedia pages), Open Streetmap interface. They might even be useful.
A worldwide, real time, community collaborative lightning location network.
Might be an API. Data could be useful.
The OMDb API is a RESTful web service to obtain movie information, all content and images on the site are contributed and maintained by our users.