The U.S. Energy Information Administration is committed to its free and open data by making it available through an Application Programming Interface (API) and its open data tools. Hourly operating data, power generation stats, capabilities, and more.
We build and maintain an open repository of web crawl data that can be accessed and analyzed by anyone. The Common Crawl corpus contains petabytes of data collected since 2008. It contains raw web page data, extracted metadata and text extractions. The Common Crawl dataset lives on Amazon S3 as part of the Amazon Web Services’ Open Data Sponsorships program. You can download the files entirely free using HTTP(S) or S3. Our goal is to democratize the data so everyone, not just big companies, can do high quality research and analysis.
As part of the OpenFEMA initiative, FEMA is providing read-only API based access to datasets (Entities). The data is exposed using a RESTful interface that uses query string parameters to manage the query. Use of the service is free and does not require a subscription or API key.
A full list of Entities/endpoints supported by the API can be found at Data Sets.
Public data about businesses, government entities, military agencies, police, and so forth throughout the United States.
When the US-based SkyHub organisation regrettably closed down in August 2021, enthusiastic former community members wanted to keep the project and the valuable exchange within the community alive. Therefore, in October 2021, we, a group of European astronomers, software developers and hardware engineers, founded Sky360 as a non-profit NGO association, registered in Austria.
We want to provide a community platform, tools and support to all people interested in observing the skies for stars, meteors, satellites, planes, drones, weather phenomena, birds, UAPs or anything else that happens in our atmosphere and low Earth orbit. We already support the Discord channel the UAP Tracking Forum for the community of UAP trackers with over 900 members and more communities to come in the future. Together with and for the community we develop hardware and software for a 24/7 citizen sky observatory that can detect, track, identify and analyze any aerial phenomena and yet is still affordable for citizens.
Congress.gov shares its application programming interface (API) with the public to ingest the Congressional data. Sign up for an API key that you can use to access web services provided by Congress.gov.
Github (with better documentation): https://github.com/LibraryOfCongress/api.congress.gov/
Global.health is a collaborative effort by technologists and researchers from leading international institutions to build a trusted, detailed, and accurate resource of real-time infectious disease data.
By creating a centralized open resource of verified case-level data from around the world, our aim is to accelerate the work of researchers, public health officials, and the global community to better prepare for, respond to, and reduce the burden of disease outbreaks. We hope that this work will help cultivate a global community invested in improving health outcomes for all through open and secure data sharing.
This repository contains dated records of curated Monkeypox cases from the 2022 outbreak (April - ), a data dictionary, and a script used to pull contents from a spreadsheet into JSON and CSV files.
The script is intended for use by the curation team and supporting engineers. It requires access to the relevant Google Sheet, and a Google Cloud service account.
The data dictionary is located in the root directory of this project. It contains information about columns/fields in the data sets.
The archives folder contains dated JSON and CSV files. They are currently uploaded manually; regularly and automatically updated data sets live in an (currently private) S3 bucket.
The data is updated six times per day.
Powers this site: https://map.monkeypox.global.health/country
Open Infrastructure Map is a view of the world's infrastructure mapped in the OpenStreetMap database. This data isn't exposed on the default OSM map, so I built Open Infrastructure Map to visualise it. If you want to edit the data and you're new to OpenStreetMap, check out learnOSM.
If you already have some OSM experience and want to start tagging infrastructure things, take a look at the tagging guidelines for power and telecoms.
The Commercial Military Actor Database (CMAD) is able to support research on civil war and commercial military actors. First, the CMAD covers all civil wars from 1980 to 2016 across all of the world’s regions except Europe, which enables the investigation of long-term regional and global trends. Second, the CMAD encompasses the corporate market segment and mercenary outfits, which facilitates the analysis of how those actors have impacted conflicts differently. Third, containing detailed information about the relationships behind exchanges, the CMAD allows users to disaggregate market exchanges.
A directory of the CDC's REST APIs.
Risk Factor™ is a free online tool created by the nonprofit First Street Foundation® that makes it easy for Americans to find their property’s risk from environmental threats such as flooding and wildfires and understand how risks are changing because of a changing environment.
First Street Foundation’s mission is to address the asymmetry in access to high-quality climate change data by quantifying and communicating America’s environmental risks so that everyone can make informed decisions for the future. By making flood and fire risk data accessible and easy to understand, individuals and communities can prepare for and mitigate risks before they become a reality.
Look up your zipcode to get an assessment of environmental disaster risk in your area.
The Socrata Open Data API allows you to programmatically access a wealth of open data resources from governments, non-profits, and NGOs around the world.
The OpenAQ Community harmonizes disparate air quality data from across the world so that citizens and organizations can fight air inequality more efficiently. The data is captured from multiple sources and made accessible to all through our open-source platform.
Bulk upload: https://upload.openaq.org/
A collective list of free APIs for use in software and web development.
A Creative Commons handbook that teaches the legal, social, and technical aspects of open data. Teaches not only what you can do with an open data set, but how to go about getting datasets opened to analysis. If you're looking for a hands-on nitty-gritty book about mining open data, however, this isn't it.
CKAN is open source data management portal software for extremely large data sets, on the scale of entire governments. Used to publish, curate, share, search, and find datasets of just about any kind. Does visualization and metrics. Source code on Github (https://github.com/ckan/ckan). Written in Python, uses Postgres as its back end. Can interface with Apache Solr for search and indexing.
The datarefuge website. Probably as official as it's going to get. Has some useful definitions, at least. opendata Also has a bunch of rescued datasets available for download. data