Crowdsec is an open-source, lightweight software, detecting peers with aggressive behaviors to prevent them from accessing your systems. Its user friendly design and assistance offers a low technical barrier of entry and nevertheless a high security gain. Scans logs for signs of activity. Matches signs to local and crowdsourced attack signs. If a response agent is integrated with the service, it will react to the attack. Signs are also contributed back to the project to aid the community. Interactive setup and configuration. Designed not to need fine tuning to be effective.
A F/OSS natural language translation system that seems to want to give Google Translate a run for its money. The corpuses used for training appear to be crowdsourced, and I think you can download the trained models on their own. Aims to be self-hosted.
Installation docs: http://wiki.apertium.org/wiki/Installation
A crowdsourced map of the weird and uncanny.
A global network of satellite ground stations, designed as an open source participatory project. A machine readable crowdsourced satellite information database. Built from readily available and affordable tools and resources. Concentrates on satellites in LEO. A significant amount of work is involved so you won't be able to just throw up an RTL-SDR and get going. Designed to be built using readily available materials and access to basic tools and machinery using 3D printers and CNC as provided by average hackerspaces.
A crowdsourced mapping project which aims to document the trunk lines underlying the Net, and where privacy violations and surveillance infrastructure can watch it. Has open source, cross-platform software for download that runs traceroutes to various places, correlates the data, and uploads it to the project (anonymously, by default) for analysis.
API documentation for wigle.net.
A free, publically referrable, publically editable street map of the world, generated out of data contributed by the people who live there.
A crowdsourced open environmental data project. volunteer centered. Measures airquality, radiation, and environmental health.
A site that has presents a step-by-step howto for getting involved with OpenStreetmap.
A site that lists every creditcard vendor and their Cardholder Verification Methods for the cards they issue. The site seems to be crowdsourced because the companies don't necessarily publish this stuff.
An online open and crowdsourced weather service. People set up automatic weather stations (which are fairly cheap) and contribute measurements that are aggregated into forecasts. Has an API so you can pull data out of it as well as contribute it: http://openweathermap.org/appid Free accounts are, of course, limited in several ways. You can also get weather maps of various kinds from the service to visualize the forecast data. Forecast data is in XML, JSON, and HTML formats.
Another crowdsourced radiation monitoring website.
NLP training corpuses for the Chatterbot python module. Contains all of the structured text used to teach the text classifier and semantic analysis engines for the module. All user contributed. Encouages contribution by the community. YAML categories The training data consists of actual conversations and fragments thereof in the file.
3697 links, including 185 private