Stanford has open sourced a self-hosted, personal assistant system. Designed with privacy in mind. Speech recognition, analysis, task execution. They want to make so that it's easy and highly useful for everyone to integrate into their stuff. Can monitor things and filter for certain things. Aims for composability. Services (skills) are also open source and crowdsourced.
Github: https://github.com/stanford-oval
An application that uses AI and ML to intelligently strip the vocals out of music tracks. Written in Python, uses Tensorflow. The pre-trained model is included in the distribution. Use as a CLI tool or a library for your own code.
A three-stage deep learning system which can figure out how to imitate a person's voice from as little as five seconds of recorded speech. Speaks with the deepfaked voice in realtime. The sample is used to condition an existing TTS model to sound like someone.
Running it inside a Docker container: https://sean.lane.sh/posts/2019/07/Running-the-Real-Time-Voice-Cloning-project-in-Docker/
An opensource voice chat system for use by gamers. Client/server architecture. Runs on Windows, Linux, an OSX.
What seems to be a fairly new web based XMPP client. Owncloud and Nextcloud integrations. Standalone mode also. voice and video enabled, also. OTR enabled. /me emotes.
voice assistant speechrecognition exocortex Designed for privacy, runs on-device on-prem, works without a network connection. There are community contributed skills, called snips, provided skills, and you can develop your own with a web based visual builder. Supports multiple human languages. Deploy to your own devices as long as they run android or linux. github repos here: https://github.com/snipsco
A way to donate your voice to build speech synthesizers for people. In a way, you can back up your voice if you think you're going to lose it.
F/OSS voice control system. Runs on a raspi. Extensible. Uses speech synthesis to respond.