Bookmarks
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
  • Weekly Feed
  • Monthly Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
4 results tagged speechtotext  ✕   ✕
Audiogrep by antiboredom http://antiboredom.github.io/audiogrep/
Wed 15 Jan 2020 02:59:33 PM PST archive.org

Uses CMU's Pocketsphinx to do a quick speech-to-text transcription, then grep that transcript. It then takes any found text, extracts the sentence in question from the audio stream and saves it as a separate mp3.

speechtotext audio search editor grep cli mp3
GitHub - synesthesiam/rhasspy https://github.com/synesthesiam/rhasspy
Mon 06 Jan 2020 01:41:57 PM PST archive.org

Rhasspy (pronounced RAH-SPEE) is an offline, multilingual voice assistant toolkit inspired by Jasper that works well with Home Assistant, Hass.io, and Node-RED. Designed so that you don't have to use any not-self-hosted software under the hood, from speech recognition to TTS. Emits JSON events. Vocabulary can be expanded with the automated assistance feature. Will run on something as simple as a RasPi but doesn't treat x86(-64) like a second-class citizen. Commands/intents are specified in a fairly easy templating language.

Docs: https://rhasspy.readthedocs.io/en/latest/

python exocortex personalassistant speechrecognition speechtotext selfhosted leandra
mozilla/DeepSpeech: A TensorFlow implementation of Baidu's DeepSpeech architecture https://github.com/mozilla/DeepSpeech
Mon 19 Mar 2018 10:34:08 PM PDT archive.org

Can be used with audio files and probably a hot mic to transcribe speech into text for later processing. Uses git Large File Storage for the neural network objects. GPU acceleration enabled. Includes trained models as well as source code. Available in PyPy as deepspeech and deepspeech-gpu. Supports the RasPi explicitly as a platform, interestingly.

Looking at the releases page is a good way to keep up with the project: https://github.com/mozilla/DeepSpeech/releases

python exocortex leandra tensorflow speechtotext gpu speechrecognition
Common Voice https://voice.mozilla.org/
Mon 19 Mar 2018 10:34:05 PM PDT archive.org

Mozilla's open source speech recognition project. They're asking people to contribute samples of themselves speaking sentences on the screen to grow their corpus.

stt foss samples speechtotext corpus data speechrecognition
4220 links, including 280 private
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn