str2speech is a simple command-line tool for converting text to speech using Transformer-based text-to-speech (TTS) models. It supports multiple models and voice presets, allowing users to generate high-quality speech audio from text.
Supports multiple TTS models, including suno/bark-small, suno/bark, and various facebook/mms-tts models. Allows selection of voice presets. Supports text input via command-line arguments or files. Outputs speech in .wav format. Works with both CPU and GPU.
Looks like the speech models have to be installed locally to work.