Bookmarks
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
  • Weekly Feed
  • Monthly Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
7 results tagged transcripts  ✕   ✕
thewh1teagle/vibe https://github.com/thewh1teagle/vibe
Fri 17 Oct 2025 12:55:56 PM PDT archive.org

Transcribe almost every language. Fully offline transcription, no data ever leaves your device. User friendly design (for a change). Transcribe audio or video. Option to transcribe audio from popular websites (YouTube, Vimeo, Facebook, Twitter and more!) Batch transcription. Supports multiple document formats. Plug it into an LLM and you can get summaries. Translate to English from any language. Optimized for GPUs. Has a CLI tool. Supports custom models.

javascript rust transcripts stt exocortex webapps rest api translation cli
James-P-D/SDRTranscriber https://github.com/James-P-D/SDRTranscriber
Tue 26 Aug 2025 02:41:10 PM PDT archive.org

This script which continuously records 1-2 second snippets and saves them as files named 00000000.wav, 00000001.wav and so on. After every 20 seconds we concatenate the files into a larger one named 00000000_00000035.wav and analyse the file for periods of silence lasting 500ms or longer. We then snip the file back into smaller chunks which be believe contain full sentences. These files are saved as 00000000_00000035_x_y.wav where x and y represent the start and end period of the audio in ms. We then pass this .wav file into the offline OpenAI-Whisper speech recognition library. Finally we clear up any old .wav files that have been processed and repeat the process.

rtlsdr python transcripts speechtotext
murtaza-nasir/speakr https://github.com/murtaza-nasir/speakr
Tue 06 May 2025 07:51:17 PM PDT archive.org

Speakr is a personal, self-hosted web application designed for transcribing audio recordings (like meetings), generating concise summaries and titles, and interacting with the content through a chat interface. Keep all your meeting notes and insights securely on your own server. This includes self-hosting your own LLM models to do the heavy lifting, so you don't have to use an LLM service provider.

Upload audio files (MP3, WAV, M4A, etc.) via drag-and-drop or file selection. Transcription and summarization happen in the background without blocking the UI. Uses OpenAI-compatible Speech-to-Text (STT) APIs that you can connect to a self-hosted model (like Whisper). Generates concise titles and summaries using configurable LLMs via OpenAI-compatible APIs. Ask questions and interact with the transcription content using an AI model.

python webapps llm transcripts audio exocortex selfhosted audio
troed/summarize.sh https://codeberg.org/troed/summarize.sh
Sun 23 Feb 2025 09:46:58 PM PST archive.org

A bit of glue between components that is able to textually summarize videos and podcasts - offline. The script takes a URL as argument, downloads and extracts the audio, transcribes the spoken words to text and then finally prints a summary of the content. No external services are used by this script except for the initial audio download. Examples of URLs that work are Youtube videos and Apple podcasts, see the yt-dlp project for the full list.

This script doesn't do anything clever, it just makes use of the great work done by other projects. Since the purpose is to not have to sit through 8-12 minutes of someone explaining what should've just been a short blog post. The default model used is LLaMa-3 to support medium spec hardware. If you have a large system, Mixtral 8x7b is another great option with a much larger context window (= able to work with longer transcriptions).

The script saves transcriptions to a folder in the same directory, and if the same URL is later used again it will not re-download the audio and create a new transcription but use the existing one. This means it's possible to later use the conversational mode to ask questions on the content, even if not done the first time.

Relies upon a locally hosted LLM to do the heavy lifting so you don't have to ship the data off to another service. Entirely self hosted.

scripts utility cli transcripts llm video audio
lkmeta/txtify https://github.com/lkmeta/txtify
Thu 27 Jun 2024 01:58:53 PM PDT archive.org

Txtify is a free and open-source web app for converting audio and video to text using advanced AI models. It supports YouTube videos and personal media files, offering fast and accurate transcriptions. Txtify can be self-hosted, giving you full control over your transcription process.

python webapps exocortex ai ml transcripts generators audio video
TranslateMom https://www.translate.mom/
Fri 13 Oct 2023 02:14:20 PM PDT archive.org

Free, instant translations and transcriptions for video and audio files!

online service transcripts translation audio video
obra/Youtube2Webpage https://github.com/obra/Youtube2Webpage
Fri 15 Sep 2023 05:31:08 PM PDT archive.org

I learn much better from text than from videos.

Youtube-to-Webpage is a Perl script to create a webpage from a Youtube video with a transcript generated from the video's closed captions paired with screenshots of the video.

The project is built upon:

  • yt-dlp
  • ffmpeg
perl youtube videos converter transcripts
6746 links, including 433 private
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn