Stars
A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm
LordKayBanks / ts-fsrs
Forked from open-spaced-repetition/ts-fsrsts-fsrs is a versatile package written in TypeScript that supports ES modules, CommonJS, and UMD.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voic…
Use Microsoft Edge's online text-to-speech service in Node.js, browsers, or any JavaScript environment WITHOUT needing Microsoft Edge or Windows or an API key
Use Microsoft Edge's TTS service on Node.js with support for proxy and subtitles.
Edge TTS is a Node or Bun package that allows access to the online text-to-speech service used by Microsoft Edge without the need for Microsoft Edge, Windows, or an API key.
🌐 The Internet Computer! Free, Open-Source, and Self-Hostable.
Open Source project using LLMs to translate subtitles (SRT, SSA/ASS, VTT)
Generate audiobooks from e-books, voice cloning & 1158+ languages!
Meridian cuts through news noise by scraping hundreds of sources, analyzing stories with AI, and delivering concise, personalized daily briefs.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
mdsplit is a python command line tool to split Markdown files into chapters at a given heading level
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Command line utility for forced alignment using Kaldi
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.