Lists (13)
Sort Name ascending (A-Z)
Starred repositories
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
WordPress with SQLite, ready to use out of the box.
A generative speech model for daily dialogue.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
Deezer source separation library including pretrained models.
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Port of OpenAI's Whisper model in C/C++
Faster Whisper transcription with CTranslate2
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
Summarize Youtube Videos and Generate Timestamps Efficiently using LLM [Google Gemini Pro, OpenAI ChatGPT]
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
Tesseract Open Source OCR Engine (main repository)
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
A collection of my own ComfyUI workflows for working with SDXL
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
High-Resolution Image Synthesis with Latent Diffusion Models
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A curated list of the most impressive AI papers
Retrieval and Retrieval-augmented LLMs
A versatile Python library for EPUB2/EPUB3 manipulation and processing.