Stars
Python interface to the WebRTC Voice Activity Detector
🎓🖥️ Solutions for 350+ Interview Questions asked at FANG and other top tech companies
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)
Swift app demonstrating Core ML Stable Diffusion
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
High-Resolution Image Synthesis with Latent Diffusion Models
Robust Speech Recognition via Large-Scale Weak Supervision
A multi-voice TTS system trained with an emphasis on quality
Stable Diffusion for real-time music generation
Efficient 6-DoF Grasp Generation in Cluttered Scenes
[ICLR 2022 poster] Official PyTorch implementation of "Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework"
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
A latent text-to-image diffusion model
🎓 Sharing machine learning course / lecture notes.
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine-learned image captioning systems.
The correct way to resize images or tensors. For NumPy or PyTorch (differentiable).
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
🏆 RoadToWeb3 Polygon Prize - Decentralized "TicketMaster"