Stars
Stable Diffusion web UI
Command-line program to download videos from YouTube.com and other video sites
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
real time face swap and one-click video deepfake with only a single image
The definitive Web UI for local AI, with powerful features and easy setup.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
High-Resolution Image Synthesis with Latent Diffusion Models
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Real-time face swap for PC streaming or video calls
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Lets make video diffusion practical!
Command-line program to download image galleries and collections from several image hosting sites
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in.
Waydroid uses a container-based approach to boot a full Android system on a regular GNU/Linux system like Ubuntu.
Script for downloading Coursera.org videos and naming them.
ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.
An arbitrary face-swapping framework on images and videos with one single trained model!
« usbkill » is an anti-forensic kill-switch that waits for a change on your USB ports and then immediately shuts down your computer.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
This repository provides motion datasets collected by Bandai Namco Research Inc