Stars
MAGI-1: Autoregressive Video Generation at Scale
[CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
A fundamental toolkit designed for music, song, and audio generation
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
[NeurIPS 2024] SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
[CVPR 2025] Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"
Official PyTorch implementation of the paper "Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data" (CVPR 2022).
Official repository of In-Context LoRA for Diffusion Transformers
victorchall / genmoai-smol
Forked from genmoai/mochi
The best OSS video generation models
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[Patterns (Cell subsidiary journal)] The official code for "UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation".
Reproduction of AnimateAnyone
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
Unofficial Implementation of Animate Anyone
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Generative Models by Stability AI
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
MagicAvatar: Multimodal Avatar Generation and Animation
Tiny AutoEncoder for Stable Diffusion in TensorFlow / Keras
