Stars
[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
Unofficial PyTorch implementation of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…
Enjoy the magic of Diffusion models!
Toolkit for linearizing PDFs for LLM datasets/training
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
[ECCV 2024] Code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Efficient vision foundation models for high-resolution generation and perception.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Transparent Image Layer Diffusion using Latent Transparency
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" (AIM ECCVW 2022) https://arxiv.org/abs/2201.13433
Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" (CVPR 2022) https://arxiv.org/abs/2111.15666
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
A reference containing Styles and Keywords that you can use with MidJourney AI. There are also pages showing resolution comparison, image weights, and much more!
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
High-Resolution Image Synthesis with Latent Diffusion Models
Automatic detection, masking, and inpainting with a detection model.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
🔥 LeetCode solutions in any programming language | Solutions in multiple programming languages for LeetCode, 《剑指 Offer》 (2nd edition), and Cracking the Coding Interview (6th edition)
[IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch