Stars
[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide
[NeurIPS 2025] One-Step Diffusion-Based Image Compression with Semantic Distillation
Open source code in the field of semantic communication.
Official inference repo for FLUX.1 models
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
PyTorch implementation of MeanFlow on ImageNet and CIFAR10
Unofficial PyTorch implementation of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
Official repo for consistency models.
Deformable ConvNets V2 (DCNv2) in PyTorch
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations (MIB for multimodal sentiment analysis)
ImageBind: One Embedding Space to Bind Them All
Cross-modal few-shot adaptation with CLIP
Deep learning-based task-oriented and unified multi-task semantic communications
AlignedReID++: Dynamically Matching Local Information for Person Re-Identification
[CVPR 2025] Official repo for ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
[CVPR 2022] Part-based Pseudo Label Refinement for Unsupervised Person Re-identification
Collection of publicly available person re-identification datasets
[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
[CVPR 2024] CA-Jaccard: Camera-aware Jaccard Distance for Person Re-identification
A game theoretic approach to explain the output of any machine learning model.
DT-JSCC: discrete joint source-channel coding for task-oriented communication with digital modulation
Demo of robust semantic communication against semantic noise