Qi Mao HelenMao

🎯

Focusing

Associate Professor, Communication University of China，PhD from Peking University

165 followers · 29 following

Communication University of China
CUC, Beijing, China

Achievements

Stars

inFaaa / Awesome-Personalized-Video-Creation

📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.

44 1 Updated Jul 2, 2025

Yui010206 / VEGGIE-VidEdit

[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation

20 Updated Jun 25, 2025

lzyhha / VisualCloze

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)

Python 247 11 Updated Jun 4, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 14,865 1,347 Updated Jun 27, 2025

showlab / FAR

Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 223 8 Updated Apr 23, 2025

zibojia / SENORITA

This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Python 56 1 Updated Apr 9, 2025

zhang0jhon / diffusion-4k

[CVPR 2025] Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models

Python 239 8 Updated Jun 3, 2025

cshw2021 / Learned-Image-Video-Compression

A collection of papers related to data compression

76 2 Updated Jul 2, 2025

showlab / MovieAgent

MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning

Python 213 27 Updated Mar 26, 2025

CUC-MIPG / Edit-Transfer

Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"

Jupyter Notebook 79 1 Updated Jun 6, 2025

showlab / Awesome-Robotics-Diffusion

A curated list of recent robot learning papers incorporating diffusion models for robotics tasks.

203 6 Updated Jun 13, 2025

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

603 32 Updated Jun 27, 2025

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

445 10 Updated Jan 17, 2025

showlab / ROICtrl

Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation

Python 108 Updated Apr 16, 2025

kyegomez / movie-gen

An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!

Python 60 3 Updated Jun 30, 2025