Stars
3D Slicer extension for fully-automatic segmentation of CT and CBCT dental volumes.
Automatic segmentation of CBCT scans with a 3D Unet
CVPR 2024: AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
NeurIPS 2023: Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation
3D Dental surface segmentation with Tooth Group Network
TCSVT 2025: MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Official code for CVPR 2023 paper NeuFace: Realistic 3D Neural Face Rendering from Multi-view Images.
Out of time: automated lip sync in the wild
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
MiVOLO age & gender transformer neural network
[TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver
An MBTI Exploration of Large Language Models
High-fidelity 3D Face Generation from Natural Language Descriptions (CVPR 2023)
Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)
Retrieval and Retrieval-augmented LLMs
TripoSR: Fast 3D Object Reconstruction from a Single Image
