Stars
[ICCV 2025] LayerAnimate: Layer-specific Control for Animation
Efficient DiT architecture for text2any tasks, ICLR2025
Understand Human Behavior to Align True Needs
Lumina-T2X is a unified framework for Text to Any Modality Generation
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
A hyperspherical face recognition library based on PyTorch
Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).
Paper and Codes for “RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection” (ICCV2021)
State-of-the-art 2D and 3D Face Analysis Project
LiDAR R-CNN: An Efficient and Universal 3D Object Detector
LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention (CVPR20)
SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds (ECCV2020)
Content-Aware Unsupervised Deep Homography Estimation
适用于移动端的人脸识别模型,计算量与mobilefacenet相同,但megaface上提升了2%+
A list of papers and datasets about point cloud analysis (processing)
SECOND for KITTI/NuScenes object detection
Deep Hough Voting for 3D Object Detection in Point Clouds (https://arxiv.org/abs/1904.09664)
PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud, CVPR 2019.
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
official github for paper "CELEB-500K: A LARGE TRAINING DATASET FOR FACE RECOGNITION"
对比ZQCNN-MTCNN与libfacedetection
A Simple and Versatile Framework for Object Detection and Instance Recognition