Stars
Parse PDFs into markdown using Vision LLMs
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
[CVPR 2025] HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics
[CVPR 2025] "DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion" official implementation.
[ICLR 2025 Spotlight] MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility
[CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation
This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"
[NeurIPS 2024] Data exporter for SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Universal Monocular Metric Depth Estimation
Matterport3D is a pretty awesome dataset for RGB-D machine learning tasks :)
[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation
The calibration program with Double Sphere Camera Model
[NeurIPS 2024] Benchmarking code for SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
[AAAI 2025] Official implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input"
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Official implementation of Continuous 3D Perception Model with Persistent State
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
SfM for sphere images in the ERP format within the framework of ColMap
Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)
Official Implementation of Posterior Distillation Sampling
High-resolution models for human tasks.
[NeurIPS'24] Official implementation of "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors"
This is the project page for the paper "A Simple Baseline for Efficient Hand Mesh Reconstruction" (CVPR 2024)


