Skip to content
View TianxingWu's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report TianxingWu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)

Jupyter Notebook 338 19 Updated Dec 18, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,877 1,816 Updated Oct 13, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,647 749 Updated Sep 22, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,706 311 Updated Nov 28, 2025

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 137 5 Updated Jun 4, 2025

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

2,982 135 Updated Dec 20, 2025

AllTracker is a model for tracking all pixels in a video.

Python 378 27 Updated Sep 2, 2025

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

Python 2,228 522 Updated Dec 22, 2025

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 2,151 139 Updated Nov 2, 2025

StereoPilot Elastic3D StereoWorld BetterDepth BRIDGE BriGeS ChronoDepth Depth Any Video Depth Anything Depth Pro DepthCrafter Distill Any Depth FE2E GRIN M2SVid MASt3R MegaSaM Metric3D Metric-Solve…

207 6 Updated Dec 21, 2025

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python 869 39 Updated Sep 26, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,195 106 Updated Oct 15, 2025

[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"

Python 584 29 Updated Jul 1, 2025

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,117 60 Updated Nov 9, 2025

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)

Python 274 14 Updated Dec 17, 2025

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 14,085 3,018 Updated Oct 22, 2025

[CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Python 454 26 Updated Apr 4, 2025

Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"

103 1 Updated Jun 4, 2025

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 1,640 140 Updated Oct 7, 2025
Python 144 8 Updated Oct 9, 2025

Web-based 3D visualization + Python

Python 2,137 162 Updated Dec 20, 2025

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,939 146 Updated Oct 1, 2025

[CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"

Python 229 11 Updated May 7, 2024

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

1,174 71 Updated Dec 25, 2025

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,313 59 Updated Jul 23, 2025
Python 111 4 Updated Oct 1, 2021

Taichi Blender intergration for physics simulation and animation

Python 170 15 Updated Mar 2, 2021

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 752 102 Updated Oct 29, 2025

Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Jupyter Notebook 390 76 Updated Aug 20, 2025

A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.

229 8 Updated Dec 19, 2025
Next