Skip to content
View Ivan-Tang-3D's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Ivan-Tang-3D

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 1,924 119 Updated Dec 25, 2025

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 615 49 Updated Dec 22, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,688 895 Updated Dec 18, 2025

Native and Compact Structured Latents for 3D Generation

Python 2,338 162 Updated Dec 23, 2025

[AAAI 26 Oral] Official implementation of "FreeGaussian: Annotation-free Control of Articulated Objects via 3D Gaussian Splats with Flow Derivatives"

Python 27 2 Updated Dec 1, 2025

Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation

Python 179 11 Updated Dec 9, 2025

SAM 3D Objects with Multi-view Images

Python 144 4 Updated Dec 5, 2025

Tools to build pytorch3d wheels for linux

Shell 5 Updated Nov 19, 2025

The first Interleaved framework for textual reasoning within the visual generation process

153 1 Updated Nov 21, 2025

The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"

Python 75 Updated Dec 17, 2025

Team Comet's 2025 BEHAVIOR Challenge Codebase

Python 166 8 Updated Dec 17, 2025

Are Video Models Ready as Zero-shot Reasoners?

Python 84 4 Updated Nov 24, 2025

每日arxiv论文更新;Topic:EmbodiedAI,MLLM,Vision- Language- Navigation

HTML 15 Updated Dec 25, 2025

3D Gaussian Splat Editor

TypeScript 3,362 382 Updated Dec 24, 2025

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

Python 13,082 1,756 Updated Dec 11, 2025

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 627 48 Updated Dec 25, 2025

Fully Open Framework for Democratized Multimodal Training

Python 663 53 Updated Dec 15, 2025

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 245 13 Updated Dec 5, 2025

EO: Open-source Unified Embodied Foundation Model Series

Jupyter Notebook 279 26 Updated Nov 12, 2025
Python 1,372 120 Updated Sep 12, 2025
Python 11 Updated Aug 11, 2025

paper code

Python 42 2 Updated Mar 20, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,077 666 Updated Nov 20, 2025

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 414 21 Updated Dec 22, 2024

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,729 1,024 Updated Dec 4, 2025

Text-audio foundation model from Boson AI

Python 7,772 578 Updated Sep 15, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,781 129 Updated Dec 25, 2025
Python 19 Updated Oct 15, 2025
Next