Stars
NVIDIA-curated collection of educational resources related to general-purpose GPU programming.
NVIDIA-contributed CUDA tutorial for Numba
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
Layout-based Causal Inference for Object Navigation (CVPR 2023)
[ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene
[CVPR 2025] This is the official PyTorch implementation of our paper "TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression"
[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
[CoRL '25] Pseudo-Simulation for Autonomous Driving; [NeurIPS '24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking
[ICCV'21] NEAT: Neural Attention Fields for End-to-End Autonomous Driving
[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
[ECCV 2024] Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
Algorithm practice is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
Code for Streaming 4D Visual Geometry Transformer
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
Diffusion model derived evolutionary algorithm
An optimization-based multi-sensor state estimator
TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Code examples for the China University MOOC course "Introduction to the Robot Operating System" (ROS tutorial)
A complete computer science study plan to become a software engineer.