Northeastern University
United States
https://scholar.google.com/citations?user=UZSbtlsAAAAJ&hl=en
Stars
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
The first Interleaved framework for textual reasoning within the visual generation process
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”
ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple and have a scalable dataset.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12390 [ECCV 2024]
An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
HunyuanVideo: A Systematic Framework For Large Video Generation Model
DelinQu / SimplerEnv-OpenVLA
Forked from simpler-env/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
[ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
Zijian007 / LIBERO-PRO
Forked from Zxy-MLlab/LIBERO-PRO
The official repository of LIBERO-PRO, an evaluation extension of the original LIBERO benchmark.
Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.
Official implementation of Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization. https://blind-vla-paper.github.io
Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."
RoboMonster: Compositional Generalization of Heterogeneous Embodied Agents
Training VLM agents with multi-turn reinforcement learning
[NeurIPS 2025] Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
