mskimS2

Stars

RL

20 repositories

PyTorch implementation of DreamerV2 model-based RL algorithm

Python 236 54 Updated Apr 26, 2023

PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies

Python 58 6 Updated Jan 22, 2021

A collection of offline reinforcement learning algorithms.

Python 207 27 Updated Nov 26, 2024

A curated list of reinforcement learning resources

9,505 1,886 Updated May 25, 2023

Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code for Decision LSTM architecture. Extension of the original D…

Python 27 2 Updated Mar 24, 2023

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,514 584 Updated May 25, 2024

[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.

Python 96 2 Updated Oct 20, 2025

PWM: Policy Learning with Large World Models

Jupyter Notebook 64 7 Updated Aug 4, 2025

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python 2,733 506 Updated Apr 29, 2024

Online Decision Transformer

Python 274 42 Updated Jan 22, 2024

Learning Latent Dynamics for Planning from Pixels

Python 1,227 213 Updated Mar 24, 2023

Deep Planning Network: Control from pixels by latent planning with learned dynamics

Python 373 63 Updated Oct 15, 2021

A simplified PyTorch version of the Dreamer algorithm

Python 146 26 Updated Jul 24, 2023

A PyTorch implementation of Dreamer

Python 23 3 Updated Mar 13, 2023

PyTorch implementation of Dreamer (model-based image RL algorithm)

Python 167 38 Updated Jan 19, 2025

Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.

Python 321 39 Updated Jan 11, 2024

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Python 763 64 Updated Oct 4, 2024

Python 343 46 Updated Mar 24, 2025

C++ 3 1 Updated May 17, 2024