peterdavidfagan
Organizations

@ros-planning @moveit @ros-controls @ipab-rad @scpd-proed @ML-Collective

82 stars written in Python

Inference code for Llama models

Python 58,913 9,813 Updated Jan 26, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,764 6,891 Updated Nov 11, 2025

A toolkit for developing and comparing reinforcement learning algorithms.

Python 36,757 8,710 Updated Oct 11, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,936 3,237 Updated Nov 11, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,213 4,034 Updated Jul 17, 2024

DSPy: The framework for programming—not prompting—language models

Python 29,919 2,392 Updated Nov 10, 2025

Open standard for machine learning interoperability

Python 19,856 3,826 Updated Nov 7, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 19,243 2,984 Updated Nov 10, 2025

Hierarchical Reasoning Model Official Release

Python 11,679 1,708 Updated Sep 9, 2025

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 10,529 792 Updated Nov 11, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,377 583 Updated Oct 28, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,252 890 Updated Jul 8, 2025

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,410 2,621 Updated Nov 11, 2025

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Python 4,313 725 Updated Nov 6, 2025

A library of reinforcement learning components and agents

Python 3,839 505 Updated Sep 26, 2025

JAX-based neural network library

Python 3,114 265 Updated Sep 29, 2025

Python 2,907 336 Updated Nov 11, 2025

Isaac Gym Reinforcement Learning Environments

Python 2,724 497 Updated Oct 26, 2024

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,613 574 Updated Oct 15, 2025

Monte Carlo tree search in JAX

Python 2,555 209 Updated Sep 2, 2025

A TensorFlow implementation of the Differentiable Neural Computer.

Python 2,518 444 Updated Jul 23, 2021

Simple and easily configurable grid world environments for reinforcement learning

Python 2,357 635 Updated Oct 27, 2025

Python 2,025 310 Updated Apr 19, 2024

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Python 2,016 607 Updated Nov 8, 2025

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,824 68 Updated Jun 22, 2025

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python 1,756 317 Updated Jul 30, 2024

An offline deep reinforcement learning library

Python 1,584 260 Updated Sep 10, 2025

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,576 156 Updated Sep 3, 2025

Python 1,547 308 Updated Jul 23, 2024