ORLax - Offline Reinforcement Learning with JAX

ORLax is an extensible, research-friendly offline reinforcement learning framework built with JAX, Flax, and Optax. It provides clean, typed APIs optimized for editor autocompletion, modular algorithm implementations, and production-ready features like WandB logging and GPU acceleration.

Features

🔥 Modern JAX Stack: Built on JAX, Flax, and Optax for high-performance GPU/TPU training
📦 Modular Design: Clean separation of concerns with pluggable algorithms, models, and datasets
🎯 Type-Safe: Comprehensive type hints with dataclasses instead of dict-heavy patterns
📊 Built-in Logging: WandB integration with terminal progress bars (tqdm)
🚀 Production-Ready: Checkpointing, multi-device training, and reproducible experiments
🧪 Research-Friendly: Clear interfaces, and easy extensibility

Algorithms

BC (Behavioral Cloning) - Supervised learning from expert demonstrations
CQL (Conservative Q-Learning) - Conservative offline RL with Q-value penalties
IQL (Implicit Q-Learning) - Expectile regression-based offline RL

Installation

Using uv (Recommended)

# Clone the repository
git clone https://github.com/sql-hkr/orlax.git
cd orlax

# Install with uv
uv sync

Using pip

# Clone the repository
git clone https://github.com/sql-hkr/orlax.git
cd orlax

# Install in editable mode
pip install -e .

GPU Support

For CUDA support, install JAX with CUDA:

# For CUDA 12
pip install --upgrade "jax[cuda12]"

Quick Start

Training

# Train IQL on Hopper-Medium
uv run orlax-train --config configs/iql_hopper.toml

Citation

If you use ORLax in your research, please cite:

@software{orlax2025,
  title = {ORLax: Offline Reinforcement Learning with JAX},
  author = {sql-hkr},
  year = {2025},
  url = {https://github.com/sql-hkr/orlax}
}

Acknowledgments

Built with JAX, Flax, and Optax
Offline RL datasets from Minari (successor to D4RL)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feat/amazing-feature)
Commit your changes (git commit -m 'feat: add amazing feature')
Push to the branch (git push origin feat/amazing-feature)
Open a Pull Request

Contact

Author: sql-hkr
Email: [email protected]
GitHub: @sql-hkr
Issues: GitHub Issues

Note: This software is under active development. API stability is not guaranteed until version 1.0.0.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github/workflows		.github/workflows
configs		configs
src/orlax		src/orlax
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ORLax - Offline Reinforcement Learning with JAX

Features

Algorithms

Installation

Using uv (Recommended)

Using pip

GPU Support

Quick Start

Training

Citation

Acknowledgments

Contributing

Contact

About

Uh oh!

Releases 2

Languages

License

sql-hkr/orlax

Folders and files

Latest commit

History

Repository files navigation

ORLax - Offline Reinforcement Learning with JAX

Features

Algorithms

Installation

Using uv (Recommended)

Using pip

GPU Support

Quick Start

Training

Citation

Acknowledgments

Contributing

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Languages