PRIME-RL

Researching scalable reinforcement learning (RL) methods for language models.

Pinned

Entropy-Mechanism-of-RL (Public)
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Python, 185 stars, 6 forks

SimpleVLA-RL (Public)
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
Python, 230 stars, 6 forks

PRIME (Public)
Scalable RL solution for advanced reasoning of language models
Python, 1.6k stars, 96 forks

TTRL (Public)
TTRL: Test-Time Reinforcement Learning
Python, 644 stars, 46 forks

ImplicitPRM (Public)
Repo of paper "Free Process Rewards without Process Labels"
Python, 152 stars, 10 forks

Repositories

Showing 5 of 5 repositories

Entropy-Mechanism-of-RL (Public)
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Python, 185 stars, 6 forks, 7 issues, 0 pull requests, updated Jun 20, 2025

SimpleVLA-RL (Public)
Online RL with Simple Reward Enables Training VLA Models with Only One Trajectory
Python, MIT license, 230 stars, 6 forks, 10 issues, 1 pull request, updated Jun 20, 2025

TTRL (Public)
TTRL: Test-Time Reinforcement Learning
Python, MIT license, 644 stars, 46 forks, 4 issues, 0 pull requests, updated Jun 6, 2025

PRIME (Public)
Scalable RL solution for advanced reasoning of language models
Python, Apache-2.0 license, 1,619 stars, 96 forks, 6 issues, 1 pull request, updated Mar 18, 2025

ImplicitPRM (Public)
Repo of paper "Free Process Rewards without Process Labels"
Python, Apache-2.0 license, 152 stars, 10 forks, 12 issues, 0 pull requests, updated Mar 14, 2025
