Skip to content
@RLHF-And-Friends

RLHF-And-Friends

RLHF & Friends

Rain all day, fine-tuning all night

Popular repositories Loading

  1. TunePPO TunePPO Public

    Python 4 1

  2. PrePPO PrePPO Public

    Some preparation steps for RLHF PPO training

    Jupyter Notebook 1

  3. .github .github Public

  4. FedRL FedRL Public

    Python

  5. ForkedPPO ForkedPPO Public

    Forked from vwxyzjn/ppo-implementation-details

    The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

    Python

Repositories

Showing 5 of 5 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…