DL researcher/research engineer, interesting in LLM alignment, reinforcement learning, mechanistic interpretability. Very like PyTorch and Transformers
Pinned Loading
-
Alignment_project
Alignment_project PublicImplementation WARP algorithm for LLM alignment
Jupyter Notebook
-
RL_Algorithms
RL_Algorithms PublicReinforcement learning algorithms implementation on PyTorch
Python
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.