LuLim14

LuLim14

DL researcher/research engineer, interesting in LLM alignment, reinforcement learning, mechanistic interpretability. Very like PyTorch and Transformers

2 followers · 4 following

Pinned Loading

Alignment_project Alignment_project Public

Implementation WARP algorithm for LLM alignment

Jupyter Notebook
RL_Algorithms RL_Algorithms Public

Reinforcement learning algorithms implementation on PyTorch

Python
StyleTransferBot StyleTransferBot Public

Style Transfer Bot

Jupyter Notebook
Transformers_Experiments Transformers_Experiments Public

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LuLim14

Block or report LuLim14

Pinned Loading

Uh oh!