-
yifeizhou02.github.io Public
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License Updated Sep 5, 2025
-
ArCHer Public
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
-
RAM_public Public
Forked from facebookresearch/RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Python MIT License Updated Mar 17, 2025
-
verl Public
Forked from volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 Updated Feb 20, 2025
-
collab_openrlhf Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
-
archer.io Public
Forked from nerfies/nerfies.github.io
Website for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
JavaScript Updated Mar 12, 2024
-
BT-2 Public
Research Code for "BT^2: Backward-compatible Training with Basis Transformation" (https://arxiv.org/abs/2211.03989).
-
hybrid_trpo Public
Forked from ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Python MIT License Updated Feb 2, 2023
-
clipscore Public
Forked from jmhessel/clipscore
CLIPScore EMNLP code
Python MIT License Updated Dec 16, 2022
-
Research code for "GAPX: Generalized Autoregressive Paraphrase-identification X", NeurIPS 2022
-
Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022
-
ml-fct Public
Forked from apple/ml-fct
Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022.
Python Other Updated May 27, 2022
-
SimCSE Public
Forked from princeton-nlp/SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
Python MIT License Updated Nov 26, 2021

