Ginray

🌶️

艰难，但相信

Yinlei Sun Ginray

🌶️

艰难，但相信

19 followers · 33 following

hangzhou

Achievements

Stars

the-seeds / LLaMA-Factory-Doc

LLaMA Factory Document

154 34 Updated Nov 6, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,701 601 Updated Nov 14, 2025

jd-opensource / xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 699 77 Updated Nov 13, 2025

microsoft / rStar

Python 1,337 120 Updated Sep 12, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,290 95 Updated Nov 13, 2025

zhanshijinwat / Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 751 76 Updated Apr 27, 2025

yaof20 / Flash-RL

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 269 17 Updated Nov 7, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,174 101 Updated Oct 20, 2025

Qihoo360 / Light-R1

Python 748 49 Updated Sep 3, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,412 162 Updated Mar 20, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,640 2,399 Updated Sep 8, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,371 811 Updated Nov 9, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,566 2,516 Updated Nov 13, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,985 384 Updated Nov 13, 2025

jsksxs360 / How-to-use-Transformers

Transformers 库快速入门教程

Python 1,733 207 Updated Sep 20, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 11,033 969 Updated Nov 13, 2025