Yinlei Sun Ginray

🌶️

Hard, but keep believing

17 followers · 33 following

Hangzhou

Stars

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,631 590 Updated Nov 2, 2025

jd-opensource / xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 638 76 Updated Nov 3, 2025

microsoft / rStar

Python 1,329 118 Updated Sep 12, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,263 88 Updated Nov 1, 2025

zhanshijinwat / Steel-LLM

Train a 1B LLM on 1T tokens from scratch as an individual

Jupyter Notebook 744 76 Updated Apr 27, 2025

yaof20 / Flash-RL

Implementation of FP8/INT8 rollout for RL training without a performance drop.

Python 263 18 Updated Sep 29, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,120 95 Updated Oct 20, 2025

Qihoo360 / Light-R1

Python 749 49 Updated Sep 3, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,409 162 Updated Mar 20, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,599 2,401 Updated Sep 8, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,295 806 Updated Oct 31, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,028 2,409 Updated Nov 3, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,964 379 Updated Nov 1, 2025

jsksxs360 / How-to-use-Transformers

A quick-start tutorial for the Transformers library

Python 1,718 206 Updated Sep 20, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,810 938 Updated Nov 3, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,446 284 Updated Oct 30, 2025

luxonis / datadreamer

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 129 7 Updated Sep 25, 2025

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,073 55 Updated Feb 2, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

35,422 1,928 Updated Aug 1, 2024

Tencent / KsanaLLM

C++ 509 41 Updated Sep 12, 2025

TencentARC / mllm-npu

mllm-npu: training multimodal large language models on Ascend NPUs

Python 93 2 Updated Aug 29, 2024

liguodongiot / unify-easy-llm

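unify-easy-llm (ULM) aims to be a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models.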

Python 58 10 Updated Jul 26, 2024

pandada8 / llm-inference-benchmark

Performance benchmarks for LLM inference services

Jupyter Notebook 44 5 Updated Dec 17, 2023

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,772 3,902 Updated Nov 3, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,594 4,608 Updated Oct 31, 2025

xai-org / grok-1

Grok open release

Python 50,553 8,369 Updated Aug 30, 2024

alibaba / EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,172 257 Updated Nov 27, 2024

reworkd / AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,163 9,484 Updated Apr 29, 2025

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,183 363 Updated Aug 14, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,123 2,269 Updated Nov 3, 2025