Ginray

🌶️

Hard, but I believe

Yinlei Sun Ginray

17 followers · 33 following

Hangzhou

Stars

137 results for source starred repositories

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,643 594 Updated Nov 4, 2025

jd-opensource / xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 649 76 Updated Nov 4, 2025

microsoft / rStar

Python 1,330 119 Updated Sep 12, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,266 89 Updated Nov 4, 2025

zhanshijinwat / Steel-LLM

Train a 1B LLM on 1T tokens from scratch as an individual

Jupyter Notebook 744 76 Updated Apr 27, 2025

yaof20 / Flash-RL

Implementation of FP8/INT8 rollout for RL training without a performance drop.

Python 263 18 Updated Sep 29, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,124 96 Updated Oct 20, 2025

Qihoo360 / Light-R1

Python 749 49 Updated Sep 3, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,409 163 Updated Mar 20, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,603 2,401 Updated Sep 8, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,305 807 Updated Oct 31, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,061 2,416 Updated Nov 4, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,964 379 Updated Nov 4, 2025

jsksxs360 / How-to-use-Transformers

A quick-start tutorial for the Transformers library

Python 1,719 206 Updated Sep 20, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,837 940 Updated Nov 3, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,458 285 Updated Nov 4, 2025

luxonis / datadreamer

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 129 7 Updated Sep 25, 2025

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,073 55 Updated Feb 2, 2025

Tencent / KsanaLLM

C++ 509 41 Updated Sep 12, 2025

TencentARC / mllm-npu

mllm-npu: training multimodal large language models on Ascend NPUs

Python 93 2 Updated Aug 29, 2024

liguodongiot / unify-easy-llm

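unify-easy-llm (ULM) aims to provide a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models.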

Python 58 10 Updated Jul 26, 2024

pandada8 / llm-inference-benchmark

Performance benchmarks for LLM inference services

Jupyter Notebook 44 5 Updated Dec 17, 2023

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,817 3,906 Updated Nov 3, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,600 4,610 Updated Nov 3, 2025

xai-org / grok-1

Grok open release

Python 50,555 8,369 Updated Aug 30, 2024

alibaba / EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,171 257 Updated Nov 27, 2024

reworkd / AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,167 9,482 Updated Apr 29, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,131 2,270 Updated Nov 3, 2025

HqWu-HITCS / Awesome-Chinese-LLM

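A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.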

21,606 2,050 Updated May 19, 2025

Liang-ZX / VectorNet

PyTorch implementation of the CVPR 2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

Jupyter Notebook 282 57 Updated May 26, 2022