🌶️
Tough, but I believe

137 results for source starred repositories

A PyTorch native platform for training generative AI models

Python 4,643 594 Updated Nov 4, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 649 76 Updated Nov 4, 2025
Python 1,330 119 Updated Sep 12, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,266 89 Updated Nov 4, 2025

Train a 1B LLM with 1T tokens from scratch as an individual

Jupyter Notebook 744 76 Updated Apr 27, 2025

Implementation of FP8/INT8 rollout for RL training without a performance drop.

Python 263 18 Updated Sep 29, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,124 96 Updated Oct 20, 2025
Python 749 49 Updated Sep 3, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,409 163 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 25,603 2,401 Updated Sep 8, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,305 807 Updated Oct 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,061 2,416 Updated Nov 4, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,964 379 Updated Nov 4, 2025

Quick-start tutorial for the Transformers library

Python 1,719 206 Updated Sep 20, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,837 940 Updated Nov 3, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,458 285 Updated Nov 4, 2025

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 129 7 Updated Sep 25, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,073 55 Updated Feb 2, 2025
C++ 509 41 Updated Sep 12, 2025

mllm-npu: training multimodal large language models on Ascend NPUs

Python 93 2 Updated Aug 29, 2024

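unify-easy-llm (ULM) aims to be a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models.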

Python 58 10 Updated Jul 26, 2024

Performance testing for LLM inference services

Jupyter Notebook 44 5 Updated Dec 17, 2023

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,817 3,906 Updated Nov 3, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,600 4,610 Updated Nov 3, 2025

Grok open release

Python 50,555 8,369 Updated Aug 30, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,171 257 Updated Nov 27, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,167 9,482 Updated Apr 29, 2025

Train transformer language models with reinforcement learning.

Python 16,131 2,270 Updated Nov 3, 2025

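A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are inexpensive to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.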

21,606 2,050 Updated May 19, 2025

PyTorch implementation of the CVPR 2020 paper "VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation"

Jupyter Notebook 282 57 Updated May 26, 2022