Ginray
🌶️ Hard, but I believe


A PyTorch native platform for training generative AI models

Python 4,631 590 Updated Nov 2, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 638 76 Updated Nov 3, 2025
Python 1,329 118 Updated Sep 12, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,263 88 Updated Nov 1, 2025

Train a 1B LLM on 1T tokens from scratch as an individual

Jupyter Notebook 744 76 Updated Apr 27, 2025

Implementation of FP8/INT8 rollout for RL training without performance drop.

Python 263 18 Updated Sep 29, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,120 95 Updated Oct 20, 2025
Python 749 49 Updated Sep 3, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,409 162 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 25,599 2,401 Updated Sep 8, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,295 806 Updated Oct 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,028 2,409 Updated Nov 3, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,964 379 Updated Nov 1, 2025

Quick-start tutorial for the Transformers library

Python 1,718 206 Updated Sep 20, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,810 938 Updated Nov 3, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,446 284 Updated Oct 30, 2025

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 129 7 Updated Sep 25, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,073 55 Updated Feb 2, 2025

LLM101n: Let's build a Storyteller

35,422 1,928 Updated Aug 1, 2024
C++ 509 41 Updated Sep 12, 2025

mllm-npu: training multimodal large language models on Ascend NPUs

Python 93 2 Updated Aug 29, 2024

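unify-easy-llm (ULM) aims to provide a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models.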

Python 58 10 Updated Jul 26, 2024

Performance testing for LLM inference services

Jupyter Notebook 44 5 Updated Dec 17, 2023

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,772 3,902 Updated Nov 3, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,594 4,608 Updated Oct 31, 2025

Grok open release

Python 50,553 8,369 Updated Aug 30, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,172 257 Updated Nov 27, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,163 9,484 Updated Apr 29, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,183 363 Updated Aug 14, 2025

Train transformer language models with reinforcement learning.

Python 16,123 2,269 Updated Nov 3, 2025