LLaMA Factory Document

154 34 Updated Nov 6, 2025

A PyTorch native platform for training generative AI models

Python 4,701 601 Updated Nov 14, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 699 77 Updated Nov 13, 2025
Python 1,337 120 Updated Sep 12, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,290 95 Updated Nov 13, 2025

Train a 1B LLM with 1T tokens from scratch as an individual

Jupyter Notebook 751 76 Updated Apr 27, 2025

Implementation of FP8/INT8 rollout for RL training without performance drop.

Python 269 17 Updated Nov 7, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,174 101 Updated Oct 20, 2025
Python 748 49 Updated Sep 3, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,412 162 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 25,640 2,399 Updated Sep 8, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,371 811 Updated Nov 9, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,566 2,516 Updated Nov 13, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,985 384 Updated Nov 13, 2025

Quick-start tutorial for the Transformers library

Python 1,733 207 Updated Sep 20, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 11,033 969 Updated Nov 13, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,518 289 Updated Nov 13, 2025

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 129 7 Updated Sep 25, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,077 56 Updated Feb 2, 2025

LLM101n: Let's build a Storyteller

35,543 1,933 Updated Aug 1, 2024
C++ 512 42 Updated Sep 12, 2025

mllm-npu: training multimodal large language models on Ascend NPUs

Python 94 2 Updated Aug 29, 2024

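unify-easy-llm (ULM) aims to be a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs, as well as commonly used large models.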

Python 58 10 Updated Jul 26, 2024

Performance testing for LLM inference services

Jupyter Notebook 44 5 Updated Dec 17, 2023

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,250 3,961 Updated Nov 12, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,687 4,626 Updated Nov 14, 2025

Grok open release

Python 50,565 8,373 Updated Aug 30, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,172 257 Updated Nov 27, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,223 9,487 Updated Apr 29, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,189 365 Updated Aug 14, 2025