peterjc123

Organizations

@llvm

Starred repositories


Train your Agent model via our easy and efficient framework

Python 1,157 97 Updated Jun 19, 2025

Scalable toolkit for efficient model reinforcement

Python 438 47 Updated Jun 19, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,381 311 Updated May 13, 2025

Fully open data curation for reasoning models

Python 1,927 163 Updated Jun 5, 2025

Redis for LLMs

Python 1,440 230 Updated Jun 19, 2025
Python 4 Updated Feb 24, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 990 48 Updated May 24, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,617 96 Updated Mar 18, 2025

Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen…

Python 22 3 Updated Aug 15, 2024

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Python 176 21 Updated Jun 19, 2025

An unofficial cuda assembler, for all generations of SASS, hopefully :)

Python 83 10 Updated Mar 20, 2023

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,460 622 Updated Jun 16, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,612 866 Updated Apr 29, 2025
Python 203 8 Updated Feb 20, 2025
Python 4,357 354 Updated Jun 12, 2025

Official Repo for Open-Reasoner-Zero

Python 1,968 104 Updated Jun 2, 2025

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

Python 8,192 613 Updated Jun 19, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,355 160 Updated Mar 20, 2025

Inject code into running Python processes

Python 2,827 220 Updated Apr 7, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,300 103 Updated Jun 19, 2025

Fully open reproduction of DeepSeek-R1

Python 24,838 2,296 Updated Jun 2, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,917 1,489 Updated Apr 24, 2025

Simple RL training for reasoning

Python 3,633 271 Updated Apr 10, 2025

[NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning

Python 17 Updated May 31, 2025

Same as llm.c but in Rust, as I want to get deeper and deeper into Rust programming

Rust 59 5 Updated Jan 13, 2025

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,620 197 Updated Apr 8, 2025

A self-hosted, ad-free, privacy-respecting metasearch engine

Python 10,752 1,007 Updated Jun 18, 2025