peterjc123

317 followers · 57 following

Shanghai, China
peterjc.stream

Achievements

x3 x2

Highlights

Developer Program Member

Organizations

Lists (21)

Starred repositories

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,157 97 Updated Jun 19, 2025

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 438 47 Updated Jun 19, 2025

nishtahir / antec-flux-pro-display

Rust 7 1 Updated Apr 7, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,381 311 Updated May 13, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 1,927 163 Updated Jun 5, 2025

LMCache / LMCache

Redis for LLMs

Python 1,440 230 Updated Jun 19, 2025

Hong-Lab-UMN-ECE / IRLAlignment

Python 4 Updated Feb 24, 2025

Hong-Lab-UMN-ECE / Reward_learning_SFT

Python 7 Updated Mar 4, 2025

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 990 48 Updated May 24, 2025

Small-Model-Gap / Small-Model-Learnability-Gap

Python 13 Updated Mar 9, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,617 96 Updated Mar 18, 2025

KaihuaTang / Qwen-Tokenizer-Pruner

Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen…

Python 22 3 Updated Aug 15, 2024

JT-Ushio / MHA2MLA

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Python 176 21 Updated Jun 19, 2025

OpenPPL / CuAssembler

Forked from cloudcores/CuAssembler

An unofficial cuda assembler, for all generations of SASS, hopefully ：）

Python 83 10 Updated Mar 20, 2023

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,460 622 Updated Jun 16, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

Cuda 11,612 866 Updated Apr 29, 2025

GAIR-NLP / LIMR

Python 203 8 Updated Feb 20, 2025

stepfun-ai / Step-Audio

Python 4,357 354 Updated Jun 12, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,968 104 Updated Jun 2, 2025

oumi-ai / oumi

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

Python 8,192 613 Updated Jun 19, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,355 160 Updated Mar 20, 2025

lmacken / pyrasite

Inject code into running Python processes

Python 2,827 220 Updated Apr 7, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,300 103 Updated Jun 19, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,838 2,296 Updated Jun 2, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,917 1,489 Updated Apr 24, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,633 271 Updated Apr 10, 2025

sufenlp / MiLoRA

[NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning

Python 17 Updated May 31, 2025

Steboss / llm.rust

Same as llm.c but in Rust, as I want to get deeper and deeper into Rust programming

Rust 59 5 Updated Jan 13, 2025

AK391 / ai-gradio

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,620 197 Updated Apr 8, 2025

benbusby / whoogle-search

A self-hosted, ad-free, privacy-respecting metasearch engine

Python 10,752 1,007 Updated Jun 18, 2025

Lists (21)

Awesome rust projects

CUDA / GPU Computing

Deep learning

Gaming

Generic algorithms

Generic computing

LLM agents

LLM applications

LLM datasets

LLM enhancement

LLM evaluation

LLM frameworks

LLM models

LLM papers

LLM resources

LLM speedup

Prompt engineering

Resources

Role acting LLM

Tools

Web

Starred repositories

large-language-models

llama

Large Language Model

neon

gpu-acceleration

tensors

performance

PyTorch

Deep learning

network-compression