coding-famer

🤔

Thinking

Chenhe coding-famer

LLM/MLLM Alignment @ Li Auto. Previously CS Master's student @ UC Irvine

29 followers · 175 following

University of California, Irvine
Irvine, CA
09:27 (UTC -08:00)
https://coding-famer.github.io/
@Chenhe_Gu

Achievements

Highlights

Developer Program Member
Pro

Lists (6)

algorithm

basic

cpp

interview

ml (21 repositories)

OS

Starred repositories

mega002 / llm-interp-tau

Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University

98 stars · 10 forks · Updated Nov 18, 2025

pettingllms-ai / PettingLLMs

An RL framework for multi-LLM agent systems

Python · 66 stars · 9 forks · Updated Nov 18, 2025

zsc / vla_tutorial

HTML · 20 stars · 1 fork · Updated Nov 2, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python · 363 stars · 23 forks · Updated Nov 10, 2025

lukahhcm / Awesome_Environment_Scaling

Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.

25 stars · 2 forks · Updated Nov 14, 2025

RLsys-Foundation / TritonForge

🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation feedback, cross-platform NVIDIA/AMD, Kernelbook + KernelBench

Python · 100 stars · 2 forks · Updated Nov 10, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX · 2,061 stars · 117 forks · Updated Nov 9, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python · 7,846 stars · 607 forks · Updated Nov 19, 2025

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,211 stars · 53 forks · Updated Sep 5, 2025

tanishqkumar / beyond-nanogpt

Minimal and annotated implementations of key ideas from modern deep learning research.

Python · 1,204 stars · 97 forks · Updated Sep 28, 2025

AI-secure / PolyGuard

Python · 14 stars · 2 forks · Updated Jun 18, 2025

vnpy / vnpy

Python-based open-source framework for developing quantitative trading platforms

Python · 33,956 stars · 10,369 forks · Updated Nov 2, 2025

R100001 / Programming-Massively-Parallel-Processors

Cuda · 199 stars · 39 forks · Updated Aug 2, 2024

hsliuping / TradingAgents-CN

Chinese-language financial trading framework based on multi-agent LLMs - enhanced Chinese edition of TradingAgents

Python · 13,018 stars · 2,807 forks · Updated Nov 19, 2025

changyeyu / LLM-RL-Visualized

🌟 100+ original LLM / RL schematic diagrams 📚, presented by the author of 《大模型算法》 ("Large Model Algorithms")! 💥 (100+ LLM/RL Algorithm Maps)

Python · 1,866 stars · 201 forks · Updated Nov 13, 2025

cmriat / l0

A scalable, end-to-end training pipeline for general-purpose agents

Python · 361 stars · 54 forks · Updated Jul 4, 2025

TauricResearch / TradingAgents

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python · 25,218 stars · 4,709 forks · Updated Oct 9, 2025

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python · 666 stars · 87 forks · Updated Nov 19, 2025

spiral-rl / spiral

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python · 160 stars · 18 forks · Updated Sep 18, 2025

ChenxinAn-fdu / POLARIS

Scaling RL on advanced reasoning models

Python · 633 stars · 40 forks · Updated Oct 20, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python · 2,511 stars · 264 forks · Updated Nov 19, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python · 9,088 stars · 1,100 forks · Updated Nov 3, 2025

skyzh / tiny-llm

A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python · 3,413 stars · 232 forks · Updated Nov 2, 2025

thinkwee / AgentsMeetRL

Awesome List for Agentic RL

HTML · 542 stars · 16 forks · Updated Nov 9, 2025

karminski / one-small-step

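A simple popular-science tutorial project on technology, focused on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable in under 5 minutes.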

6,361 stars · 581 forks · Updated Nov 10, 2025

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python · 345 stars · 35 forks · Updated Nov 16, 2025

ColinLu50 / SafeDelta

The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.

Python · 56 stars · 9 forks · Updated Jun 27, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook · 2,401 stars · 186 forks · Updated Nov 18, 2025

ruanyf / weekly

Tech Enthusiast Weekly, published every Friday

79,267 stars · 3,720 forks · Updated Nov 14, 2025

eddycjy / go-gin-example

An example of gin

Go · 7,114 stars · 1,615 forks · Updated Jul 7, 2023

Starred topics

adversarial-machine-learning

ai-security

backdoor-attacks

targeted-adversarial-attacks

adversarial-attacks

Computer vision