Starred repositories

Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University

98 10 Updated Nov 18, 2025

An RL framework for multi-LLM agent systems

Python 66 9 Updated Nov 18, 2025
HTML 20 1 Updated Nov 2, 2025

A Gym for Agentic LLMs

Python 363 23 Updated Nov 10, 2025

Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.

25 2 Updated Nov 14, 2025

🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation feedback, cross-platform NVIDIA/AMD, Kernelbook + KernelBench

Python 100 2 Updated Nov 10, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,061 117 Updated Nov 9, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,846 607 Updated Nov 19, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,204 97 Updated Sep 28, 2025
Python 14 2 Updated Jun 18, 2025

A Python-based open-source framework for developing quantitative trading platforms

Python 33,956 10,369 Updated Nov 2, 2025

A multi-agent LLM financial trading framework for Chinese users - TradingAgents Chinese-enhanced edition

Python 13,018 2,807 Updated Nov 19, 2025

🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of the book 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)

Python 1,866 201 Updated Nov 13, 2025

A scalable, end-to-end training pipeline for general-purpose agents

Python 361 54 Updated Jul 4, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 25,218 4,709 Updated Oct 9, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 666 87 Updated Nov 19, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python 160 18 Updated Sep 18, 2025

Scaling RL on advanced reasoning models

Python 633 40 Updated Oct 20, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,511 264 Updated Nov 19, 2025

Nano vLLM

Python 9,088 1,100 Updated Nov 3, 2025

A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,413 232 Updated Nov 2, 2025

Awesome List for Agentic RL

HTML 542 16 Updated Nov 9, 2025

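A simple technical explainer tutorial project, focused on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable within 5 minutes.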

6,361 581 Updated Nov 10, 2025

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 345 35 Updated Nov 16, 2025

The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.

Python 56 9 Updated Jun 27, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,401 186 Updated Nov 18, 2025

Tech Enthusiast Weekly, published every Friday

79,267 3,720 Updated Nov 14, 2025

An example of gin

Go 7,114 1,615 Updated Jul 7, 2023