Skip to content
View pringwong's full-sized avatar
  • Beijing

Block or report pringwong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 496 53 Updated Sep 11, 2025

A platform that lets you build agents to learn to play StarCraft: Brood War.

C++ 655 123 Updated Aug 31, 2021

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,458 9,374 Updated Nov 19, 2025

Optimizing inference proxy for LLMs

Python 3,156 246 Updated Nov 20, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,847 371 Updated Oct 17, 2025

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,770 350 Updated Jul 18, 2024

Universal memory layer for AI Agents

Python 43,367 4,695 Updated Nov 18, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,702 8,401 Updated Sep 20, 2025

一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索

Python 518 50 Updated Sep 4, 2024

[CVPR 2024] Code release for TransNeXt model

Python 555 24 Updated Jun 13, 2024

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 390 12 Updated Jul 9, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,306 13,701 Updated Nov 20, 2025

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,853 843 Updated May 29, 2022

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,493 537 Updated Mar 23, 2025

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Python 78 11 Updated Jul 28, 2023

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 27,751 3,489 Updated Sep 23, 2025

[NeurIPS 2021] You Only Look at One Sequence

Jupyter Notebook 895 125 Updated May 4, 2022

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,290 434 Updated Nov 21, 2025

LLM Analytics

TypeScript 696 33 Updated Oct 19, 2024

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,858 1,636 Updated Nov 20, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 66,607 6,952 Updated Nov 17, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 2,335 228 Updated Nov 7, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,680 578 Updated Jan 16, 2025

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 194 15 Updated Sep 18, 2025

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,556 127 Updated Sep 8, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,267 360 Updated Oct 26, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 955 142 Updated Nov 14, 2025

CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.

Python 130 11 Updated Sep 11, 2024

WarAgent: LLM-based Multi-Agent Simulation of World Wars

Python 300 40 Updated Mar 5, 2024

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 441 23 Updated Oct 11, 2023
Next