Skip to content
View pringwong's full-sized avatar
  • Beijing

Block or report pringwong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
22 stars written in Python
Clear filter

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,708 8,401 Updated Sep 20, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,514 9,378 Updated Nov 19, 2025

Universal memory layer for AI Agents

Python 43,399 4,697 Updated Nov 18, 2025

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 27,757 3,489 Updated Sep 23, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,861 1,636 Updated Nov 21, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,683 578 Updated Jan 16, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,505 537 Updated Mar 23, 2025

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,853 843 Updated May 29, 2022

Optimizing inference proxy for LLMs

Python 3,160 246 Updated Nov 20, 2025

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 2,335 228 Updated Nov 7, 2024

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,770 350 Updated Jul 18, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,556 127 Updated Sep 8, 2025

High throughput synchronous and asynchronous reinforcement learning

Python 957 142 Updated Nov 14, 2025

[CVPR 2024] Code release for TransNeXt model

Python 555 24 Updated Jun 13, 2024

一个适合学习、使用、自主扩展的RAG【检索增强生成】系统!可联网做AI搜索

Python 518 50 Updated Sep 4, 2024

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 497 53 Updated Sep 11, 2025

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 441 23 Updated Oct 11, 2023

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 390 12 Updated Jul 9, 2024

WarAgent: LLM-based Multi-Agent Simulation of World Wars

Python 300 40 Updated Mar 5, 2024

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 194 15 Updated Sep 18, 2025

CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.

Python 130 11 Updated Sep 11, 2024

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Python 78 11 Updated Jul 28, 2023