pringwong

Pring Wong pringwong

The ugly people can't step into the same river.

5 followers · 10 following

Beijing

Lists (3)

Sort

Stars

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 496 53 Updated Sep 11, 2025

TorchCraft / TorchCraftAI

A platform that lets you build agents to learn to play StarCraft: Brood War.

C++ 655 123 Updated Aug 31, 2021

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,458 9,374 Updated Nov 19, 2025

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,156 246 Updated Nov 20, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,847 371 Updated Oct 17, 2025

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,770 350 Updated Jul 18, 2024

mem0ai / mem0

Universal memory layer for AI Agents

Python 43,367 4,695 Updated Nov 18, 2025

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,702 8,401 Updated Sep 20, 2025

yuntianhe2014 / Easy-RAG

一个适合学习、使用、自主扩展的RAG【检索增强生成】系统！可联网做AI搜索

Python 518 50 Updated Sep 4, 2024

DaiShiResearch / TransNeXt

[CVPR 2024] Code release for TransNeXt model

Python 555 24 Updated Jun 13, 2024

Meituan-AutoML / VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 390 12 Updated Jul 9, 2024

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,306 13,701 Updated Nov 20, 2025

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,853 843 Updated May 29, 2022

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,493 537 Updated Mar 23, 2025

bytedance / kmax-deeplab

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Python 78 11 Updated Jul 28, 2023

OpenBMB / ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 27,751 3,489 Updated Sep 23, 2025

hustvl / YOLOS

[NeurIPS 2021] You Only Look at One Sequence

Jupyter Notebook 895 125 Updated May 4, 2022

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,290 434 Updated Nov 21, 2025

labmlai / inspectus

LLM Analytics

TypeScript 696 33 Updated Oct 19, 2024

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,858 1,636 Updated Nov 20, 2025

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 66,607 6,952 Updated Nov 17, 2025

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 2,335 228 Updated Nov 7, 2024

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,680 578 Updated Jan 16, 2025

nicklashansen / puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 194 15 Updated Sep 18, 2025

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,556 127 Updated Sep 8, 2025

Tencent-Hunyuan / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,267 360 Updated Oct 26, 2025

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 955 142 Updated Nov 14, 2025

bigai-ai / civrealm

CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.

Python 130 11 Updated Sep 11, 2024

agiresearch / WarAgent

WarAgent: LLM-based Multi-Agent Simulation of World Wars

Python 300 40 Updated Mar 5, 2024

Joyce94 / LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 441 23 Updated Oct 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly