pringwong

Pring Wong pringwong

The ugly people can't step into the same river.

5 followers · 10 following

Beijing

Lists (3)

Sort

Stars

22 stars written in Python

Clear filter

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,708 8,401 Updated Sep 20, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,514 9,378 Updated Nov 19, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 43,399 4,697 Updated Nov 18, 2025

OpenBMB / ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Python 27,757 3,489 Updated Sep 23, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,861 1,636 Updated Nov 21, 2025

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,683 578 Updated Jan 16, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,505 537 Updated Mar 23, 2025

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,853 843 Updated May 29, 2022

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,160 246 Updated Nov 20, 2025

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curatio…

Python 2,335 228 Updated Nov 7, 2024

marlbenchmark / on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,770 350 Updated Jul 18, 2024

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,556 127 Updated Sep 8, 2025

alex-petrenko / sample-factory

High throughput synchronous and asynchronous reinforcement learning

Python 957 142 Updated Nov 14, 2025

DaiShiResearch / TransNeXt

[CVPR 2024] Code release for TransNeXt model

Python 555 24 Updated Jun 13, 2024

yuntianhe2014 / Easy-RAG

一个适合学习、使用、自主扩展的RAG【检索增强生成】系统！可联网做AI搜索

Python 518 50 Updated Sep 4, 2024

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 497 53 Updated Sep 11, 2025

Joyce94 / LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 441 23 Updated Oct 11, 2023

Meituan-AutoML / VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Python 390 12 Updated Jul 9, 2024

agiresearch / WarAgent

WarAgent: LLM-based Multi-Agent Simulation of World Wars

Python 300 40 Updated Mar 5, 2024

nicklashansen / puppeteer

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 194 15 Updated Sep 18, 2025

bigai-ai / civrealm

CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.

Python 130 11 Updated Sep 11, 2024

bytedance / kmax-deeplab

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Python 78 11 Updated Jul 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly