Skip to content

⭐零门槛的桌面伴侣!支持接入QQ、B站直播、RAG、联网、长期记忆、 酒馆角色卡、claude code 、浏览器控制、Dify、 Home Assistant、MCP、A2A、Comfyui、数字人口播等功能!⭐ A desktop companion with zero barriers! Supports integration with QQ, Bilibili live, RAG, web search, long-term memory, sillyTavern Card, claude code, browser control, HA, Dify, MCP, A2A, ComfyUI,Digital Human broadcast,more!

License

Notifications You must be signed in to change notification settings

heshengtao/super-agent-party

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

image

Introduction

🚀 An AI desktop companion with endless possibilities!

  • ✅ Omnichannel One-Click Deployment: Supports rapid deployment of agent configurations to multiple terminals, already compatible with scenarios such as classic chat interfaces, desktop companion, QQ, Feishu, Discord, Telegram chatbots, Bilibili, YouTube, Twitch live-streaming bots, and more—ready to use out of the box.

  • ✅ Ecosystem Tool Interconnectivity: Freely integrates third-party agents and workflows as toolchains (already adapted for systems like Home Assistant, Claude Code, Qwen Code, Dify, ComfyUI, MCP, A2A, etc.), enabling cross-platform capability aggregation through the Agent-Party architecture.

  • ✅ Rich Extension Ecosystem: Supports custom extension development, with community sharing via the official Extension Marketplace to meet personalized needs.

  • ✅ Standardized interface openness: Provides OpenAI API-compatible interfaces and MCP protocol support, facilitating developers' direct integration with external systems for rapid agent capability transfer and secondary development. The VRM desktop pet supports the VMC protocol, enabling cross-application action synchronization. The VRM desktop pet interface also supports the webXR protocol, allowing immersive experiences on XR-compatible devices.

  • ✅ Seamless capability enhancement: It is possible to achieve an enterprise-level upgrade of the LLM API without code modification, seamlessly integrating knowledge bases, real-time networking, permanent memory, compatible SillyTavern Character Card, code execution, multimodal capabilities (vision/drawing/audition/speech), Automation capabilities (controlling smart homes, controlling browsers), deep thinking control and research, and other modular functions into the existing model interface, creating a pluggable LLM enhancement platform.

  • ✅ Compatible and adaptable across all platforms: It covers the native running environments of Windows/macOS/Linux, supports Docker containerized deployment and web-based cloud services, and meets the needs of multi-scene technology stacks.

Quick Start

Cloud disk download

👉 Click Baidu Cloud to download

👉 Click Google Cloud to download

Windows portable package (Recommended! Installation-free source version, support one-click synchronization to the latest version of the warehouse, no need to wait for the desktop version to be packaged)

👉 Click to download

⭐ Note! You can double-click 一键更新(update).bat to update the software, or you can double-click 一键启动(start).bat to start the software.

⭐ Note! To reduce the installation package size, the npm download logic has been modified. Integration packages of version v0.3.1 and earlier will not update properly. Please download and update to integration packages of v0.3.2 or later!

Windows Desktop Installation

👉 Click to download

⭐ Note! Choose to install only for the current user during installation, otherwise, administrator privileges will be required to start.

macOS portable package (Currently only M chips are supported, developer-oriented, no-install source version, one-click sync to latest repo, no need to wait for desktop build)

👉 Click to download

⭐ Note! You can update the software in Terminal with one-click update(update).sh or launch it with one-click start(start).sh. Remember to grant permissions before use!

⭐ Note! To reduce the installation package size, the npm download logic has been modified. Integration packages of version v0.3.1 and earlier will not update properly. Please download and update to integration packages of v0.3.2 or later!

chmod +x 一键更新(update).sh
./一键更新(update).sh
chmod +x 一键启动(start).sh
./一键启动(start).sh

MacOS Desktop Installation (Currently only M chips are supported.)

👉 Click to download

⭐Attention! After downloading, drag the app file of the dmg file into the /Applications directory, then open end point, execute the following command and enter the root password to remove the attached Quarantine attribute from the network download:

sudo xattr -dr com.apple.quarantine /Applications/Super-Agent-Party.app

Linux Desktop Installation

We provide two mainstream Linux installation package formats for your convenience in different scenarios.

1. Install using .AppImage

.AppImage is a Linux application format that does not require installation and can be used immediately. Suitable for most Linux distributions.

👉 Click to download

2. Install using .deb package (Suitable for Ubuntu/Debian systems)

👉 Click to download

Docker Deployment (This version of the table pet can only be viewed through the browser.)

  • Two commands to install this project:

    docker pull ailm32442/super-agent-party:latest
    docker run -d -p 3456:3456 -v ./super-agent-data:/app/data ailm32442/super-agent-party:latest
  • ⭐Note! ./super-agent-data can be replaced with any local folder, after Docker starts, all data will be cached in this local folder and will not be uploaded anywhere.

  • Plug and play: access http://localhost:3456/

Source Code Deployment

git clone https://github.com/heshengtao/super-agent-party.git
cd super-agent-party
uv sync && npm install && (cd static && npm install) && npm run dev

Software Screenshots

Multi-Service Provider Support: Supports both locally deployed engines (e.g., Ollama, Dify) and cloud service provider APIs.

image

Extensive Tool Integration: Built-in various tools (e.g., knowledge base, web search, smart home control, browser automation), supporting asynchronous invocation without blocking agent responses.

image

VRM Desktop Pet: Supports uploading custom VRM models to create a personalized desktop companion.

image

Extension System: Supports installing extensions and creating new ones. The image below shows a galgame extension.

image

Tavern Character Cards: Supports Tavern character cards with long-term memory capabilities.

image

QQ Bot: Supports one-click deployment to the official QQ bot platform, enabling users to access the agent anytime, anywhere.

image

Developer-Friendly: Provides open OpenAI API and MCP interfaces, allowing the agent to be exposed externally.

image

ComfyUI Integration: Converts ComfyUI workflows into agent tools with load balancing across multiple ComfyUI servers.

image

Extension

Added a brand-new extension system. You can check available plugins at plugins list, or directly view and install them in the party under 【Developer】->【Extensions】. You can also add your own developed extensions to the official plugin list at super-agent-party.github.io!

Hardware requirements

  • CPU: 2 cores or more
  • Memory: 2GB or more

Because all models are optional, you can access the local deployment engine, or you can all use the Cloud as a Service provider interface, so there are few hardware requirements. Test the docker version on the 2-core 2G Cloud as a Service server and it will work fine

Usage

  • Desktop: Click the desktop icon to use immediately.

  • Web or docker: Access http://localhost:3456/ after startup.

  • API call: Developer-friendly, perfectly compatible with OpenAI format, can output in real-time, and does not affect the original API's response speed. No need to modify the calling code:

    from openai import OpenAI
    client = OpenAI(
      api_key="super-secret-key",
      base_url="http://localhost:3456/v1"
    )
    response = client.chat.completions.create(
      model="super-model",
      messages=[
          {"role": "user", "content": "What is Super Agent Party?"}
      ]
    )
    print(response.choices[0].message.content)
  • MCP call: After starting, you can invoke the local MCP service by writing the following content in the configuration file:

    {
      "mcpServers": {
        "super-agent-party": {
          "url": "http://127.0.0.1:3456/mcp",
        }
      }
    }

Features

Please refer to the following document for the main functions:

Feature Details
Supported Model Service Providers Supports common local deployment engine interfaces and cloud service provider interfaces, such as: openai/ollama/dify, etc.
Multimodal Model Integration Integrates various types of models including role-playing, reasoning, vision, image generation, speech recognition, and speech synthesis for combined use.
VRM Desktop Pet Robot Highly customizable with support for custom avatars, custom animations, voice interaction, and dialogue interruption.It can be streamed transparently to screen recording software such as OBS, and supports bidirectional VMC protocol!
Messaging Platform Bots Currently supports QQ, Feishu, Discord, Telegram, with more platforms to be added later
Live Streaming Bots Currently supports Bilibili, YouTube, Twitch, with more platforms to be added later
Announcer Bot Supports long text narration, multi-voice narration, digital human video narration, batch conversion of ultra-long texts to speech (with download capability), and parsing of common ebook formats like EPUB. Chapter-based conversion will be developed later.
Chat Interface The chat interface supports front-end rendering features such as mathematical formulas, mermaid diagrams, and HTML code graphics. Images can be downloaded or copied.Supports capsule mode and assistant mode, making it easy to shrink and dock the conversation interface. Combined with desktop vision and screenshots, it seamlessly integrates into work and entertainment
Role-Playing Supports uploading, editing, and downloading tavern character cards. Allows configuring different voices and avatars for different characters. Features long-term memory, multi-voice support when using character cards, narrator voice for non-character text, and supports emojis and memes.
Extensive Native Tools Tool calling supports asynchronous execution, including web search, knowledge base access, smart home control, browser control, code execution in sandbox environments, controlling ComfyUI for image generation, and Claude code operates the file system.
Custom Tool Interfaces Supports MCP, A2A, HTTP requests, and any LLM interface as tools for the main agent, enabling users to freely customize their agent's toolchain.
Open External APIs Developer-friendly with open APIs simulating OpenAI and MCP, as well as desktop pet APIs.
Extension System You can check out the available plugins in the Extension List. You can also view and install plugins directly in Party by navigating to 【Developer】 -> 【Extensions】. You can add your own developed extensions to the official extension list at super-agent-party.github.io!
Storage Space All files and data are stored in the user's local data folder. When deployed with NAS, it can also serve as a personal image or file host within the local network.

Disclaimer:

This open-source project and its content (hereinafter referred to as the "project") are for reference only and do not imply any explicit or implicit warranties. The project contributors do not assume any responsibility for the completeness, accuracy, reliability, or applicability of the project. Any behavior that relies on the project content shall be at the user's own risk. In any case, the project contributors shall not be liable for any indirect, special, or incidental losses or damages arising from the use of the project content.

Special Note

Some features of this open-source project (such as Edge TTS voice synthesis, Bilibili WebSocket danmaku monitoring, etc.) rely on public interfaces or experimental functionalities provided by third-party services. These features may become unavailable at any time due to policy changes by the third parties. The developers are not responsible for their stability, legality, or continuity.

The QQ bot utilizes the official QQ bot interface. Please comply with the AIGC Access to QQ Bot Guidelines.

By using these features, users acknowledge and agree to assume the associated risks. The developers do not recommend or encourage the use of these features for commercial or large-scale deployment scenarios.

License Agreement

This project uses a dual licensing model:

  1. By default, this project follows the GNU Affero General Public License v3.0 (AGPLv3) license agreement
  2. If you need to use this project for closed-source commercial purposes, you must obtain a commercial license from the project administrator. Business cooperation: [email protected]

Using this project for closed-source commercial purposes without written authorization is considered a violation of this agreement. The complete text of AGPLv3 can be found in the LICENSE file in the project root directory or at gnu.org/licenses.

Third-Party License Notices

This project may include or depend on certain third-party libraries or components, whose licenses may differ from the main project's license. To comply with the relevant license requirements, you can find the license information for these third-party components in the LICENSE-third-party folder located in the project's root directory, or within the corresponding component's source code.

We extend our gratitude to all contributors of the third-party libraries and components and are committed to respecting their license terms.

Support:

Please star us!

⭐Your support is the driving force for us to move forward!

star

Follow us

Join the Community

If you have any questions or issues with the project, you are welcome to join our community.

  1. QQ Group: 931057213
  1. WeChat Group: we_glm (add the assistant's WeChat and join the group)

  2. Discord: Discord link

Star History

Star History Chart

About

⭐零门槛的桌面伴侣!支持接入QQ、B站直播、RAG、联网、长期记忆、 酒馆角色卡、claude code 、浏览器控制、Dify、 Home Assistant、MCP、A2A、Comfyui、数字人口播等功能!⭐ A desktop companion with zero barriers! Supports integration with QQ, Bilibili live, RAG, web search, long-term memory, sillyTavern Card, claude code, browser control, HA, Dify, MCP, A2A, ComfyUI,Digital Human broadcast,more!

Topics

Resources

License

Contributing

Stars

Watchers

Forks