Harry8207

Stars

infiniflow / ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 67,974 7,292 Updated Nov 19, 2025

Agent-on-the-Fly / Memento

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 1,884 212 Updated Oct 5, 2025

k2-fsa / icefall

Python 1,282 376 Updated Oct 5, 2025

k2-fsa / ZipVoice

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 713 95 Updated Nov 12, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

96,207 25,885 Updated Nov 18, 2025

networkx / networkx

Network Analysis in Python

Python 16,347 3,427 Updated Nov 14, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 43,314 4,692 Updated Nov 18, 2025

guclan / techbooks

Basic programming books and materials

8 4 Updated Jun 5, 2019

MotiaDev / motia

Multi-Language Backend Framework that unifies APIs, background jobs, queues, workflows, streams, and AI agents with a single core primitive with built-in observability and state management.

TypeScript 10,248 811 Updated Nov 19, 2025

FlowiseAI / Flowise

Build AI Agents, Visually

TypeScript 46,674 23,130 Updated Nov 19, 2025

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,421 401 Updated Apr 20, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,510 1,374 Updated Oct 1, 2025

NanGePlus / LangGraphChatBot

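A web-based example of an intelligent customer-service agent with memory that recommends mobile data plans, built with LangGraph + DeepSeek-R1 + FastAPI + Gradio. Also supports GPT models, Chinese domestic LLMs (via OneApi), local open-source models through Ollama, and Alibaba's Tongyi Qianwen models.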

Python 343 62 Updated Apr 14, 2025

NanGePlus / AutoGenV04Test

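AutoGen's new v0.4 architecture has shipped its first stable release. v0.4 is a from-scratch rewrite of AutoGen, intended to provide a more robust, extensible, and easier-to-use cross-language library for building agents. Its API follows a layered architecture, with several sets of interfaces to meet the needs of different scenarios.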

Python 113 27 Updated Apr 14, 2025

harry0703 / AudioNotes

Quickly extract content from audio and video and organize it into structured Markdown notes

Python 1,933 279 Updated Jul 26, 2024

Goldziher / kreuzberg

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.

HTML 2,516 113 Updated Nov 19, 2025

Zackriya-Solutions / meeting-minutes

A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that can run completely on your local device (macOS and Windows support added. Working on addi…

Rust 8,299 664 Updated Nov 19, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,988 385 Updated Nov 19, 2025

jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 9,138 827 Updated Nov 8, 2025

datalab-to / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 29,894 2,020 Updated Nov 7, 2025

mermaid-js / mermaid

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 84,172 8,299 Updated Nov 19, 2025

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 24,636 2,035 Updated Jul 4, 2025

VITA-MLLM / VITA

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,446 178 Updated Mar 28, 2025

gomate-community / TrustRAG

TrustRAG: The RAG Framework within Reliable input, Trusted output

Python 1,179 122 Updated Oct 21, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,175 4,146 Updated Jul 6, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 119,269 18,483 Updated Nov 19, 2025

antgroup / echomimic_v2

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,370 513 Updated Aug 11, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,645 294 Updated Aug 14, 2025

MyNiuuu / MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 754 49 Updated Dec 5, 2024

yerfor / MimicTalk

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 790 97 Updated Oct 16, 2024
