Lists (1)
Sort Name ascending (A-Z)
Starred repositories
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Advanced Python Mastery (course by @dabeaz)
Hierarchical Reasoning Model Official Release
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A lightweight LMM-based Document Parsing Model
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
The open source platform for AI-native application development.
Agent-ready RPA suite with out-of-the-box automation tools. Built for individuals and enterprises.
Align Anything: Training All-modality Model with Feedback
Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Easiest and laziest way for building multi-agent LLMs applications.
The next generation deep reinforcement learning tookit
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
Build multimodal language agents for fast prototype and production
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
SDG is a specialized framework designed to generate high-quality structured tabular data.
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
Applications self-hosting and DevOps platform for running open source, web-based linux Panel of lite PaaS
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code