cube studio开源云原生一站式机器学习/深度学习/大模型AI平台，mlops算法链路全流程，算力租赁平台，notebook在线开发，拖拉拽任务流pipeline编排，多机多卡分布式训练，超参搜索，推理服务VGPU虚拟化，边缘计算，标注平台自动化标注，deepseek等大模型sft微调/奖励模型/强化学习训练，vllm/ollama/mindie大模型多机推理，私有知识库，AI模型市场…

Python 4,762 832 Updated Nov 7, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 4,610 507 Updated Nov 27, 2025

fudan-generative-vision / champ

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,243 484 Updated Jul 10, 2024

GreaterWMS / GreaterWMS

This Inventory management system is the currently Ford Asia Pacific after-sales logistics warehousing supply chain process . After I leave Ford , I start this project . You can share your vacant wa…

Python 4,203 1,118 Updated Sep 26, 2025

PaddlePaddle / PaddleRec

Recommendation Algorithm大规模推荐算法库，包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM，DSIN，SIGN，IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,068 654 Updated Apr 2, 2025

FedML-AI / FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…

Python 3,992 763 Updated Oct 28, 2025

fudan-generative-vision / hallo2

[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,667 531 Updated Feb 27, 2025

LazyAGI / LazyLLM

Easiest and laziest way for building multi-agent LLMs applications.

Python 3,641 352 Updated Dec 25, 2025

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,502 302 Updated Nov 5, 2024

Docta-ai / docta

A Doctor for your data

Python 3,493 256 Updated Jan 14, 2025

TJU-DRL-LAB / AI-Optimizer

The next generation deep reinforcement learning tookit

Python 3,461 596 Updated Jun 16, 2023

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,451 464 Updated Dec 18, 2025

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

SkyworkAI / Skywork-R1V

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.

Python 3,133 274 Updated Dec 15, 2025

tinyvision / DAMO-YOLO

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,124 398 Updated May 25, 2024

nesaorg / nesa

Run AI models end-to-end encrypted.

Python 3,000 238 Updated Feb 10, 2025

guoqincode / Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Python 2,935 242 Updated Jul 9, 2024

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,919 273 Updated Oct 6, 2025

OpenBMB / BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,788 251 Updated Dec 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ws2644

Block or report ws2644

Stars

OpenBMB / ChatDev

HKUDS / LightRAG

HKUDS / DeepCode

sapientinc / HRM

facebookresearch / vggt

fudan-generative-vision / hallo

microsoft / UFO

X-PLUG / MobileAgent

Yuliang-Liu / MonkeyOCR

Klavis-AI / klavis

TaskingAI / TaskingAI

tencentmusic / cube-studio