Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A generative speech model for daily dialogue.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Open standard for machine learning interoperability
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
PaddleScience is SDK and library for developing AI-driven scientific computing applications based on PaddlePaddle.
様々な環境向けの WebRTC のビルドを行って、そのバイナリを提供しています


