27 stars written in Python

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,810 31,415 Updated Dec 12, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 95,849 26,204 Updated Dec 13, 2025

Inference code for Llama models

Python 58,983 9,812 Updated Jan 26, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 56,346 17,364 Updated Dec 8, 2025

Ultralytics YOLO 🚀

Python 49,842 9,639 Updated Dec 13, 2025

Generate high-definition short videos with one click using AI large language models.

Python 48,262 6,785 Updated Jun 11, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,988 4,663 Updated Dec 12, 2025

A generative speech model for daily dialogue.

Python 38,325 4,159 Updated Dec 3, 2025

🚀🚀 [Large models] Train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 35,552 4,181 Updated Dec 11, 2025

SOTA Open Source TTS

Python 24,323 1,995 Updated Dec 1, 2025

Open standard for machine learning interoperability

Python 20,015 3,844 Updated Dec 13, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,632 2,334 Updated Dec 13, 2025

Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Python 17,633 1,953 Updated Oct 21, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,567 1,963 Updated Dec 2, 2025

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 15,849 1,185 Updated Dec 13, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,132 1,173 Updated Mar 14, 2025

Spark-TTS Inference Code

Python 10,811 1,154 Updated Apr 9, 2025

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,449 1,599 Updated Aug 9, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,350 1,093 Updated Dec 10, 2025

Towards Human-Sounding Speech

Python 5,808 499 Updated Dec 5, 2025

🚀 [Large models] Train a 26M-parameter visual multimodal VLM from scratch in just 1 hour! 🌏

Python 5,645 594 Updated Dec 5, 2025

A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.

Python 3,008 617 Updated Nov 16, 2025

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,637 235 Updated Dec 8, 2025

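百聆 (Bailing) is a GPT-4o-like voice conversation bot built from ASR + LLM + TTS. It integrates strong large models such as DeepSeek R1, has latency as low as 800 ms, runs on low-spec machines such as Macs, and supports interruption.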

Python 1,545 261 Updated Jul 31, 2025

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

Python 1,115 191 Updated Nov 16, 2025

PaddleScience is an SDK and library for developing AI-driven scientific computing applications based on PaddlePaddle.

Python 427 233 Updated Dec 10, 2025

Builds WebRTC for a variety of environments and provides the resulting binaries.

Python 274 103 Updated Dec 12, 2025