Skip to content
View PyrekP's full-sized avatar

Highlights

  • Pro

Block or report PyrekP

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

76 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,828 29,616 Updated Jul 11, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 91,469 24,656 Updated Jul 12, 2025

Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to clo…

Python 65,583 24,045 Updated Jul 10, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 61,800 6,248 Updated Aug 24, 2024

Python tool for converting files and office documents to Markdown.

Python 60,199 3,167 Updated Jun 4, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,572 17,049 Updated Jul 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 52,041 8,652 Updated Jul 12, 2025

Ultralytics YOLO11 🚀

Python 42,964 8,395 Updated Jul 12, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,306 4,461 Updated Jul 11, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,767 3,533 Updated Jul 11, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 29,627 3,512 Updated Jul 12, 2025

The official Meta Llama 3 GitHub site

Python 28,834 3,415 Updated Jan 26, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,855 2,614 Updated Apr 30, 2025

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…

Python 26,546 4,751 Updated Oct 15, 2023

Fully open reproduction of DeepSeek-R1

Python 25,013 2,330 Updated Jul 10, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,801 1,441 Updated Jun 30, 2025

Fast and memory-efficient exact attention

Python 18,303 1,801 Updated Jul 11, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,441 2,240 Updated Feb 1, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,932 2,322 Updated Jul 12, 2025

Powerline is a statusline plugin for vim, and provides statuslines and prompts for several other applications, including zsh, bash, tmux, IPython, Awesome and Qtile.

Python 14,560 1,001 Updated Sep 30, 2024

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,227 1,825 Updated Jul 3, 2024

Ongoing research training transformer models at scale

Python 12,845 2,924 Updated Jul 11, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,001 1,490 Updated Apr 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,856 1,790 Updated Jul 12, 2025

Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to another, perform example-based upscaling, but wait..…

Python 9,895 907 Updated Oct 1, 2020

Fully automated homelab from empty disk to running services with a single command.

Python 8,715 822 Updated Jun 29, 2025

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,260 1,069 Updated Jul 1, 2025

The Sphinx documentation generator

Python 7,213 2,233 Updated Jul 10, 2025

Example models using DeepSpeed

Python 6,567 1,101 Updated Jul 8, 2025
Next