Starred repositories
Distributed Compiler based on Triton for Parallel Systems
🗂 The perfect Front-End Checklist for modern websites and meticulous developers
slime is an LLM post-training framework for RL Scaling.
The AI coding agent built for the terminal.
MiroRL is an MCP-first reinforcement learning framework for deep research agents.
Democratizing Reinforcement Learning for LLMs
A curated list of awesome commands, files, and workflows for Claude Code
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
SkyRL: A Modular Full-stack RL Library for LLMs
Accelerate LLM preference tuning via prefix sharing with a single line of code
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
LLM training code for Databricks foundation models
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
ByteCheckpoint: A Unified Checkpointing Library for LFMs
A domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
12 Lessons to Get Started Building AI Agents
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
FlashMLA: Efficient Multi-head Latent Attention Kernels
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation