Skip to content
View MoonBall's full-sized avatar

Organizations

@nodejs

Block or report MoonBall

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Inference code for Llama models

Python 58,438 9,779 Updated Jan 26, 2025

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 80,814 7,724 Updated Jun 29, 2025

An annotated implementation of the Transformer paper.

Jupyter Notebook 6,314 1,365 Updated Apr 7, 2024

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 3,153 320 Updated Apr 30, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 19,723 2,022 Updated Jun 28, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 23,712 5,689 Updated Aug 14, 2024

Memray is a memory profiler for Python

Python 14,074 411 Updated Jun 12, 2025

Hands-On GPU Programming with Python and CUDA, published by Packt

Python 388 172 Updated Aug 10, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,596 544 Updated May 3, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,360 1,261 Updated Jun 23, 2025

Video+code lecture on building nanoGPT from scratch

Python 4,184 640 Updated Aug 13, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,385 1,601 Updated Jun 29, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,105 1,666 Updated Jun 29, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,841 279 Updated May 15, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 46,637 5,932 Updated Jun 29, 2025

The official Typescript SDK for Model Context Protocol servers and clients

TypeScript 8,095 981 Updated Jun 27, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,441 1,740 Updated Jun 29, 2025

A guidance language for controlling large language models.

Jupyter Notebook 20,394 1,112 Updated Jun 27, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,902 2,527 Updated Aug 12, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,476 290 Updated Jun 27, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,262 356 Updated Jun 28, 2025

腾讯柠檬清理是针对macOS系统专属制定的清理工具。主要功能包括重复文件和相似照片的识别、软件的定制化垃圾扫描、可视化的全盘空间分析、内存释放、浏览器隐私清理以及设备实时状态的监控等。重点聚焦清理功能,对上百款软件提供定制化的清理方案,提供专业的清理建议,帮助用户轻松完成一键式清理。

Objective-C 5,749 767 Updated Jun 10, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 42,308 7,061 Updated Dec 9, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,164 2,874 Updated Aug 15, 2024

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,372 454 Updated Jun 29, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 64,298 7,324 Updated Jun 29, 2025

收集和梳理垂直领域的开源模型、数据集及评测基准。

2,509 200 Updated Dec 26, 2023

LangChain for Go, the easiest way to write LLM-based programs in Go

Go 6,998 870 Updated Jun 28, 2025

Integrate the DeepSeek API into popular softwares

32,993 3,650 Updated May 13, 2025
Next