gigit0000

Follow

William Song gigit0000

Follow

Ready to dispatch!

13 followers · 40 following

Kim Baksa's Lab, South Korea
13:13 (UTC +09:00)

Achievements

Achievements

Stars

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,762 324 Updated Nov 28, 2025

cornserve-ai / cornserve

Easy, Fast, and Scalable Multimodal AI

Python 75 5 Updated Nov 28, 2025

kuterd / nv_isa_solver

Nvidia Instruction Set Specification Generator

Python 299 16 Updated Jul 9, 2024

gpu-mode / triton-index

Cataloging released Triton kernels.

274 14 Updated Sep 9, 2025

Ma-Lab-Berkeley / deep-representation-learning-book

Learning Deep Representations of Data Distributions

TeX 650 51 Updated Nov 29, 2025

siboehm / ShallowSpeed

Small scale distributed training of sequential deep learning models, built on Numpy and MPI.

Python 151 7 Updated Oct 19, 2023

Lightning-AI / forked-pdb

Python pdb for multiple processes

Python 66 8 Updated May 24, 2025

aphrodite-engine / aphrodite-engine

Large-scale LLM inference engine

C++ 1,599 176 Updated Nov 24, 2025

bloomberg / memray

Memray is a memory profiler for Python

Python 14,620 432 Updated Nov 22, 2025

jalalirs / arielml

Python 7 Updated Jul 26, 2025

ShawnZhong / compiler-explorer-triton

Forked from compiler-explorer/compiler-explorer

Triton Support in Compiler Explorer

TypeScript 5 Updated Aug 5, 2025

compiler-explorer / compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

TypeScript 18,259 1,964 Updated Nov 27, 2025

Owen718 / FlexAttention-Examples

This repo provides several classic attention variant implementation based on FlexAttention API.

Python 2 1 Updated May 18, 2025

vllm-project / speculators

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 134 16 Updated Nov 21, 2025

OpenMathLib / OpenBLAS

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,137 1,617 Updated Nov 23, 2025

kherrick / hacker-news

Hacker News

HTML 13 5 Updated Nov 30, 2025

nasa03 / llamafile

Forked from mozilla-ai/llamafile

Distribute and run LLMs with a single file.

C++ 1 Updated Jul 23, 2024

mozilla-ai / llamafile

Distribute and run LLMs with a single file.

C 23,436 1,242 Updated Nov 24, 2025

vosen / ZLUDA

CUDA on non-NVIDIA GPUs

Rust 13,522 860 Updated Nov 29, 2025

Syllo / nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 9,822 346 Updated Oct 25, 2025

TheCodeTraveler / HackerNews

A .NET MAUI app for displaying the top posts on Hacker News that demonstrates text sentiment analysis gathered using artificial intelligence

C# 281 40 Updated Nov 24, 2025

oz123 / awesome-c

A curated list of awesome C frameworks, libraries, resources and other shiny things. Inspired by all the other awesome-... projects out there.

10,840 902 Updated Nov 7, 2025

RoyalCities / RC-Home-Assistant-Low-VRAM

Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model recommendations - settings designed for low VRAM systems.

216 18 Updated Jul 27, 2025

ACIDBURN2501 / debug-macros

Debug Module for Embedded Systems

C 1 Updated May 3, 2025

gigit0000 / dia

Forked from nari-labs/dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 1 Updated Jul 6, 2025

thibmaek / awesome-raspberry-pi

📝 A curated list of awesome Raspberry Pi tools, projects, images and resources

Shell 15,496 1,067 Updated Nov 10, 2025

rogerallen / llama2.cu

Forked from karpathy/llama2.c

Inference Llama 2 in one file of pure C & one file with CUDA

C 31 1 Updated Oct 14, 2023

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,893 1,642 Updated Nov 19, 2025

JohnClaw / chatllm.v

V-lang api wrapper for llm-inference chatllm.cpp

C 6 Updated Nov 20, 2024

modular / modular

The Modular Platform (includes MAX & Mojo)

Mojo 25,271 2,738 Updated Nov 29, 2025