-
ComfyUI Public
Forked from comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Python GNU General Public License v3.0 Updated Jun 29, 2025 -
ragflow Public
Forked from infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Python Apache License 2.0 Updated Jun 18, 2025 -
nano-vllm Public
Forked from GeeeekExplorer/nano-vllm
Nano vLLM
Python MIT License Updated Jun 13, 2025 -
RAG_Techniques Public
Forked from NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Jupyter Notebook Other Updated Jun 9, 2025 -
llm_counts Public
Forked from harleyszhang/llm_counts
llm theoretical performance analysis tools and support params, flops, memory and latency analysis.
Python Updated May 29, 2025 -
WeClone Public
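Forked from xming521/WeClone
Stars welcome ⭐. 🚀 A one-stop solution for creating a digital avatar from chat history. 💡 Fine-tune a large language model on WeChat chat logs so that it picks up your personal style, then bind it to a chatbot to create your own digital avatar. Digital clone / digital avatar / digital immortality / voice cloning / LLM / large language model / WeChat chatbot / LoRA
Python GNU Affero General Public License v3.0 Updated May 11, 2025 -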
Awesome-LLM-System-Papers Public
Forked from AmadeusChan/Awesome-LLM-System-Papers
Updated May 10, 2025 -
Awesome-Multimodal-Large-Language-Models Public
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
Updated Apr 15, 2025 -
data-engineer-handbook Public
Forked from DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Makefile Updated Nov 15, 2024 -
The-Art-of-Linear-Algebra Public
Forked from kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
PostScript Creative Commons Zero v1.0 Universal Updated Nov 13, 2024 -
Category_Theory_Machine_Learning Public
Forked from bgavran/Category_Theory_Machine_Learning
List of papers studying machine learning through the lens of category theory
Python Updated Oct 15, 2024 -
AI-System-School Public
Forked from HuaizhengZhang/AI-Infra-from-Zero-to-Hero
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
MIT License Updated Aug 14, 2024 -
LLMSys-PaperList Public
Forked from AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
Updated Jul 25, 2024 -
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 Updated Jul 25, 2024 -
triton-linalg Public
Forked from Cambricon/triton-linalg
Development repository for the Triton-Linalg conversion
C++ Apache License 2.0 Updated Jul 8, 2024 -
lectures Public
Forked from gpu-mode/lectures
Material for cuda-mode lectures
Jupyter Notebook Apache License 2.0 Updated Jun 13, 2024 -
FStar Public
Forked from FStarLang/FStar
A Proof-oriented Programming Language
F* Apache License 2.0 Updated May 16, 2024 -
tutorial-multi-gpu Public
Forked from FZJ-JSC/tutorial-multi-gpu
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
Cuda MIT License Updated May 7, 2024 -
lightllm Public
Forked from ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python Apache License 2.0 Updated Apr 25, 2024 -
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
GNU General Public License v3.0 Updated Apr 24, 2024 -
CUDA-Learn-Note Public
Forked from xlite-dev/LeetCUDA
🎉 CUDA notes / collection of frequently asked interview questions / C++ notes; personal notes, updated irregularly: sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
Cuda GNU General Public License v3.0 Updated Mar 9, 2024 -
mlir-tutorial Public
Forked from j2kun/mlir-tutorial
MLIR For Beginners tutorial
C++ Updated Mar 7, 2024 -
kernl Public
Forked from ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Jupyter Notebook Apache License 2.0 Updated Feb 16, 2024 -
llm-numbers Public
Forked from ray-project/llm-numbers
Numbers every LLM developer should know
Updated Jan 16, 2024 -
LLMsPracticalGuide Public
Forked from Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Updated Nov 22, 2023 -
ML-For-Beginners Public
Forked from microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
HTML MIT License Updated Nov 17, 2023 -
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
Jupyter Notebook MIT License Updated Aug 15, 2023