- A small village in south China.
Stars
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
[FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository
TiCDC pulls change logs out of TiDB and pushes to kinds of systems.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Fast inference engine for Transformer models
Moyubie is a cross-platform, serverless and AI powered IM (Instant Messaging) APP
Config to run TiFlash in Cloud Native mode
rockset / rocksdb-cloud
Forked from facebook/rocksdbA library that provides an embeddable, persistent key-value store for fast storage optimized for AWS
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads. Welcome to our community: https://discord.gg/74cF8vbNEs
The analytical engine for TiDB and TiDB Cloud. Try free: https://tidbcloud.com/free-trial
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
A modular implementation of timely dataflow in Rust
DuckDB is an analytical in-process SQL database management system
TensorBase is a new big data warehousing with modern efforts.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
DataFusion has now been donated to the Apache Arrow project
𝗔𝗜-𝗡𝗮𝘁𝗶𝘃𝗲 𝗗𝗮𝘁𝗮 𝗪𝗮𝗿𝗲𝗵𝗼𝘂𝘀𝗲. Blazing analytics, fast search, geo insights, vector AI. Built for multimodal analytics, Open-source Snowflake alternative. https://databend.com
Measure Amazon S3's performance from any location.
The Time Series Data Library (TSDL) was created by Rob Hyndman, Professor of Statistics at Monash University, Australia.





