Linzar-Slytherin

Follow

Linzar-Slytherin

Follow

0 followers · 1 following

sdu
jinan

Popular repositories Loading

DistServe DistServe Public

Forked from LLMServe/DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook
SwiftTransformer SwiftTransformer Public

Forked from LLMServe/SwiftTransformer

High performance Transformer implementation in C++.

C++
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
DeepSpeed-MII DeepSpeed-MII Public

Forked from deepspeedai/DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python
SpotServe SpotServe Public

Forked from Hsword/SpotServe

SpotServe: Serving Generative Large Language Models on Preemptible Instances

Jupyter Notebook
SDS SDS Public

Jupyter Notebook