-
Shanghai Jiao Tong University
-
vllm-fork Public
Forked from HabanaAI/vllm-fork
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Jul 9, 2025
-
optimum-intel Public
Forked from huggingface/optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
Jupyter Notebook Apache License 2.0 Updated Jul 9, 2025
-
vllm-hpu-extension Public
Forked from HabanaAI/vllm-hpu-extension
Python Apache License 2.0 Updated May 20, 2025
-
optimum-habana Public
Forked from huggingface/optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
Python Apache License 2.0 Updated Feb 19, 2025
-
tgi-gaudi Public
Forked from huggingface/tgi-gaudi
Large Language Model Text Generation Inference on Habana Gaudi
Python Apache License 2.0 Updated Aug 22, 2024
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python MIT License Updated Jun 19, 2024
-
GenAIComps Public
Forked from opea-project/GenAIComps
GenAI components at micro-service level; GenAI service composer to create mega-service
-
GenAIEval Public
Forked from opea-project/GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
-
optimum Public
Forked from huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Python Apache License 2.0 Updated Feb 27, 2024
-
bigcode-evaluation-harness Public
Forked from bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
Python Apache License 2.0 Updated Jul 14, 2023
-
intel-extension-for-transformers Public
Forked from intel/intel-extension-for-transformers
Extending Hugging Face transformers APIs for Transformer-based models and improve the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve the inf…
C++ Apache License 2.0 Updated Feb 16, 2023
-
CUDA-Programming-Guide-in-Chinese Public
Forked from HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
Updated Jan 5, 2023
-
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Nov 10, 2021
-
lpot Public
Forked from intel/neural-compressor
Intel® Low Precision Optimization Tool, targeting to provide a unified low precision inference interface cross different deep learning frameworks, and support auto-tune with specified accuracy crit…
Python Apache License 2.0 Updated Jun 11, 2021