View DongyuXu77's full-sized avatar

Linux kernel source tree

C 205,104 57,889 Updated Oct 16, 2025

An open collection of methodologies to help with successful training of large language models.

Python 536 44 Updated Feb 15, 2024

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,145 81 Updated Aug 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,433 2,281 Updated Oct 17, 2025

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 263 30 Updated Nov 27, 2023

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 382 37 Updated Apr 20, 2024

A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.

Python 1,512 102 Updated Nov 4, 2024

Large Context Attention

Python 743 53 Updated Oct 13, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…

Python 2,814 523 Updated Oct 17, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,247 1,263 Updated Oct 6, 2025

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 477 29 Updated Mar 19, 2024

Python 256 31 Updated Jun 6, 2025

An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,280 1,830 Updated Jul 3, 2024

A fast MoE impl for PyTorch

Python 1,804 196 Updated Feb 10, 2025

Benchmarking Deep Learning operations on different hardware

C++ 1,096 240 Updated Apr 25, 2021

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 899 170 Updated Dec 30, 2024

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,685 384 Updated Sep 17, 2025

A PyTorch Native LLM Training Framework

Python 875 51 Updated Sep 12, 2025

A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks

Jupyter Notebook 268 13 Updated Jul 30, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 390 13 Updated Jul 9, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,386 197 Updated Oct 16, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 44,766 6,453 Updated Oct 17, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 154,253 13,397 Updated Oct 17, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,602 667 Updated Oct 17, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,723 3,202 Updated Oct 17, 2025

The official Meta Llama 3 GitHub site

Python 29,041 3,470 Updated Jan 26, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,823 1,492 Updated Oct 3, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 75,638 11,095 Updated Oct 17, 2025