- Chinese Academy of Sciences
- Beijing, China
- https://dongyuxu77.github.io/
Stars
An open collection of methodologies to help with successful training of large language models.
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
verl: Volcano Engine Reinforcement Learning for LLMs
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Benchmarking Deep Learning operations on different hardware
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
A machine learning compiler for GPUs, CPUs, and ML accelerators
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step