Skip to content
View lgstd's full-sized avatar
💭
relative
💭
relative

Block or report lgstd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Flash-Muon: An Efficient Implementation of Muon Optimizer

Python 135 9 Updated Jun 15, 2025

Muon is Scalable for LLM Training

1,088 48 Updated Mar 28, 2025

🔬 A Researcher-Friendly Framework for Time Series Analysis. Train Any Model on Any Dataset!

Python 10 3 Updated Jun 25, 2025

This is the official code and supplementary materials for our AAAI-2024 paper: MASTER: Market-Guided Stock Transformer for Stock Price Forecasting. MASTER is a stock transformer for stock price for…

Python 336 79 Updated Jun 26, 2025

股票AI操盘手:从学习、模拟到实盘,一站式平台。包含股票知识、策略实例、大模型、因子挖掘、传统策略、机器学习、深度学习、强化学习、图网络、高频交易、C++部署和聚宽实例代码等,可以方便学习、模拟及实盘交易

Jupyter Notebook 3,599 751 Updated Jun 25, 2025

efinance 是一个可以快速获取基金、股票、债券、期货数据的 Python 库,回测以及量化交易的好帮手!🚀🚀🚀

Python 2,426 545 Updated Mar 15, 2025

Code release for DynamicTanh (DyT)

Python 965 79 Updated Mar 30, 2025
Python 29 3 Updated May 26, 2025

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 232 17 Updated Aug 23, 2024
Python 738 60 Updated May 24, 2024

Python library designed to integrate Kolmogorov Arnold Networks with recurrent mechanisms.

Python 5 Updated Apr 16, 2025

An offical implementation of "TimeKAN: KAN-based Frequency Decomposition Learning Architecture for Long-term Time Series Forecasting" (ICLR 2025)

Python 62 14 Updated Feb 16, 2025

A Comprehensive Survey of Deep Learning for Multivariate Time Series Forecasting: A Channel Strategy Perspective

29 1 Updated Apr 4, 2025
Python 121 14 Updated Jun 9, 2025

PyTorch Implementation of FinMamba

34 2 Updated Jun 27, 2025

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Python 176 21 Updated Jun 21, 2025

[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Python 72 5 Updated Jun 2, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,139 61 Updated Feb 25, 2025
Python 60 6 Updated Mar 10, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,633 872 Updated Apr 29, 2025

Muon: An optimizer for hidden layers in neural networks

Python 947 49 Updated Jun 24, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,823 202 Updated Jun 29, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 706 33 Updated Mar 19, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,811 107 Updated Apr 3, 2025

[ICLR 2024] Official Implementation of "Diffusion-TS: Interpretable Diffusion for General Time Series Generation"

Jupyter Notebook 319 40 Updated Feb 28, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,439 160 Updated Jun 17, 2025

This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.

Python 57 3 Updated Apr 20, 2024

Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M context keypass retrieval

Python 83 6 Updated May 9, 2024
Next