Skip to content
View MaximumEntropy's full-sized avatar
  • Mistral AI
  • San Fransisco, Bay Area

Block or report MaximumEntropy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 691 38 Updated Nov 19, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 986 251 Updated Jul 22, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,133 1,029 Updated Sep 28, 2025

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 587 60 Updated Sep 15, 2025

Original Implementation of Prompt Tuning from Lester, et al, 2021

Python 695 60 Updated Mar 6, 2025

Pytorch library for fast transformer implementations

Python 1,745 188 Updated Mar 23, 2023

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

Python 159 22 Updated Jun 18, 2024

State-of-the-Art Text Embeddings

Python 17,705 2,700 Updated Oct 21, 2025

Fast and customizable text tokenization library with BPE and SentencePiece support

C++ 319 76 Updated Apr 15, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,912 3,137 Updated Oct 21, 2025

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,630 923 Updated Oct 17, 2025

GPT-3: Language Models are Few-Shot Learners

15,779 2,297 Updated Sep 18, 2020

A summary of must-read papers for Neural Question Generation (NQG)

587 78 Updated Oct 25, 2021

midi processor library for PerformanceRNN & MusicTransformer published by "Google Magenta"

Python 123 30 Updated Aug 1, 2024

Longformer: The Long-Document Transformer

Python 2,171 287 Updated Feb 8, 2023

Music GPT-2 Implementation with Relative Positional Embedding

Jupyter Notebook 77 10 Updated Nov 19, 2019

🏋️ Python / Modern C++ Solutions of All 3716 LeetCode Problems (Weekly Update)

C++ 5,032 1,636 Updated Oct 19, 2025

Dataset for JSB Chorales at different temporal resolutions, with train, validation, test split from Boulanger-Lewandowski (2012).

112 18 Updated Dec 6, 2022

torch-optimizer -- collection of optimizers for Pytorch

Python 3,144 311 Updated Mar 22, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,462 4,585 Updated Oct 20, 2025

CUDA kernels for generalized matrix-multiplication in PyTorch

Jupyter Notebook 85 14 Updated Oct 11, 2021

Code for the paper "Do Massively Pretrained Language Models Make Better Storytellers?"

Jupyter Notebook 76 13 Updated Jun 17, 2022

Fast, general, and tested differentiable structured prediction in PyTorch

Jupyter Notebook 1,117 93 Updated Apr 20, 2022

TextRank implementation for Python 3.

Python 1,264 258 Updated Mar 28, 2023

Ongoing research training transformer models at scale

Python 13,906 3,169 Updated Oct 21, 2025

Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/

Perl 150 25 Updated Aug 8, 2023

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Python 1,121 206 Updated Nov 28, 2022

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,180 1,167 Updated May 28, 2023

An implementation of training for GPT2, supports TPUs

Python 1,418 332 Updated Dec 12, 2022

A mix of GAN implementations including progressive growing

Python 1,628 270 Updated Oct 12, 2021
Next