thevasudevgupta

🎓

enjoying hard work!

Vasudev Gupta thevasudevgupta

🎓

enjoying hard work!

trying to learn what AI learns

206 followers · 113 following

Achievements

Organizations

inferllm Public

LLM inference!

Python MIT License Updated Nov 23, 2025
thevasudevgupta.github.io Public

personal webpage (PUBLIC)

Updated Jul 8, 2025
nano-vllm Public
Forked from GeeeekExplorer/nano-vllm

Nano vLLM

Python 1 MIT License Updated Jun 19, 2025
lighteval Public
Forked from huggingface/lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python MIT License Updated Feb 10, 2025
picotron Public
Forked from huggingface/picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python Apache License 2.0 Updated Dec 20, 2024
search-and-learn Public
Forked from huggingface/search-and-learn

Python Apache License 2.0 Updated Dec 16, 2024
smollm Public
Forked from huggingface/smollm

Everything about the SmolLM & SmolLM2 family of models

Python Apache License 2.0 Updated Dec 2, 2024
torchtitan Public
Forked from pytorch/torchtitan

A native PyTorch Library for large model training

Python BSD 3-Clause "New" or "Revised" License Updated Sep 10, 2024
gpt-triton Public

Triton implementation of GPT/LLAMA

Python 20 2 MIT License Updated Aug 28, 2024
unsloth Public
Forked from unslothai/unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python Apache License 2.0 Updated Aug 14, 2024
cluster-health Public
Forked from imbue-ai/cluster-health

Python Updated Jun 25, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated May 24, 2024
datatrove Public
Forked from huggingface/datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python Apache License 2.0 Updated Apr 26, 2024
megablocks Public
Forked from databricks/megablocks

Python Apache License 2.0 Updated Mar 26, 2024
grok Public
Forked from xai-org/grok-1

Grok open release

Python Apache License 2.0 Updated Mar 17, 2024
fms-fsdp Public
Forked from foundation-model-stack/fms-fsdp

Demonstrate throughput of PyTorch FSDP

Python Apache License 2.0 Updated Mar 13, 2024
gpt-fast Public
Forked from meta-pytorch/gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python BSD 3-Clause "New" or "Revised" License Updated Feb 29, 2024
fm-cheatsheet Public
Forked from allenai/fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

Python Updated Feb 29, 2024
OLMo Public
Forked from allenai/OLMo

Modeling, training, eval, and inference code for OLMo

Python Apache License 2.0 Updated Feb 6, 2024
hyperpod Public
Forked from Stability-AI/hyperpod

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

Python MIT No Attribution Updated Feb 1, 2024
ml-engineering Public
Forked from stas00/ml-engineering

Machine Learning Engineering Open Book

Python Creative Commons Attribution Share Alike 4.0 International Updated Jan 26, 2024
nanotron Public
Forked from huggingface/nanotron

Minimalistic large language model 3D-parallelism training

Python Apache License 2.0 Updated Jan 19, 2024
ds-toolkit Public

Some useful stuff for a software/ML engineer

data-science git-notes dvc-for-data-science docker-notes markdown-notes

Shell 5 1 Apache License 2.0 Updated Jan 12, 2024
ml-ways Public
Forked from stas00/ml-ways

ML/DL Math and Method notes

Jupyter Notebook Updated Dec 2, 2023
accelerate Public
Forked from huggingface/accelerate

A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Python Apache License 2.0 Updated Nov 27, 2023
ImageBind Public
Forked from facebookresearch/ImageBind

ImageBind One Embedding Space to Bind Them All

Python Other Updated Aug 1, 2023
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Jun 27, 2023
RedPajama-Data Public
Forked from togethercomputer/RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python Apache License 2.0 Updated Jun 14, 2023
biobigbird Public

BigBird for bio-medical domain

flax biomedical jax huggingface bigbird

Python 1 1 Updated Jun 13, 2023
gpt-llama.cpp Public
Forked from keldenl/gpt-llama.cpp

A llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI.

JavaScript MIT License Updated Jun 12, 2023

Vasudev Gupta thevasudevgupta

Achievements

Achievements

Organizations

inferllm Public

Uh oh!

thevasudevgupta.github.io Public

Uh oh!

nano-vllm Public

Uh oh!

lighteval Public

Uh oh!

picotron Public

Uh oh!

search-and-learn Public

Uh oh!

smollm Public

Uh oh!

torchtitan Public

Uh oh!

gpt-triton Public

Uh oh!

unsloth Public

Uh oh!

cluster-health Public

Uh oh!

flash-attention Public

Uh oh!

datatrove Public

Uh oh!

megablocks Public

Uh oh!

grok Public

Uh oh!

fms-fsdp Public

Uh oh!

gpt-fast Public

Uh oh!

fm-cheatsheet Public

Uh oh!

OLMo Public

Uh oh!

hyperpod Public

Uh oh!

ml-engineering Public

Uh oh!

nanotron Public

Uh oh!

ds-toolkit Public

Uh oh!

ml-ways Public

Uh oh!

accelerate Public

Uh oh!

ImageBind Public

Uh oh!

vllm Public

Uh oh!

RedPajama-Data Public

Uh oh!

biobigbird Public

Uh oh!

gpt-llama.cpp Public

Uh oh!