- New Delhi, India
-
06:34
(UTC +05:30) - https://thevasudevgupta.github.io/
- @thevasudevgupta
- in/thevasudevgupta
- https://unboxai.com/
-
-
-
nano-vllm Public
Forked from GeeeekExplorer/nano-vllmNano vLLM
-
lighteval Public
Forked from huggingface/lightevalLighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Python MIT License UpdatedFeb 10, 2025 -
picotron Public
Forked from huggingface/picotronMinimalistic 4D-parallelism distributed training framework for education purpose
Python Apache License 2.0 UpdatedDec 20, 2024 -
search-and-learn Public
Forked from huggingface/search-and-learnPython Apache License 2.0 UpdatedDec 16, 2024 -
smollm Public
Forked from huggingface/smollmEverything about the SmolLM & SmolLM2 family of models
Python Apache License 2.0 UpdatedDec 2, 2024 -
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 10, 2024 -
-
unsloth Public
Forked from unslothai/unslothFinetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Python Apache License 2.0 UpdatedAug 14, 2024 -
-
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 24, 2024 -
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 UpdatedApr 26, 2024 -
-
grok Public
Forked from xai-org/grok-1Grok open release
Python Apache License 2.0 UpdatedMar 17, 2024 -
fms-fsdp Public
Forked from foundation-model-stack/fms-fsdpDemonstrate throughput of PyTorch FSDP
Python Apache License 2.0 UpdatedMar 13, 2024 -
gpt-fast Public
Forked from meta-pytorch/gpt-fastSimple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 29, 2024 -
fm-cheatsheet Public
Forked from allenai/fm-cheatsheetWebsite for hosting the Open Foundation Models Cheat Sheet.
Python UpdatedFeb 29, 2024 -
OLMo Public
Forked from allenai/OLMoModeling, training, eval, and inference code for OLMo
Python Apache License 2.0 UpdatedFeb 6, 2024 -
hyperpod Public
Forked from Stability-AI/hyperpodCollection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Python MIT No Attribution UpdatedFeb 1, 2024 -
ml-engineering Public
Forked from stas00/ml-engineeringMachine Learning Engineering Open Book
Python Creative Commons Attribution Share Alike 4.0 International UpdatedJan 26, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedJan 19, 2024 -
ds-toolkit Public
Some useful stuff for a software/ML engineer
-
ml-ways Public
Forked from stas00/ml-waysML/DL Math and Method notes
Jupyter Notebook UpdatedDec 2, 2023 -
accelerate Public
Forked from huggingface/accelerateA simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python Apache License 2.0 UpdatedNov 27, 2023 -
ImageBind Public
Forked from facebookresearch/ImageBindImageBind One Embedding Space to Bind Them All
Python Other UpdatedAug 1, 2023 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJun 27, 2023 -
RedPajama-Data Public
Forked from togethercomputer/RedPajama-DataThe RedPajama-Data repository contains code for preparing large datasets for training large language models.
Python Apache License 2.0 UpdatedJun 14, 2023 -
biobigbird Public
BigBird for bio-medical domain
-
gpt-llama.cpp Public
Forked from keldenl/gpt-llama.cppA llama.cpp drop-in replacement for OpenAI's GPT endpoints, allowing GPT-powered apps to run off local llama.cpp models instead of OpenAI.
JavaScript MIT License UpdatedJun 12, 2023





