🎓
enjoying hard work!

Organizations

@analytics-club-iitm @Unbox-AI

Cogito v2 is the large language model developed by the DeepCogito team

5 1 Updated Nov 21, 2025

PyTorch native post-training library

Python 5,605 683 Updated Nov 24, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,471 243 Updated Aug 28, 2025

Load compute kernels from the Hub

Python 334 26 Updated Nov 24, 2025

Recipes to train reward models for RLHF.

Python 1,482 103 Updated Apr 24, 2025

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch

Python 545 33 Updated May 16, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 73,529 2,253 Updated Nov 24, 2025

Ongoing research training transformer models at scale

Python 14,294 3,313 Updated Nov 25, 2025

Puzzles for learning Triton

Jupyter Notebook 2,132 174 Updated Nov 18, 2024

Implementation of a Transformer, but completely in Triton

Python 277 16 Updated Apr 5, 2022

FLOPs counter for neural networks in the PyTorch framework

Python 2,957 308 Updated Aug 20, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,652 6,158 Updated Sep 18, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 6,156 566 Updated Aug 22, 2025

Helpful tools and examples for working with flex-attention

Python 1,060 64 Updated Nov 18, 2025

Minimalistic large language model 3D-parallelism training

Python 2,333 256 Updated Nov 21, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 38,131 4,132 Updated Nov 24, 2025

Perf monitoring CLI tool for Apple Silicon

Python 4,360 183 Updated Apr 18, 2024

Parallel computing with task scheduling

Python 13,615 1,826 Updated Nov 24, 2025

Extremely fast Query Engine for DataFrames, written in Rust

Rust 36,235 2,481 Updated Nov 24, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,124 2,110 Updated Nov 21, 2025

A Data Streaming Library for Efficient Neural Network Training

Python 1,421 176 Updated Oct 27, 2025

Fast Inference Solutions for BLOOM

Python 564 114 Updated Oct 9, 2024

DataComp: In search of the next generation of multimodal datasets

Python 750 62 Updated Apr 28, 2025

This project shows how to serve an ONNX-optimized image classification model as a web service with FastAPI, Docker, and Kubernetes.

Jupyter Notebook 221 41 Updated Jul 27, 2022

An open-source framework for training large multimodal models.

Python 4,046 316 Updated Aug 31, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,229 4,034 Updated Jul 17, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,981 2,216 Updated Jul 29, 2024

GPU Programming @ IIT Madras

Cuda 2 Updated May 10, 2022

GSoC'2021 | TensorFlow implementation of Wav2Vec2

Jupyter Notebook 90 28 Updated Jan 11, 2022