Stars
Everything about the SmolLM and SmolVLM family of models
A framework for few-shot evaluation of language models.
This repository contains example notebooks and homeworks demonstrating various techniques in model optimization for Edge ML.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Muon is an optimizer for hidden layers in neural networks
shivansh-p / easy-GRPO
Forked from johnnycrab/easy-GRPO
A simple and explained implementation of (Dr.) GRPO in PyTorch.
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
shivansh-p / -Fast-lightweight-schema-less-search-backend.-An-alternative-to-Elasticsearch-that-runs-on-a-few-
Forked from valeriansaliou/sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
Quantum Well Simulator for Semiconductor Modeling
Train transformer language models with reinforcement learning.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
shivansh-p / DistillKit
Forked from arcee-ai/DistillKit
An Open Source Toolkit For LLM Distillation
Hackable and optimized Transformers building blocks, supporting a composable construction.
shivansh-p / xformers
Forked from facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Code for the ongoing GSoC project "Classification of body keypoint trajectories of gesture co-occurring with time expressions".
Code implementing a Tree-of-Thought prompting method for the GSM8K math dataset.
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
shivansh-p / LLaMA-MoE-v2
Forked from OpenSparseLLMs/LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
A Simple Web Crawler implementation in Python, notebook (Google Colab)
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data