Skip to content
View shivansh-p's full-sized avatar

Block or report shivansh-p

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Everything about the SmolLM and SmolVLM family of models

Python 3,431 238 Updated Nov 20, 2025

A framework for few-shot evaluation of language models.

Python 10,802 2,884 Updated Dec 2, 2025

This repository contains example notebooks and homeworks demonstrating various techniques in model optimization for Edge ML.

Jupyter Notebook 2 Updated Apr 14, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 80,308 11,970 Updated Nov 25, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,262 76 Updated May 16, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 391 12 Updated Jul 11, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,063 98 Updated Nov 23, 2025

A simple and explained implementation of (Dr.) GRPO in PyTorch.

Python 1 Updated Sep 10, 2025

A simple and explained implementation of (Dr.) GRPO in PyTorch.

Python 3 1 Updated Sep 10, 2025

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 574 50 Updated Oct 31, 2025

🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.

Jupyter Notebook 1 Updated Feb 12, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 851 68 Updated Nov 24, 2025

Quantum Well Simulator for Semiconductor Modeling

Python 6 Updated Aug 13, 2025

Train transformer language models with reinforcement learning.

Python 16,502 2,330 Updated Dec 2, 2025

The LLM Evaluation Framework

Python 12,432 1,099 Updated Dec 2, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,906 4,032 Updated Dec 2, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,863 1,663 Updated Nov 26, 2025

An Open Source Toolkit For LLM Distillation

Python 1 Updated May 1, 2025
XSLT 166 10 Updated May 2, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,140 743 Updated Dec 1, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 1 Updated Apr 28, 2025

Code for BLT research paper

Python 2,011 184 Updated Nov 3, 2025

Code for the ongoing GSoC project "Classification of body keypoint trajectories of gesture co-occurring with time expressions".

Jupyter Notebook 4 4 Updated Sep 6, 2022

Code for implementing a Tree of thought based prompting method for the math data GSM8K.

Python 2 Updated Apr 12, 2024

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Jupyter Notebook 1 Updated Aug 23, 2024

🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python 1 Updated Dec 3, 2024

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …

Jupyter Notebook 8,943 1,391 Updated Dec 1, 2025
Jupyter Notebook 2 1 Updated Nov 3, 2024

A Simple Web Crawler implementation in Python, notebook (Google Colab)

Jupyter Notebook 1 1 Updated Jun 8, 2021

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 68,984 5,397 Updated Dec 3, 2025
Next