shivansh-p

shivansh-p

7 followers · 75 following

Lists (3)

Sort

Stars

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,431 238 Updated Nov 20, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,802 2,884 Updated Dec 2, 2025

JeanSanchezFelix / EdgeML-Projects

This repository contains example notebooks and homeworks demonstrating various techniques in model optimization for Edge ML.

Jupyter Notebook 2 Updated Apr 14, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 80,308 11,970 Updated Nov 25, 2025

Agent-RL / ReCall

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,262 76 Updated May 16, 2025

PRIME-RL / Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 391 12 Updated Jul 11, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 2,063 98 Updated Nov 23, 2025

shivansh-p / easy-GRPO

Forked from johnnycrab/easy-GRPO

A simple and explained implementation of (Dr.) GRPO in PyTorch.

Python 1 Updated Sep 10, 2025

johnnycrab / easy-GRPO

A simple and explained implementation of (Dr.) GRPO in PyTorch.

Python 3 1 Updated Sep 10, 2025

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python 574 50 Updated Oct 31, 2025

shivansh-p / -Fast-lightweight-schema-less-search-backend.-An-alternative-to-Elasticsearch-that-runs-on-a-few-

Forked from valeriansaliou/sonic

🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.

Jupyter Notebook 1 Updated Feb 12, 2025

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 851 68 Updated Nov 24, 2025

sairbarbaros / QVNTVS

Quantum Well Simulator for Semiconductor Modeling

Python 6 Updated Aug 13, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,502 2,330 Updated Dec 2, 2025

confident-ai / deepeval

The LLM Evaluation Framework

Python 12,432 1,099 Updated Dec 2, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,906 4,032 Updated Dec 2, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,863 1,663 Updated Nov 26, 2025

shivansh-p / DistillKit

Forked from arcee-ai/DistillKit

An Open Source Toolkit For LLM Distillation

Python 1 Updated May 1, 2025

keirp / OpenWebMath

XSLT 166 10 Updated May 2, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,140 743 Updated Dec 1, 2025

shivansh-p / xformers

Forked from facebookresearch/xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 1 Updated Apr 28, 2025

facebookresearch / blt

Code for BLT research paper

Python 2,011 184 Updated Nov 3, 2025

ShreyPandit / Classification-of-body-keypoint-trajectories-of-gesture-co-occurring-with-time-expressions

Code for the ongoing GSoC project "Classification of body keypoint trajectories of gesture co-occurring with time expressions".

Jupyter Notebook 4 4 Updated Sep 6, 2022

ShreyPandit / Tree-of-thought-on-GSM8K

Code for implementing a Tree of thought based prompting method for the math data GSM8K.

Python 2 Updated Apr 12, 2024

shivansh-p / Building-llama3-from-scratch

Forked from FareedKhan-dev/Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Jupyter Notebook 1 Updated Aug 23, 2024

shivansh-p / LLaMA-MoE-v2

Forked from OpenSparseLLMs/LLaMA-MoE-v2

🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Python 1 Updated Dec 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly