Starred repositories
Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddings recursively. This helps us understand user behaviour on…
Curated list of awesome Cursor Rules .mdc files
Multiple agents with LangGraph and MCP
LangGraph template for a simple ReAct agent with an MCP client to access MCP servers as tools
The official Python SDK for Model Context Protocol servers and clients
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
A blazing fast inference solution for text embeddings models
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Code for explaining and evaluating late chunking (chunked pooling)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Companion repo for complete Docker course
PIP-Net: Patch-based Intuitive Prototypes Network for Interpretable Image Classification (CVPR 2023)
Uncertainty Quantification 360 (UQ360) is an extensible open-source toolkit that can help you estimate, communicate and use uncertainty in machine learning model predictions.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs
ML-based radioisotope identification and estimation from gamma spectra in Python.
An open source implementation of CLIP.
Google Research
A scikit-learn compatible neural network library that wraps PyTorch
Capstone Project for UW Data Analytics Bootcamp