Skip to content
View seyuboglu's full-sized avatar

Block or report seyuboglu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 68,569 5,358 Updated Nov 26, 2025

Processed / Cleaned Data for Paper Copilot

Python 727 36 Updated Nov 25, 2025

Storing long contexts in tiny caches with self-study

Python 217 22 Updated Oct 17, 2025

KV cache compression via sparse coding

Python 14 2 Updated Oct 26, 2025

A minimalistic framework for transparently training language models and storing comprehensive checkpoints for in-depth learning dynamics research.

Python 293 22 Updated Nov 14, 2025

Model Context Protocol Servers

TypeScript 73,350 8,881 Updated Nov 26, 2025

Big & Small LLMs working together

Python 1,209 138 Updated Nov 25, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,778 452 Updated Nov 26, 2025

Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders

Python 264 33 Updated Oct 31, 2025

Aioli: A unified optimization framework for language model data mixing

Jupyter Notebook 31 4 Updated Jan 17, 2025

Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.

Python 189 22 Updated Mar 7, 2025

[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

Python 131 9 Updated Dec 4, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,157 566 Updated Aug 22, 2025

[NeurIPS 2024] Simple and Effective Masked Diffusion Language Model

Python 569 81 Updated Sep 29, 2025

(NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning

Python 231 25 Updated Jun 10, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,112 256 Updated Jul 25, 2025

The Fast Cross-Platform Package Manager

C++ 7,786 418 Updated Nov 25, 2025

Tile primitives for speedy kernels

Cuda 2,951 202 Updated Nov 26, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,855 1,892 Updated Nov 24, 2025

A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).

160 10 Updated Jan 1, 2025

Triton-based implementation of Sparse Mixture of Experts.

Python 253 22 Updated Oct 3, 2025

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Python 243 17 Updated Jun 6, 2025

This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"

Python 493 46 Updated Mar 26, 2024

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,914 311 Updated Nov 26, 2025

Understand and test language model architectures on synthetic tasks.

Python 240 39 Updated Sep 25, 2025

Building blocks for foundation models.

579 28 Updated Jan 3, 2024

Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

Jupyter Notebook 47 8 Updated Oct 31, 2023
Next