Skip to content
View intfloat's full-sized avatar

Block or report intfloat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,344 1,231 Updated Oct 18, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,197 98 Updated Oct 6, 2025

Lightweight coding agent that runs in your terminal

Rust 48,698 5,899 Updated Oct 24, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,235 3,150 Updated Oct 24, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,087 143 Updated Oct 24, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,557 428 Updated Oct 23, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,366 183 Updated Oct 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,680 2,342 Updated Oct 24, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 23,552 2,075 Updated Oct 23, 2025

s1: Simple test-time scaling

Python 6,581 766 Updated Jun 25, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,219 804 Updated Oct 23, 2025

Official inference framework for 1-bit LLMs

Python 24,289 1,879 Updated Jun 3, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,209 51 Updated Nov 16, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,542 2,206 Updated Mar 11, 2025

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 234 13 Updated Sep 12, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,842 376 Updated Oct 17, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,347 3,869 Updated Oct 23, 2025
Jupyter Notebook 687 82 Updated Apr 30, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 748 52 Updated Sep 27, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,360 561 Updated Oct 19, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 218 17 Updated Jun 13, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,338 116 Updated Oct 9, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,061 117 Updated Jul 29, 2024

A programming framework for agentic AI

Python 51,053 7,793 Updated Oct 8, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,075 265 Updated Sep 25, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,320 276 Updated Jul 17, 2025

Generative Representational Instruction Tuning

Jupyter Notebook 675 49 Updated Jun 25, 2025

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 26,919 2,782 Updated Oct 23, 2025

LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)

Python 144 9 Updated Nov 9, 2024
Next