- Peking University
- Beijing, China
- https://www.microsoft.com/en-us/research/people/wangliang/
Stars
Tongyi Deep Research, a leading open-source deep research agent
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
Lightweight coding agent that runs in your terminal
SGLang is a fast serving framework for large language models and vision language models.
SkyRL: A Modular Full-stack RL Library for LLMs
Democratizing Reinforcement Learning for LLMs
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
verl: Volcano Engine Reinforcement Learning for LLMs
🤗 smolagents: a barebones library for agents that think in code.
An easy-to-use, scalable, and high-performance RLHF framework based on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, async agentic RL)
Official inference framework for 1-bit LLMs
A bibliography and survey of the papers surrounding o1
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solutions team.
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Large World Model -- Modeling Text and Video with Million-Token Context
LOFT: A 1 Million+ Token Long-Context Benchmark
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Generative Representational Instruction Tuning
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)