Skip to content
View Ben-Louis's full-sized avatar
:shipit:
:shipit:

Block or report Ben-Louis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official code of ARPO & AEPO

Python 812 36 Updated Nov 15, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,627 686 Updated Nov 7, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,160 36 Updated Oct 4, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,379 7,657 Updated Dec 1, 2025

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 15,826 1,264 Updated Dec 1, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 731 44 Updated Jun 6, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,792 453 Updated Nov 27, 2025

auto sign cursor

Python 9,687 1,529 Updated Oct 14, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,157 318 Updated Nov 27, 2025

Fully open data curation for reasoning models

Python 2,152 180 Updated Sep 3, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,036 2,705 Updated Dec 2, 2025

A series of technical report on Slow Thinking with LLM

Python 748 41 Updated Aug 13, 2025

Fully open reproduction of DeepSeek-R1

Python 25,702 2,402 Updated Nov 24, 2025

An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.

Python 331 39 Updated Jun 19, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,779 99 Updated Mar 18, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,135 1,813 Updated Feb 26, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,489 822 Updated Nov 9, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,612 499 Updated Oct 23, 2024
Python 966 111 Updated Jan 23, 2025

[ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models

152 1 Updated Dec 5, 2024
Python 92 18 Updated Jul 12, 2022

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 15,331 3,609 Updated Nov 29, 2025

Fast and memory-efficient exact attention

Python 20,830 2,179 Updated Nov 25, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,245 31,281 Updated Dec 1, 2025

清华主题PPT模板

Python 1,488 96 Updated Nov 16, 2025

LLM inference in C/C++

C++ 90,674 13,901 Updated Dec 2, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 6,060 573 Updated Feb 26, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,092 729 Updated May 31, 2024
Next