- Shanghai
Stars
Kimi K2 is the large language model series developed by Moonshot AI team
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
Democratizing Reinforcement Learning for LLMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Fully open data curation for reasoning models
verl: Volcano Engine Reinforcement Learning for LLMs
A series of technical report on Slow Thinking with LLM
Fully open reproduction of DeepSeek-R1
An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.
Scalable RL solution for advanced reasoning of language models
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
The reinforcement learning training code for AgiBot X1.
[ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Fast and memory-efficient exact attention
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"





