Stars
Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.
TrustJudge is a probabilistic evaluation framework that reduces score-comparison and pairwise transitivity inconsistencies in LLM-as-a-judge systems.
Xmixers: A collection of SOTA efficient token/channel mixers
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…
Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.
Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction
A simple yet powerful agent framework that delivers with open-source models
[COLM 25] Phased Training for LLM-powered Text Retrieval Models Beyond Data Scaling
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs
[EMNLP 2025] Awesome RAG Reasoning Resources
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
一个面向中文文本纠错任务的综合平台,集学术研究、模型训练、模型评测和推理部署于一体,覆盖拼写纠错与语法纠错两个核心方向。
An Approach to Enhancing the Efficacy of Post-Training Using Synthetic Data by Iterative Data Selection
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
A bibliography and survey of the papers surrounding o1
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
The official repo of INF-34B models trained by INF Technology.