geekjuruo
🤗 Focusing
  • Tsinghua University
  • Beijing Haidian

Organizations

@THUKElab


Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 128 15 Updated Oct 16, 2025

TrustJudge is a probabilistic evaluation framework that reduces score-comparison and pairwise transitivity inconsistencies in LLM-as-a-judge systems.

Python 31 1 Updated Sep 27, 2025

Xmixers: A collection of SOTA efficient token/channel mixers

Python 29 2 Updated Sep 4, 2025

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 316 35 Updated Oct 14, 2025

Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.

Python 828 117 Updated Oct 11, 2025

Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

Macaulay2 3 Updated Aug 13, 2025

A simple yet powerful agent framework that delivers with open-source models

Python 3,517 335 Updated Oct 17, 2025

[COLM 25] Phased Training for LLM-powered Text Retrieval Models Beyond Data Scaling

7 1 Updated Aug 18, 2025

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Python 5 Updated Sep 25, 2025

[EMNLP 2025] Awesome RAG Reasoning Resources

321 25 Updated Jul 24, 2025

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Python 96 8 Updated Sep 17, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,698 1,003 Updated Sep 24, 2025

🙌 OpenHands: Code Less, Make More

Python 64,290 7,785 Updated Oct 17, 2025

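A comprehensive platform for Chinese text correction that integrates academic research, model training, model evaluation, and inference deployment, covering two core directions: spelling correction and grammatical error correction.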

Python 402 29 Updated Aug 21, 2025
Python 922 64 Updated May 22, 2024
Python 428 44 Updated Feb 7, 2025

An Approach to Enhancing the Efficacy of Post-Training Using Synthetic Data by Iterative Data Selection

Python 7 Updated Dec 24, 2024
Python 7 Updated Apr 20, 2025

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 384 25 Updated Apr 22, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,209 51 Updated Nov 16, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,824 135 Updated Jan 17, 2025

O1 Replication Journey

2,002 64 Updated Jan 14, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,836 376 Updated Oct 17, 2025

The official repo of INF-34B models trained by INF Technology.

Python 35 1 Updated Jul 25, 2024

An introduction to the Knowledge Engineering Lab at Sun Yat-sen University.

36 1 Updated Aug 24, 2025

LLM101n: Let's build a Storyteller

34,848 1,889 Updated Aug 1, 2024
Python 44 8 Updated Dec 12, 2024

The novel benchmark for MLLMs.

Python 3 Updated Dec 16, 2024

Numbers every LLM developer should know

4,260 139 Updated Jan 16, 2024