jweihe

Follow

🦙

Focusing

jweihe jweihe

🦙

Focusing

Follow

14 followers · 54 following

Institute of Computing Technology, CAS
Beijing

Achievements

Achievements

Highlights

Pro

Starred repositories

OpenGVLab / MMIU

[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Python 81 2 Updated Sep 14, 2024

aki66938 / xhs-toolkit

📕 小红书创作者MCP工具包 - 支持与AI客户端集成的内容创作和发布工具

Python 481 57 Updated Jun 27, 2025

bingreeky / MaAS

[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet

Python 110 14 Updated Jun 10, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,656 272 Updated Apr 10, 2025

LC1332 / Chat-Haruhi-Suzumiya

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,983 178 Updated Aug 13, 2024

jweihe / RLHF-book-Chinese

RLHF中文手册 - 详细解析RLHF全流程优化阶段，涵盖指令调优、奖励模型训练，以及拒绝采样、强化学习和直接对齐算法等关键技术。

TeX 2 1 Updated May 7, 2025

TomBener / quarto-chinese

Quarto template for Chinese academic writing

Lua 57 6 Updated Jun 10, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,425 2,006 Updated Jul 2, 2025

godweiyang / GrabGPU

一款便捷的抢占显卡脚本

Cuda 338 39 Updated Jan 20, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,701 244 Updated Jul 2, 2025

nickscamara / open-deep-research

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 5,799 714 Updated May 7, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 105,477 15,919 Updated Jul 2, 2025

liaokongVFX / LangChain-Chinese-Getting-Started-Guide

LangChain 的中文入门教程

8,287 648 Updated Apr 19, 2025

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,470 584 Updated Oct 24, 2024

HCIILAB / M6Doc

132 6 Updated May 8, 2025

zankner / Hydra

Python 45 1 Updated Feb 19, 2024

pprp / Awesome-LLM-Prune

Awesome list for LLM pruning.

233 9 Updated Dec 15, 2024

yangyifei729 / LaCo

Official implementation for LaCo (EMNLP 2024 Findings)

Jupyter Notebook 17 4 Updated Oct 3, 2024

datawhalechina / agent-tutorial

306 32 Updated Mar 19, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,292 517 Updated May 18, 2025

lz1oceani / verify_cot

Python 133 8 Updated Nov 3, 2023

ytyz1307zzh / RefAug

Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"

Python 55 3 Updated Oct 1, 2024

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,017 119 Updated Jun 1, 2023

percent4 / llm_math_solver

本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。

Python 91 9 Updated Sep 14, 2024

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 963 61 Updated Jun 14, 2025

google-deepmind / alphageometry

Python 4,536 525 Updated Jun 19, 2025

daixiangzi / Awesome-Token-Compress

A paper list of some recent works about Token Compress for Vit and VLM

536 24 Updated Jun 30, 2025

yaolinli / DeCo

Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models

38 1 Updated Jul 8, 2024

meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 437 40 Updated Feb 1, 2024

nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,425 734 Updated Jun 7, 2025

Starred topics

Python