jweihe
🦙
Focusing
  • Institute of Computing Technology, CAS
  • Beijing

Highlights

  • Pro

Starred repositories

[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Python 81 2 Updated Sep 14, 2024

📕 Xiaohongshu (小红书) creator MCP toolkit - a content creation and publishing tool that supports integration with AI clients

Python 481 57 Updated Jun 27, 2025

[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet

Python 110 14 Updated Jun 10, 2025

Simple RL training for reasoning

Python 3,656 272 Updated Apr 10, 2025

Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,983 178 Updated Aug 13, 2024

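RLHF handbook in Chinese - a detailed walkthrough of every optimization stage of the RLHF pipeline, covering instruction tuning, reward model training, and key techniques such as rejection sampling, reinforcement learning, and direct alignment algorithms.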

TeX 2 1 Updated May 7, 2025

Quarto template for Chinese academic writing

Lua 57 6 Updated Jun 10, 2025

Train transformer language models with reinforcement learning.

Python 14,425 2,006 Updated Jul 2, 2025

A handy script for grabbing and occupying GPUs

Cuda 338 39 Updated Jan 20, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,701 244 Updated Jul 2, 2025

An open-source deep research clone. An AI agent that reasons over large amounts of web data extracted with Firecrawl

TypeScript 5,799 714 Updated May 7, 2025

Production-ready platform for agentic workflow development.

TypeScript 105,477 15,919 Updated Jul 2, 2025

An introductory LangChain tutorial in Chinese

8,287 648 Updated Apr 19, 2025

Firefly: a training tool for large models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,470 584 Updated Oct 24, 2024
132 6 Updated May 8, 2025
Python 45 1 Updated Feb 19, 2024

Awesome list for LLM pruning.

233 9 Updated Dec 15, 2024

Official implementation for LaCo (EMNLP 2024 Findings)

Jupyter Notebook 17 4 Updated Oct 3, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,292 517 Updated May 18, 2025
Python 133 8 Updated Nov 3, 2023

Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"

Python 55 3 Updated Oct 1, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,017 119 Updated Jun 1, 2023

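This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.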

Python 91 9 Updated Sep 14, 2024

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 963 61 Updated Jun 14, 2025

A paper list of recent works on token compression for ViT and VLM

536 24 Updated Jun 30, 2025

Code for DeCo: Decoupling token compression from semantic abstraction in multimodal large language models

38 1 Updated Jul 8, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 437 40 Updated Feb 1, 2024

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,425 734 Updated Jun 7, 2025