Organizations

@wsdream @anyai @logpai


Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 144 14 Updated Oct 22, 2025

A lightweight LMM-based Document Parsing Model

Python 6,143 424 Updated Oct 25, 2025

[PyTorch] Code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 97 8 Updated Oct 30, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 3,824 293 Updated Nov 2, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,802 937 Updated Nov 2, 2025

Benchmarking Recommendation Abilities for Large Language Models

Python 22 1 Updated Jul 18, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,719 1,272 Updated Nov 2, 2025
Python 1,542 89 Updated Sep 30, 2025

Open Agent Coding CLI, Koding with GLM, Qwen, Kimi, DeepSeek, etc. (welcome to use Kode to submit PRs)

TypeScript 3,393 506 Updated Oct 9, 2025

RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation

Python 7 1 Updated Oct 23, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,700 1,188 Updated Oct 31, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 47,949 3,958 Updated Oct 31, 2025

[KDD 2025] Quadratic Neural Networks for Click-through Rate Prediction

Python 10 1 Updated May 29, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,689 629 Updated Oct 27, 2025

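A small 0.2B-parameter Chinese dialogue model (ChatLM-Chinese-0.2B). Open-sources all the code for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning on downstream tasks and includes a fine-tuning example for triple information extraction.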

Python 1,627 185 Updated Apr 20, 2024

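Uses the pipelines feature of open-webui to call a ragflow agent from within open-webui, enabling knowledge-base-driven intelligent conversation with an attractive interface.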

Python 136 24 Updated Oct 31, 2025

Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework

Python 2,133 642 Updated Aug 18, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,953 7,116 Updated Nov 3, 2025

Production-ready platform for agentic workflow development.

TypeScript 117,878 18,212 Updated Nov 3, 2025

🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 32,287 3,729 Updated Nov 2, 2025
Python 5 Updated May 13, 2025

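"AI-Compass" aims to give the community a sense of direction in the sea of AI technology. Whether you are a beginner or an advanced developer, you can find here a path into each major area of AI. It is intended to help developers systematically understand AI's core concepts, mainstream technologies, and frontier trends, and to master the whole process from theory to deployment through practice.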

367 39 Updated Nov 1, 2025

Source code for Twitter's Recommendation Algorithm

Python 10,386 2,234 Updated Jul 10, 2024

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 161 36 Updated Oct 31, 2025

A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap

Clojure 39,183 2,350 Updated Nov 1, 2025
Python 104 3 Updated Sep 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,018 2,407 Updated Nov 3, 2025

A decoder-only, LLM-based generative recommendation framework that integrates endogenous and exogenous behavioral and semantic information in a non-intrusive manner

Python 11 Updated Mar 14, 2025

This is the repository for “ROMA: Recommendation-Oriented Language Model Adaptation Using Multi-Modal Multi-Domain Item Sequences” (KDD 2025)

Python 1 Updated Feb 12, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,800 76 Updated Oct 31, 2025