zhujiem

165 followers · 19 following

Hong Kong

Stars

TencentCloudADP / youtu-embedding

Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 144 14 Updated Oct 22, 2025

Yuliang-Liu / MonkeyOCR

A lightweight LMM-based Document Parsing Model

Python 6,143 424 Updated Oct 25, 2025

selous123 / al_sid

[Pytorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 97 8 Updated Oct 30, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,824 293 Updated Nov 2, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,802 937 Updated Nov 2, 2025

Jyonn / RecBench

Benchmarking Recommendation Abilities for Large Language Models

Python 22 1 Updated Jul 18, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,719 1,272 Updated Nov 2, 2025

QwenLM / Qwen3-Embedding

Python 1,542 89 Updated Sep 30, 2025

shareAI-lab / Kode

Open Agent Coding CLI, Koding with GLM, Qwen, Kimi, DeepSeek etc. (welcome to use Kode to submit PRs)

TypeScript 3,393 506 Updated Oct 9, 2025

reczoo / RecBase

RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation

Python 7 1 Updated Oct 23, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 15,700 1,188 Updated Oct 31, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 47,949 3,958 Updated Oct 31, 2025

salmon1802 / QNN

[KDD 2025] Quadratic Neural Networks for Click-through Rate Prediction

Python 10 1 Updated May 29, 2025

bytedance / Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,689 629 Updated Oct 27, 2025

charent / ChatLM-mini-Chinese

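A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.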

Python 1,627 185 Updated Apr 20, 2024

luyilong2015 / open-webui-pipeline-for-ragflow

Uses the pipelines feature of open-webui to call a ragflow agent from within open-webui, enabling knowledge-base-driven intelligent chat with a polished interface.

Python 136 24 Updated Oct 31, 2025

open-webui / pipelines

Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework

Python 2,133 642 Updated Aug 18, 2025

infiniflow / ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,953 7,116 Updated Nov 3, 2025

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 117,878 18,212 Updated Nov 3, 2025

jingyaogong / minimind

🚀🚀 [LLM] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏

Python 32,287 3,729 Updated Nov 2, 2025

828Tina / qwen3-ft-swift

Python 5 Updated May 13, 2025

tingaicompass / AI-Compass

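“AI-Compass” aims to guide the community in navigating the ocean of AI technology. Whether you are a beginner or an advanced developer, you can find a path here to each major area of AI. It is intended to help developers systematically understand AI's core concepts, mainstream technologies, and emerging trends, and to master the full process from theory to deployment through practice.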

367 39 Updated Nov 1, 2025

twitter / the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Python 10,386 2,234 Updated Jul 10, 2024

NVIDIA / recsys-examples

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 161 36 Updated Oct 31, 2025

logseq / logseq

A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. roadmap: http://trello.com/b/8txSM12G/roadmap

Clojure 39,183 2,350 Updated Nov 1, 2025

OpenSparseLLMs / MoM

Python 104 3 Updated Sep 17, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,018 2,407 Updated Nov 3, 2025

Indolent-Kawhi / EAGER-LLM

A decoder-only, LLM-based generative recommendation framework that integrates endogenous and exogenous behavioral and semantic information in a non-intrusive manner

Python 11 Updated Mar 14, 2025

XingyuLu206 / ROMA

This is the repository for “ROMA: Recommendation-Oriented Language Model Adaptation Using Multi-Modal Multi-Domain Item Sequences” in KDD 2025

Python 1 Updated Feb 12, 2025

Xnhyacinth / Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,800 76 Updated Oct 31, 2025
