Stars
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
A Multi-Agent Trading System Based on Internal Contest Mechanism
jina-ai / jina-vdr
Forked from illuin-tech/vidore-benchmark
Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
Industrial-first evaluation benchmark for LLMs in the DevOps/AIOps domain.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…
GraalPy – A high-performance embeddable Python 3 runtime for Java
Ongoing research training transformer models at scale
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
A curated, but incomplete, list of data-centric AI resources.
A powerful tool for creating fine-tuning datasets for LLM
Infisical is the open-source platform for secrets, certificates, and privileged access management.
🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
OCR & Document Extraction using vision models
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Model Context Protocol Servers
[ACL 2024 Findings] Code implementation of the paper "Rethinking Negative Instances for Generative Named Entity Recognition"
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be …
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.



