Skip to content
View danielhjz's full-sized avatar
🧲
imoud
🧲
imoud

Block or report danielhjz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,690 548 Updated Jul 11, 2024

Qianfan-VL: Domain-Enhanced Universal Vision-Language Models

173 13 Updated Sep 22, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 853 69 Updated Nov 24, 2025

A simple, fast and user-friendly alternative to 'find'

Rust 40,690 958 Updated Dec 2, 2025

Dolt – Git for Data

Go 19,333 592 Updated Dec 4, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,259 1,668 Updated Sep 24, 2025

[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 503 20 Updated Nov 5, 2025

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,495 313 Updated Feb 18, 2025

Demo of a customer service use case implemented with the OpenAI Agents SDK

TypeScript 5,866 905 Updated Aug 25, 2025

A lightweight LMM-based Document Parsing Model

Python 6,327 438 Updated Nov 19, 2025

"DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion" code repository

Python 1,220 304 Updated Jan 2, 2023

PyIceberg

Python 943 400 Updated Dec 3, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 18,796 2,639 Updated Dec 2, 2025

chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse

C++ 2,535 90 Updated Dec 3, 2025

Curate better data for LLMs

Python 1,063 102 Updated Mar 19, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,756 234 Updated Nov 25, 2025

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

JavaScript 571 58 Updated Dec 3, 2025

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 194 14 Updated Jun 20, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 632 58 Updated Mar 4, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Python 964 136 Updated Jun 21, 2025

[ACL 2025] Official resources of "FinanceReasoning: Benchmarking Financial Numerical Reasoning More Credible, Comprehensive and Challenging".

Python 3 1 Updated May 30, 2025

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

Python 3,586 662 Updated Dec 4, 2025

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,617 1,450 Updated Dec 3, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 28,502 3,504 Updated Sep 24, 2024

一个轻量化的大模型推理框架

Python 20 Updated May 26, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 10,504 1,707 Updated Dec 4, 2025

国家中小学智慧教育平台 电子课本下载工具,帮助您从智慧教育平台中获取电子课本的 PDF 文件网址并进行下载,让您更方便地获取课本内容。

Python 3,985 466 Updated Nov 27, 2025

Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond

Java 2,036 899 Updated Dec 3, 2025

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,703 974 Updated Dec 2, 2025
Next