
Starred repositories
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Upserts, Deletes And Incremental Processing on Big Data.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)
OLAP Database Performance Tuning Guide
Open, Multi-modal Catalog for Data & AI
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶
A PyTorch Implementation of Single Shot MultiBox Detector
Open-source simulator for autonomous driving research.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Autoware - the world's leading open-source software project for autonomous driving
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
An open-source NLP research library, built on PyTorch.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…