Stars
ChatMCP is an AI chat client implementing the Model Context Protocol (MCP).
DuckLake is an integrated data lake and catalog format
Apache Doris is an easy-to-use, high-performance, unified analytics database.
The next generation of cloud-native big data management expert, aiming to help users rapidly build stable, efficient, and scalable cloud-native platforms for big data.
CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on und…
A full-featured license tool to check and fix license headers and resolve dependencies' licenses.
oap-project / velox
Forked from facebookincubator/velox
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
A composable and fully extensible C++ execution engine library for data management systems.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Spring Boot starter module for gRPC framework.
LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impl…
An antdv-based middle- and back-office management system
A modern, lambda-friendly, 120 character Java formatter.
Bigtop Manager is a modern, AI-driven web application designed to simplify the complexity of big data cluster management.
The Metadata Platform for your Data and AI Stack
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
docker systemctl replacement - allows deploying to systemd-controlled containers without starting an actual systemd daemon (e.g. centos7, ubuntu16)
Upserts, Deletes And Incremental Processing on Big Data.
Apache Flink Kubernetes Operator
Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond
Apache Ambari Logsearch is a subproject of Apache Ambari.
Apache Ambari Infra is a subproject of Apache Ambari.
Apache Ambari Metrics is a subproject of Apache Ambari.
Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and configuration of the leading open source big data components.