A modern data stack for ingestion, processing, and data analytics using MinIO, Trino, Spark, and Jupyter
Build the entire data platform
docker-compose up -d
Query Engine Shell
docker container exec -it query-engine trino
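Once inside the Trino shell, a quick way to verify connectivity is to list what the engine can see; a minimal sketch, assuming the `minio` and `operational` catalogs used below are defined in the compose setup's Trino configuration:

```sql
-- List the catalogs Trino knows about; `minio` and `operational`
-- should appear if the compose configuration wired them up.
SHOW CATALOGS;

-- List the schemas inside the MinIO-backed catalog.
SHOW SCHEMAS FROM minio;
```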
Creating schemas through the query engine
CREATE SCHEMA minio.data_lake
WITH (location = 's3a://warehouse/');
CREATE TABLE minio.data_lake.companies
WITH (
format = 'PARQUET',
external_location = 's3a://warehouse/companies/'
)
AS SELECT * FROM operational.business.organizations;
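After the CTAS completes, the new Parquet-backed table can be queried straight away; a small sanity check, run from the same Trino shell:

```sql
-- Confirm the CTAS landed rows in the external location.
SELECT count(*) FROM minio.data_lake.companies;

-- Inspect the table definition Trino registered in the metastore,
-- including the format and external_location set above.
SHOW CREATE TABLE minio.data_lake.companies;
```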
Log into the postgres container
docker exec -it "postgres" psql -U admin -d "hive_db"
To inspect the metadata catalog
SELECT * FROM "DBS";
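Beyond "DBS", the Hive metastore schema also tracks registered tables and their storage descriptors; a sketch of two follow-up queries against the standard metastore tables "TBLS" and "SDS":

```sql
-- Tables registered in the metastore, with the schema each belongs to.
SELECT t."TBL_NAME", d."NAME" AS schema_name
FROM "TBLS" t
JOIN "DBS" d ON t."DB_ID" = d."DB_ID";

-- Physical storage locations backing each table (e.g. s3a:// paths).
SELECT "LOCATION" FROM "SDS";
```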
Shutdown
docker-compose down
Interactive Scala Shell
docker run -it spark /opt/spark/bin/spark-shell
Interactive Python Shell
docker run -it spark:python3 /opt/spark/bin/pyspark