-
minio_dev Public
Forked from minio/minio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Go GNU Affero General Public License v3.0 Updated Feb 18, 2025 -
streamlit Public
Forked from streamlit/streamlit
Streamlit: a faster way to build and share data apps.
Python Apache License 2.0 Updated Mar 5, 2024 -
cassandra Public
Forked from apache/cassandra
Mirror of Apache Cassandra
Java Apache License 2.0 Updated Dec 11, 2023 -
data-engineer-handbook Public
Forked from DataExpert-io/data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
Updated Nov 20, 2023 -
hudi Public
Forked from apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
Java Apache License 2.0 Updated Nov 5, 2023 -
amazon-sagemaker-examples Public
Forked from aws/amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Jupyter Notebook Apache License 2.0 Updated May 9, 2022 -
aws-mwaa-local-runner Public
Forked from aws/aws-mwaa-local-runner
This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
Shell MIT No Attribution Updated Mar 29, 2022 -
spark-moderndataengineering Public
Forked from newfront/spark-moderndataengineering
The source code for the book Modern Data Engineering with Apache Spark
Scala Apache License 2.0 Updated Mar 13, 2022 -
docker-images Public
Forked from oracle/docker-images
Official source for Docker configurations, images, and examples of Dockerfiles for Oracle products and projects
-
spark-excel Public
Forked from nightscape/spark-excel
A Spark plugin for reading Excel files via Apache POI
Scala Apache License 2.0 Updated Feb 8, 2022 -
awesome-apache-airflow Public
Forked from jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
Shell Updated Jan 28, 2022 -
spark-bigquery-connector Public
Forked from GoogleCloudDataproc/spark-bigquery-connector
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Java Apache License 2.0 Updated Jan 26, 2022 -
data-dockerfiles Public
Forked from irbigdata/data-dockerfiles
A curated list of docker-compose files prepared for testing data engineering tools, databases and open source libraries.
Jupyter Notebook Updated Jan 11, 2022 -
This repository contains the basic definition for the AWS Glue DataCatalog Database
-
terraform-aws-dynamodb Public
This repository contains the basic definition for the AWS DynamoDB table.
HCL Updated Dec 27, 2021 -
This repository contains the basic definition for the AWS CodeBuild
HCL Updated Dec 27, 2021 -
terraform-aws-sqs-queue Public
This repository contains the basic definition for the AWS SQS Queue deployment
HCL Updated Dec 27, 2021 -
terraform-aws-s3-bucket Public
This repository contains the basic definition for the AWS S3 bucket
HCL Updated Dec 27, 2021 -
terraform-aws-glue-job Public
This repository contains the basic definition for the AWS Glue job deployment
-
aws-glue-developer-guide Public
Forked from awsdocs/aws-glue-developer-guide
The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
Other Updated Dec 5, 2021 -
aws-devops-essential Public
Forked from awslabs/aws-devops-essential
In a few hours, quickly learn how to effectively leverage various AWS services to improve developer productivity and reduce the overall time to market for new product capabilities.
Shell Apache License 2.0 Updated Sep 22, 2021 -
aws-transfer-sftp-ip-whitelisting-workshop Public
Forked from aws-samples/aws-transfer-sftp-ip-whitelisting-workshop
MIT No Attribution Updated Aug 17, 2021 -
easy-rsa Public
Forked from OpenVPN/easy-rsa
easy-rsa - Simple shell based CA utility
Shell Other Updated Jul 3, 2021 -
packer Public
Forked from hashicorp/packer
Packer is a tool for creating identical machine images for multiple platforms from a single source configuration.
Go Mozilla Public License 2.0 Updated Jul 2, 2021 -
tfc-getting-started Public
Forked from hashicorp/tfc-getting-started
An example Terraform configuration for Terraform Cloud
Shell Mozilla Public License 2.0 Updated Jun 27, 2021 -
aws-doc-sdk-examples Public
Forked from awsdocs/aws-doc-sdk-examples
Welcome to the AWS Code Examples Repository. This repo contains code examples used in the AWS documentation, AWS SDK Developer Guides, and more. For more information, see the Readme.rst file below.
Java Apache License 2.0 Updated Jun 24, 2021 -
terraform-provider-aws Public
Forked from hashicorp/terraform-provider-aws
Terraform AWS provider
Go Mozilla Public License 2.0 Updated Jun 18, 2021 -
aws-glue-libs Public
Forked from awslabs/aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
Python Other Updated Jun 9, 2021 -
provision-codepipeline-glue-workflows Public
Forked from aws-samples/provision-codepipeline-glue-workflows
Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows
Python MIT No Attribution Updated Jun 5, 2021