Stars
The "Python Machine Learning (1st edition)" book code repository and info resource
Base classes to use when writing tests with Spark
Examples for High Performance Spark
More than 2000+ Data engineer interview questions.
A topic-centric list of HQ open datasets.
Apache Spark - A unified analytics engine for large-scale data processing