Hadoop Ecosystem Tools and Spark(PySpark) Code Examples Hands on example on code written in pyspark 2.3 The datasets used for the scripts are available here - https://github.com/subhasis85/DataSets