This is the github project for the following coursera specialization: Advanced Data Science with IBM https://www.coursera.org/launch/advanced-applied-data-science-ibm Amir: For Spark ML code, look under coursera_ml/a2_w1_s3_SparkML_LR.ipynb and similar files