DataScience - ML DEEP LEARNING - LPEI - 120 Days
Weeks 13-15: Three Capstone Projects with Industry-Level Practices and AWS Deployment
● Final Project Kickoff: Industry-Level Capstone Project.
● Capstone Project Presentation and Introduction.
● Project Presentation and Review
Sample Assignments:
Python Assignments
NumPy Assignments
Given the NumPy array mat below, perform the matrix operations using slicing and broadcasting techniques.
mat = np.arange(1,26).reshape(5,5)
1. # WRITE CODE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
array([[12, 13, 14, 15],
[17, 18, 19, 20],
[22, 23, 24, 25]])
2. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
20
3. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
array([[ 2],
[ 7],
[12]])
4. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
array([21, 22, 23, 24, 25])
5. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
325
6. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
7.2111025509279782
7. # WRITE CODE HERE THAT REPRODUCES THE OUTPUT OF THE CELL BELOW
array([55, 60, 65, 70, 75])
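One possible way to approach the seven outputs above is sketched below. The variable names out1 to out7 are illustrative and not part of the assignment, and other slices or methods can produce the same results.

```python
import numpy as np

mat = np.arange(1, 26).reshape(5, 5)

# 1. Rows 2 onward, columns 1 onward (bottom-right 3x4 block).
out1 = mat[2:, 1:]

# 2. Single element at row 3, column 4.
out2 = mat[3, 4]

# 3. First three rows of column 1, kept two-dimensional by slicing 1:2.
out3 = mat[:3, 1:2]

# 4. The last row as a one-dimensional array.
out4 = mat[4, :]

# 5. Sum of every element in the matrix.
out5 = mat.sum()

# 6. Population standard deviation of all elements (NumPy's default, ddof=0).
out6 = mat.std()

# 7. Column-wise sums (collapse the row axis).
out7 = mat.sum(axis=0)

for label, value in enumerate([out1, out2, out3, out4, out5, out6, out7], start=1):
    print(label, repr(value))
```

Slicing with 1:2 instead of the plain index 1 in item 3 is what preserves the column shape (3, 1).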
Pandas Assignments
1. Write a Pandas program to add, subtract, multiply and divide two Pandas Series.
Sample Series: [2, 4, 6, 8, 10], [1, 3, 5, 7, 9]
exam_data = {'name': ['Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin',
'Jonas'],
'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
Expected Output:
b3 Dima no 9.0
4. Write a Pandas program to select the rows where the number of attempts in the examination is less than 2 and
the score is greater than 15.
Sample DataFrame:
exam_data = {'name': ['Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin',
'Jonas'],
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
5. Write a Pandas program to calculate the mean score for each student in the DataFrame.
Sample DataFrame:
exam_data = {'name': ['Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin',
'Jonas'],
'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']
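A minimal sketch of the Series arithmetic, row selection and mean-score tasks follows. The two Series are the sample values from question 1. The sample DataFrame is incomplete in the text above, so the small frame here is hypothetical: its 'score' and 'attempts' values and its four-row size are invented purely for illustration.

```python
import pandas as pd

# Question 1: element-wise arithmetic on the two sample Series.
s1 = pd.Series([2, 4, 6, 8, 10])
s2 = pd.Series([1, 3, 5, 7, 9])
added = s1 + s2
subtracted = s1 - s2
multiplied = s1 * s2
divided = s1 / s2

# Hypothetical frame: the 'score' and 'attempts' values are assumptions,
# not the data set intended by the assignment.
exam = pd.DataFrame(
    {
        'name': ['Anastasia', 'Dima', 'Katherine', 'Jonas'],
        'score': [12.5, 16.5, 20.0, 19.0],
        'attempts': [1, 2, 3, 1],
        'qualify': ['yes', 'no', 'yes', 'yes'],
    },
    index=['a', 'b', 'c', 'd'],
)

# Question 4: combine two boolean masks with & (each side needs parentheses).
selected = exam[(exam['attempts'] < 2) & (exam['score'] > 15)]

# Question 5: mean score per student, grouping rows by name.
mean_by_name = exam.groupby('name')['score'].mean()

print(added.tolist(), subtracted.tolist(), multiplied.tolist())
print(selected)
print(mean_by_name)
```

With one row per student the grouped mean simply returns each score; the groupby form matters once a name appears in several rows.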
MySQL Assignments
1. Write SQL queries to explore the World Population Dataset
2. Use various JOINs and subqueries on the Chinook Database
3. Perform aggregation tasks using the IMDB Movies Dataset
4. Apply window functions on the Airbnb Listings Dataset
5. Optimize complex queries for the COVID-19 Cases Dataset
6. Write CTEs and recursive queries for the Employee Database
7. Conduct a full analysis using SQL on the Sales Data
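As an illustration of the recursive CTE task in item 6, the sketch below walks a reporting hierarchy. It makes several assumptions: the employees table, its columns and its four rows are hypothetical, and it runs on Python's built-in sqlite3 module so that it is self-contained. The WITH RECURSIVE syntax shown is also accepted by MySQL 8.0 and later.

```python
import sqlite3

# In-memory database with a hypothetical employees table (not the course data set).
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER);
    INSERT INTO employees VALUES
        (1, 'Asha', NULL),
        (2, 'Ben', 1),
        (3, 'Chen', 2),
        (4, 'Dana', 1);
    """
)

# Anchor member: employees with no manager (depth 0).
# Recursive member: employees whose manager is already in the chain.
query = """
WITH RECURSIVE chain(id, name, depth) AS (
    SELECT id, name, 0 FROM employees WHERE manager_id IS NULL
    UNION ALL
    SELECT e.id, e.name, c.depth + 1
    FROM employees AS e
    JOIN chain AS c ON e.manager_id = c.id
)
SELECT name, depth FROM chain ORDER BY depth, name;
"""

rows = conn.execute(query).fetchall()
for name, depth in rows:
    print("  " * depth + name)
conn.close()
```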
Supervised Machine Learning Algorithms (Associate Level) Assignments
1. Implement a simple regression model using the Advertising Dataset
2. Apply logistic regression on the Diabetes Dataset
3. Build a decision tree classifier using the Wine Quality Dataset
4. Use Random Forest to classify the Breast Cancer Dataset
5. Evaluate a classification model using precision, recall, and F1-score on the Heart Disease Dataset
6. Perform hyperparameter tuning on the Car Evaluation Dataset
7. Implement k-fold cross-validation on the Housing Prices Dataset
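The sketch below combines three of the ideas from the list above: a Random Forest classifier, precision/recall/F1 evaluation, and k-fold cross-validation. It assumes scikit-learn is installed and uses the breast cancer data bundled with scikit-learn, which may differ from the dataset the course distributes. The split ratio, the number of trees, and the choice of 5 folds are arbitrary illustrative settings.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score, precision_score, recall_score
from sklearn.model_selection import cross_val_score, train_test_split

# Bundled data set (assumption: stands in for the course's Breast Cancer Dataset).
X, y = load_breast_cancer(return_X_y=True)

# Hold out 20% of the rows for testing, keeping class proportions the same.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

model = RandomForestClassifier(n_estimators=100, random_state=42)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Metrics on the held-out test set for the positive class (label 1).
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")

# 5-fold cross-validation: one accuracy score per fold on the full data.
cv_scores = cross_val_score(
    RandomForestClassifier(n_estimators=100, random_state=42), X, y, cv=5
)
print("fold accuracies:", cv_scores.round(3), "mean:", round(cv_scores.mean(), 3))
```

Cross-validation gives a steadier estimate than a single train/test split because every row is used for testing exactly once.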