AIML Online
Frequently Asked Questions in Problem Statement
Course: Supervised Learning
PART - A [30 Marks]
* Direct or Self-explanatory questions are not covered in this FAQ.
1. Data Understanding:
1 C. Compare Column names of all the 3 DataFrames and clearly write observations. [1 Mark]
→ Compare the column names of all three dataframes. Since the datasets will be merged row-wise,
checking that the column names, their order, and their types match is mandatory. Use a simple
comparison operator to check whether all 3 dataframes have the same column names, and write your
observations from the result.
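A minimal sketch, assuming the three dataframes are loaded as df_normal, df_type_s and df_type_h (use whatever variable names you actually chose):

    # Compare column names (and their order) across the three dataframes
    same_columns = (
        list(df_normal.columns) == list(df_type_s.columns) == list(df_type_h.columns)
    )
    print("All 3 dataframes have identical columns in the same order:", same_columns)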
1 D. Print DataTypes of all the 3 DataFrames. [1 Mark]
→ Print the datatypes of all the 3 dataframes and write your observations.
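For example (same assumed dataframe names as above):

    # Print the datatypes of each dataframe
    for name, frame in [("normal", df_normal), ("type_s", df_type_s), ("type_h", df_type_h)]:
        print(name)
        print(frame.dtypes, "\n")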
1 E. Observe and share variation in ‘Class’ feature of all the 3 DataFrames. [1 Mark]
→ Check the ‘Class’ variable’s distribution and categories.
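A quick way to see the variations, assuming the column is named 'Class' in all three dataframes:

    # Categories and their counts in each dataframe's 'Class' column
    for name, frame in [("normal", df_normal), ("type_s", df_type_s), ("type_h", df_type_h)]:
        print(name, frame["Class"].unique())
        print(frame["Class"].value_counts(), "\n")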
2. Data Preparation and Exploration:
2 A. Unify all the variations in ‘Class’ feature for all the 3 DataFrames. [1 Mark]
→ Unify the variations reported in the previous step 1.E.
Example - If the ‘Class’ variable of the ‘normal’ dataframe has ‘Normal’, ‘normal’ or ‘Nrml’, replace them
with ‘normal’. Similarly, check and unify the ‘Class’ values for the type_s and type_h dataframes.
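A minimal sketch; the replacement mapping below is only an example and should be built from the variations you actually observed in step 1.E:

    # Map spelling variations to a single canonical label (example values only)
    df_normal["Class"] = df_normal["Class"].replace({"Normal": "normal", "Nrml": "normal"})
    df_type_s["Class"] = df_type_s["Class"].replace({"Type_S": "type_s", "tp_s": "type_s"})
    df_type_h["Class"] = df_type_h["Class"].replace({"Type_H": "type_h", "tp_h": "type_h"})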
2 B. Combine all the 3 DataFrames to form a single DataFrame [1 Mark]
→ Combine the 3 datasets into one. As a checkpoint, the final dataframe should have 310 rows
and 7 columns.
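For example:

    import pandas as pd

    # Stack the three dataframes row-wise and verify the checkpoint shape
    df = pd.concat([df_normal, df_type_s, df_type_h], ignore_index=True)
    print(df.shape)  # expected: (310, 7)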
3. Data Analysis:
3 C. Visualize a pairplot with 3 classes distinguished by colors and share insights. [2 Marks]
→ Create a pairplot for the given variables, with the data points colored by the ‘Class’ categories, and
share your insights.
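For example, with seaborn (assuming the combined dataframe from step 2.B is named df):

    import seaborn as sns
    import matplotlib.pyplot as plt

    # Pairplot of the features, with points colored by the 'Class' categories
    sns.pairplot(df, hue="Class")
    plt.show()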
4. Model Building:
4 D. Print all the possible performance metrics for both train and test data. [2 Marks]
→ Print the performance metrics of the classification models, including accuracy, precision, recall,
F1 score, etc., for both the train and test data.
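A minimal sketch with scikit-learn, assuming you already have predictions for both splits (the names y_train_pred and y_test_pred are placeholders for your own variables):

    from sklearn.metrics import accuracy_score, classification_report, confusion_matrix

    for split, y_true, y_pred in [("train", y_train, y_train_pred), ("test", y_test, y_test_pred)]:
        print(f"--- {split} ---")
        print("Accuracy:", accuracy_score(y_true, y_pred))
        print(confusion_matrix(y_true, y_pred))
        print(classification_report(y_true, y_pred))  # precision, recall, F1 per class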
5. Performance Improvement:
5 A. Experiment with various parameters to improve performance of the base model. [2 Marks]
→ So far you would have run the model with its default settings; now tune it by changing the
parameters of KNeighborsClassifier() or the SVM function. First, explore on your own which parameters
are available in these models and check how you can fine-tune them by changing the options. A little
self-research is expected here. (Detailed parameter tuning will be covered in the feature engineering course.)
Reference link for hyperparameter tuning for a KNN problem -
https://medium.datadriveninvestor.com/k-nearest-neighbors-in-python-hyperparameters-tuning-716734bc557f
You can explore and tune the hyperparameters for other models too. You can also learn about grid search
and random search cross-validation techniques and use them.
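As an illustration, a grid search over a few KNN hyperparameters (the grid values below are assumptions, not prescriptions, and X_train/y_train are assumed to be your training split):

    from sklearn.model_selection import GridSearchCV
    from sklearn.neighbors import KNeighborsClassifier

    param_grid = {
        "n_neighbors": list(range(1, 21)),
        "weights": ["uniform", "distance"],
        "metric": ["euclidean", "manhattan"],
    }
    grid = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5, scoring="f1_weighted")
    grid.fit(X_train, y_train)
    print(grid.best_params_, grid.best_score_)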
PART - B [30 Marks]
1. Data Understanding and Preparation:
1 D. Change Datatype of below features to ‘Object’ [1 Mark]
‘CreditCard’, ‘InternetBanking’, ‘FixedDepositAccount’, ‘Security’, ‘Level’, ‘HiddenScore’.
[Reason behind performing this operation: the values in these features are binary, i.e. 1/0, but their
datatype is ‘int’/’float’, which is not what is expected for categorical features.]
→ These variables are categorical, with binary or multi-class values such as 0/1 or 1/2/3, but they are
stored as numeric types. Hence, convert them to the ‘Object’ type.
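A minimal sketch, assuming the Part B data is loaded in a dataframe named df:

    cols = ["CreditCard", "InternetBanking", "FixedDepositAccount", "Security", "Level", "HiddenScore"]
    df[cols] = df[cols].astype("object")
    print(df.dtypes)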
2. Data Exploration and Analysis:
2 A. Visualize distribution of Target variable ‘LoanOnCard’ and clearly share insights. [2 Marks]
→ Plot a suitable chart to display the distribution of the target variable and comment on the class balance.
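For example, a count plot (assuming the dataframe is named df):

    import seaborn as sns
    import matplotlib.pyplot as plt

    sns.countplot(x="LoanOnCard", data=df)
    plt.show()
    print(df["LoanOnCard"].value_counts(normalize=True))  # class proportions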
2 C. Check for unexpected values in each categorical variable and impute them with the best suitable
value. [2 Marks]
→ Unexpected values means: if all values in a feature are supposed to be 0/1, then values such as ‘?’, ‘a’,
or 1.5 are unexpected and need treatment (imputation with the most suitable value).
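A minimal sketch; the '?' below is only a placeholder for whatever unexpected value you actually find:

    # List the distinct values of each categorical column to spot anything unexpected
    for col in ["CreditCard", "InternetBanking", "FixedDepositAccount", "Security", "Level", "HiddenScore"]:
        print(col, df[col].unique())

    # Example imputation: replace the unexpected value with the column mode
    mode_value = df.loc[df["Security"] != "?", "Security"].mode()[0]
    df["Security"] = df["Security"].replace("?", mode_value)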
3. Data Preparation and model building:
3 D. Print evaluation metrics for the model and clearly share insights. [1 Mark]
→ Print the performance metrics of the classification model, including accuracy, precision, recall,
F1 score, etc., and share your insights.
3 E. Balance the data using the right balancing technique. [2 Marks]
→ Target balancing can be done by upsampling the minority class, downsampling the majority class, or
by using SMOTE, depending on the target distribution. A little self-research is expected here.
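For instance, with SMOTE from the imbalanced-learn package (assuming X_train/y_train are your training split; upsampling the minority class with sklearn's resample works just as well):

    import pandas as pd
    from imblearn.over_sampling import SMOTE

    # Oversample the minority class on the training data only
    smote = SMOTE(random_state=42)
    X_train_bal, y_train_bal = smote.fit_resample(X_train, y_train)
    print(pd.Series(y_train_bal).value_counts())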
4. Performance Improvement:
4 A. Train a base model each for SVM, KNN. [4 Marks]
→ Build a base model for each of SVM and KNN on the balanced data, without tuning any parameters.
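A minimal sketch, assuming the balanced training data from step 3.E and an untouched test split:

    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.svm import SVC

    # Base models with default parameters
    knn = KNeighborsClassifier().fit(X_train_bal, y_train_bal)
    svm_clf = SVC().fit(X_train_bal, y_train_bal)

    print("KNN test accuracy:", knn.score(X_test, y_test))
    print("SVM test accuracy:", svm_clf.score(X_test, y_test))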
4 B. Tune parameters for each of the models wherever required and finalize a model. [3 Marks]
(Optional: Experiment with various Hyperparameters - Research required)
→ Tune the parameters as performed in Part A, Question 5 A.
You can tune the model by changing the parameters of KNeighborsClassifier() or the SVM function. First,
explore on your own which parameters are available in these models and check how you can fine-tune
them by changing the options. A little self-research is expected here. (Detailed parameter tuning will be
covered in the feature engineering course.)
Reference link for hyperparameter tuning for a KNN problem -
https://medium.datadriveninvestor.com/k-nearest-neighbors-in-python-hyperparameters-tuning-716734bc557f
You can explore and tune the hyperparameters for other models too.
4 C. Print evaluation metrics for the final model. [1 Mark]
→ Print the performance metrics of the final model, including accuracy, precision, recall, F1 score, etc.
4 D. Share improvement achieved from base model to final model. [2 Marks]
→ Show the performance improvement by comparing the base model’s and the final model’s
performance reports.
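One way to present this, assuming knn is the base model and grid is the fitted grid search from the earlier sketches:

    from sklearn.metrics import f1_score

    # Compare base and tuned models on the same test set
    for name, model in [("base KNN", knn), ("tuned KNN", grid.best_estimator_)]:
        pred = model.predict(X_test)
        print(name, "test F1:", f1_score(y_test, pred, average="weighted"))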
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited