0% found this document useful (0 votes)

46 views8 pages

Assignment II Machine Learning

The document describes an assignment for a machine learning course involving support vector machines (SVM). It includes: 1) A description of the SVM algorithm and how it works. 2) Examples of preprocessing a student performance dataset in Python, including cleaning, aggregating and transforming the data. 3) An example Python code to build an SVM classification model on a social network advertising dataset, including data preprocessing, training and evaluating the model.

Uploaded by

Hussein Ibrahim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views8 pages

Assignment II Machine Learning

Uploaded by

Hussein Ibrahim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

SCHOOL OF TECHNOLOGY

BACHELOR OF INFORMATION SECURITY AND FORENSICS &

BACHELOR OF SOFTWARE DEVELOPMENT & BACHELOR IN INFORMATION
FORENSICS AND SECURITY
MACHINE LEARNING
JANUARY-APRIL 2023
ASSIGNMENT II

MEMBERS.
Ibrahim Hussein 19/05592 BISF
Moses Kipngeno 19/05914 BISF
Everlyne Nelius Irungu 19/05463 BISF
Alice Njeri Kuria 19/05790 BISF
Collins Njoroge 19/02573 BISF
ACTIVITY
1. Describe the Support Vector Machine algorithm.

Support Vector Machine (SVM) is a powerful machine learning algorithm used for
classification and regression tasks.
It works by finding the best hyper plane that separates the data points into different
classes in a high-dimensional space.
The SVM algorithm works through:
i. Data preprocessing: the input data is first preprocessed to ensure that it is in a suitable
format for Support Vector Machine. It may include scaling, normalization and other
transformations to ensure that the data is centered and the features are on similar
scales.
ii. Feature mapping: SVM maps the input data into a higher dimensional space using a
kernel function. This helps find a hyper plane that can effectively separate the data
points given.
iii. Hyper plane selection: SVM then searches for the optimal hyper plane that separates
the data points with maximum margin. The margin is (the distance between the hyper
plane and the closest data points from each class). The larger the margin, the more
confident the algorithm is about its classification.
iv. Support vector identification: The data points closest to the hyper plane on each side
are known as support vectors. These support vectors determine the position of the
hyper plane and are used to calculate the margin.
v. Classification: Once the optimal hyper plane is found, SVM uses it to classify new
data points based on which side of the hyper plane they fall on. If the data point falls
on the positive side of the hyper plane, it is classified as one class, and if it falls on
the negative side, it is classified as the other class.

SVM can therefore handle both linear and non-linearly separable data by using different
kernel functions. Kernel functions used in SVM include linear, polynomial, radial basis
function (RBF), and sigmoid.
SVM is a powerful algorithm for classification tasks and can handle high dimensional
datasets with complex decision boundaries as seen above.
SVM disadvantage is that it’s still not suitable for large datasets because of its high
training time.
2. Preprocess a selected dataset
Data preprocessing is the process of preparing the raw data and making it suitable for machine
learning models. Data preprocessing includes data cleaning for making the data ready to be given
to machine learning model
Below is a dataset containing student performances. We apply various data preprocessing
commands to the dataset as shown below.
import pandas as pd
import numpy as np

#read csv
df_excel = pd.read_csv('StudentsPerformance.csv')
df_excel

#first look
df_excel.describe()

#calculate specific columns

df_excel['math score'].sum()
df_excel['math score'].mean()
df_excel['math score'].max()
df_excel['math score'].min()
df_excel['math score'].count()

#calculate specific rows

df_excel['average'] = (df_excel['math score'] + df_excel['reading score']

+ df_excel['writing score'])/3
df_excel.mean(axis=1)
df_excel.head()

# count
df_excel['gender'].value_counts()

# if condition
df_excel['pass/fail'] = np.where(df_excel['average'] > 70, 'Pass', 'Fail')
df_excel.head()

# multiple conditions
conditions = [
(df_excel['average']>=90),
(df_excel['average']>=80) & (df_excel['average']<90),
(df_excel['average']>=70) & (df_excel['average']<80),
(df_excel['average']>=60) & (df_excel['average']<70),
(df_excel['average']>=50) & (df_excel['average']<60),
(df_excel['average']<50),
]

values = ['A', 'B', 'C', 'D', 'E', 'F']

df_excel['grades'] = np.select(conditions, values)
df_excel.head()

# show first 5 rows

df_excel[['average', 'pass/fail', 'grades']].head()
3. Using an example in Python and a sample dataset build an SVM model.

# Support Vector Machine

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the datasets

datasets = pd.read_csv('Social_Network_Ads.csv')
X = datasets.iloc[:, [2,3]].values
Y = datasets.iloc[:, 4].values

# Splitting the dataset into the Training set and Test set

from sklearn.model_selection import train_test_split

X_Train, X_Test, Y_Train, Y_Test = train_test_split(X, Y, test_size = 0.25,
random_state = 0)

# Feature Scaling

from sklearn.preprocessing import StandardScaler

sc_X = StandardScaler()
X_Train = sc_X.fit_transform(X_Train)
X_Test = sc_X.transform(X_Test)

# Fitting the classifier into the Training set

from sklearn.svm import SVC

classifier = SVC(kernel = 'linear', random_state = 0)
classifier.fit(X_Train, Y_Train)

# Predicting the test set results

Y_Pred = classifier.predict(X_Test)

# Making the Confusion Matrix

from sklearn.metrics import confusion_matrix

cm = confusion_matrix(Y_Test, Y_Pred)

# Visualising the Training set results

from matplotlib.colors import ListedColormap

X_Set, Y_Set = X_Train, Y_Train
X1, X2 = np.meshgrid(np.arange(start = X_Set[:, 0].min() - 1, stop = X_Set[:,
0].max() + 1, step = 0.01),
np.arange(start = X_Set[:, 1].min() - 1, stop = X_Set[:,
1].max() + 1, step = 0.01))
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(),
X2.ravel()]).T).reshape(X1.shape),
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(Y_Set)):
plt.scatter(X_Set[Y_Set == j, 0], X_Set[Y_Set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Support Vector Machine (Training set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

# Visualising the Test set results

from matplotlib.colors import ListedColormap

X_Set, Y_Set = X_Test, Y_Test
X1, X2 = np.meshgrid(np.arange(start = X_Set[:, 0].min() - 1, stop = X_Set[:,
0].max() + 1, step = 0.01),
np.arange(start = X_Set[:, 1].min() - 1, stop = X_Set[:,
1].max() + 1, step = 0.01))
plt.contourf(X1, X2, classifier.predict(np.array([X1.ravel(),
X2.ravel()]).T).reshape(X1.shape),
alpha = 0.75, cmap = ListedColormap(('red', 'green')))
plt.xlim(X1.min(), X1.max())
plt.ylim(X2.min(), X2.max())
for i, j in enumerate(np.unique(Y_Set)):
plt.scatter(X_Set[Y_Set == j, 0], X_Set[Y_Set == j, 1],
c = ListedColormap(('red', 'green'))(i), label = j)
plt.title('Support Vector Machine (Test set)')
plt.xlabel('Age')
plt.ylabel('Estimated Salary')
plt.legend()
plt.show()

Kakuro Cheat Sheet
100% (1)
Kakuro Cheat Sheet
1 page
SVM Guide for Data Scientists
No ratings yet
SVM Guide for Data Scientists
24 pages
SVM Algorithm Guide with Python Code
No ratings yet
SVM Algorithm Guide with Python Code
10 pages
SVM Guide for Data Scientists
No ratings yet
SVM Guide for Data Scientists
48 pages
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
No ratings yet
Presented By: M. Saqib Iqbal Gull Muhammad Presented To: Mr. Imran Ali Khan Artificial Intelligence National College of Bussiness Administration & Economics Multan
11 pages
SVM Basics for Computer Science Students
No ratings yet
SVM Basics for Computer Science Students
36 pages
Python Tuple: Exercise-1 With Solution: Write A Python Program To Create A Tuple
No ratings yet
Python Tuple: Exercise-1 With Solution: Write A Python Program To Create A Tuple
23 pages
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
No ratings yet
Homework 2: SVM, Kernel Methods, Ensemble Learning, Learning Theory
12 pages
SVM Guide: Concepts, Implementation, Tuning
No ratings yet
SVM Guide: Concepts, Implementation, Tuning
13 pages
Exp 5
No ratings yet
Exp 5
14 pages
4-To-1 MUX
No ratings yet
4-To-1 MUX
2 pages
SVM Classifier Techniques Guide
No ratings yet
SVM Classifier Techniques Guide
15 pages
This Is
No ratings yet
This Is
7 pages
B24 ML Exp-3
No ratings yet
B24 ML Exp-3
10 pages
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
No ratings yet
Title: Implement Support Vector Machine Classifier: Department of Computer Science and Engineering
5 pages
Support Vactor Machine Final
No ratings yet
Support Vactor Machine Final
11 pages
Prediction On Iris
No ratings yet
Prediction On Iris
14 pages
ML Unit 3
No ratings yet
ML Unit 3
14 pages
Machine Learning Algorithms in Bipedal Robot Control
No ratings yet
Machine Learning Algorithms in Bipedal Robot Control
16 pages
Deep Learning
No ratings yet
Deep Learning
25 pages
A Tableau-Based Theorem Proving Method For Intuitionistic Logic
No ratings yet
A Tableau-Based Theorem Proving Method For Intuitionistic Logic
8 pages
The Partial Differential Equation For The Blasius Equation
No ratings yet
The Partial Differential Equation For The Blasius Equation
11 pages
Karachi LTE1800 Model Tuning - Cluster Comparison
No ratings yet
Karachi LTE1800 Model Tuning - Cluster Comparison
18 pages
Aim of The Experiment-Software Required - Theory
No ratings yet
Aim of The Experiment-Software Required - Theory
6 pages
Pushdown Automata Pdas: Fall 2006 Costas Busch - RPI 1
No ratings yet
Pushdown Automata Pdas: Fall 2006 Costas Busch - RPI 1
79 pages
06 Support - Vector - Machine
No ratings yet
06 Support - Vector - Machine
8 pages
Chapter 3 - Reduction of Multiple Subsystems PDF
No ratings yet
Chapter 3 - Reduction of Multiple Subsystems PDF
28 pages
Advanced Control & Robotics Guide
No ratings yet
Advanced Control & Robotics Guide
23 pages
SVM Implementation
No ratings yet
SVM Implementation
8 pages
Confidence Intervals For The Difference Between Two Means With Tolerance Probability
No ratings yet
Confidence Intervals For The Difference Between Two Means With Tolerance Probability
10 pages
Simulating First Order Dynamical Systems Using Analog Computer
No ratings yet
Simulating First Order Dynamical Systems Using Analog Computer
12 pages
j2020 A Survey of The Usages of Deep Learning For Natural Language Processing
No ratings yet
j2020 A Survey of The Usages of Deep Learning For Natural Language Processing
21 pages
Support Vector Machine
No ratings yet
Support Vector Machine
52 pages
ML W8 Merged
No ratings yet
ML W8 Merged
27 pages
Quantum Computing
No ratings yet
Quantum Computing
20 pages
SVM Unit 2
No ratings yet
SVM Unit 2
12 pages
AP For NLP-LO2
No ratings yet
AP For NLP-LO2
38 pages
ML Practical 3
No ratings yet
ML Practical 3
5 pages
Lab Program (SVM From Scratch)
No ratings yet
Lab Program (SVM From Scratch)
2 pages
Support Vector Machine
100% (1)
Support Vector Machine
40 pages
(1-4) Vector Calculus-Differential Part
No ratings yet
(1-4) Vector Calculus-Differential Part
90 pages
ML Assignment-8
No ratings yet
ML Assignment-8
3 pages
OMScheduling PPT
No ratings yet
OMScheduling PPT
38 pages
Support Vector Machine
No ratings yet
Support Vector Machine
9 pages
Classification Review
No ratings yet
Classification Review
8 pages
Course Title: Fundamentals of Machine Learning Course Code: Group Assignment On
No ratings yet
Course Title: Fundamentals of Machine Learning Course Code: Group Assignment On
9 pages
Unit 3 Aam
No ratings yet
Unit 3 Aam
30 pages
Classification
No ratings yet
Classification
4 pages
Recommendation of Crop, Fertilizers and Crop Disease Detection System
No ratings yet
Recommendation of Crop, Fertilizers and Crop Disease Detection System
6 pages
TECHNICAL ASSESSMENT-batch4
No ratings yet
TECHNICAL ASSESSMENT-batch4
3 pages
Unit2 Notes What Is A Support Vector Machine
No ratings yet
Unit2 Notes What Is A Support Vector Machine
11 pages
01 Interpolation 1
No ratings yet
01 Interpolation 1
14 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
Svmdoc
No ratings yet
Svmdoc
7 pages
SVM Lab.7
No ratings yet
SVM Lab.7
4 pages
Unit 1,2,3
No ratings yet
Unit 1,2,3
17 pages
Integration by Substitution Guide
No ratings yet
Integration by Substitution Guide
35 pages
Support Vector Machine
No ratings yet
Support Vector Machine
9 pages
MLT 07
No ratings yet
MLT 07
8 pages
Detailed SVM Presentation
No ratings yet
Detailed SVM Presentation
15 pages
Dimensionality Reduction Guide
No ratings yet
Dimensionality Reduction Guide
79 pages
ML5&6&7&8&9&10
No ratings yet
ML5&6&7&8&9&10
35 pages
Probability & Statistics Course Overview
No ratings yet
Probability & Statistics Course Overview
48 pages
Optimal Monetary Policy - Lecture Notes
No ratings yet
Optimal Monetary Policy - Lecture Notes
6 pages
ML Lecture 14 SVM
No ratings yet
ML Lecture 14 SVM
15 pages
Unit - 2-1
No ratings yet
Unit - 2-1
7 pages
Unit - 2
No ratings yet
Unit - 2
15 pages
Purva Rawale Prcatical 4 BDA
No ratings yet
Purva Rawale Prcatical 4 BDA
6 pages
SVM Using Iris Dataset by Hyparlink
No ratings yet
SVM Using Iris Dataset by Hyparlink
19 pages
SVM
No ratings yet
SVM
11 pages
UNIT-II-Support Vector Machine Algorithm
No ratings yet
UNIT-II-Support Vector Machine Algorithm
13 pages
SVM7
No ratings yet
SVM7
53 pages
List of UL Mobility Courses Faculty of Mathematics and Computer Science 2024 25
No ratings yet
List of UL Mobility Courses Faculty of Mathematics and Computer Science 2024 25
2 pages
Algorithm Up To 7 Lectures
No ratings yet
Algorithm Up To 7 Lectures
13 pages
Support Vector Machine For Classification: Name: Saurav Doke Roll No: A-41 PRN: 2264191242040
No ratings yet
Support Vector Machine For Classification: Name: Saurav Doke Roll No: A-41 PRN: 2264191242040
3 pages
SVM Everything
No ratings yet
SVM Everything
5 pages
Unit 4 Iml Introduction To Machine Learning
No ratings yet
Unit 4 Iml Introduction To Machine Learning
25 pages
Niranjan Kumar Singh Encryption Algorithm
No ratings yet
Niranjan Kumar Singh Encryption Algorithm
3 pages
Support Vectors
No ratings yet
Support Vectors
7 pages
Classifying Data Using Support Vector Machines (SVMS) in Python
No ratings yet
Classifying Data Using Support Vector Machines (SVMS) in Python
5 pages
Backstepping Based Nonlinear Sensorless Control of Induction Motor System
No ratings yet
Backstepping Based Nonlinear Sensorless Control of Induction Motor System
8 pages
Emotion Classification Using ML and DL
No ratings yet
Emotion Classification Using ML and DL
8 pages
Da Pra Week 12 (SVM)
No ratings yet
Da Pra Week 12 (SVM)
15 pages
ML Exp 3 Part A
No ratings yet
ML Exp 3 Part A
7 pages
Regime Switching
No ratings yet
Regime Switching
7 pages
MODULE - 4 - PART 2 - Support Vector Machines
No ratings yet
MODULE - 4 - PART 2 - Support Vector Machines
6 pages
Third Year Engineering: Unit II: Supervised Machine Learning
No ratings yet
Third Year Engineering: Unit II: Supervised Machine Learning
11 pages
SVMM Practical
No ratings yet
SVMM Practical
2 pages

Assignment II Machine Learning

Uploaded by

Assignment II Machine Learning

Uploaded by

SCHOOL OF TECHNOLOGY

BACHELOR OF INFORMATION SECURITY AND FORENSICS &

#calculate specific columns

#calculate specific rows

df_excel['average'] = (df_excel['math score'] + df_excel['reading score']

values = ['A', 'B', 'C', 'D', 'E', 'F']

# show first 5 rows

# Support Vector Machine

# Importing the datasets

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler

# Fitting the classifier into the Training set

from sklearn.svm import SVC

# Predicting the test set results

# Making the Confusion Matrix

from sklearn.metrics import confusion_matrix

# Visualising the Training set results

from matplotlib.colors import ListedColormap

# Visualising the Test set results

from matplotlib.colors import ListedColormap

You might also like