SVM Notes: Unit 4
One reasonable choice for the best hyperplane is the one that represents the
largest separation, or margin, between the two classes.
So we choose the hyperplane whose distance to the nearest data point on
each side is maximized. If such a hyperplane exists, it is known as
the maximum-margin hyperplane (hard margin). So, from the figure above, we
choose L2. Now let's consider a scenario like the one shown below.
Figure: Selecting a hyperplane for data with an outlier
Here we have one blue ball lying within the region of the red balls. So how does SVM
classify this data? It's simple! The blue ball among the red ones is an
outlier of the blue class. The SVM algorithm has the characteristic of ignoring such
outliers and finding the best hyperplane that maximizes the margin; in this sense, SVM is
robust to outliers.
For this type of data, SVM finds the maximum margin as it did for the previous
data sets, and in addition it adds a penalty each time a point crosses
the margin. The margins in such cases are called soft margins. When
there is a soft margin, SVM tries to
minimize (1/margin + λ·(∑penalty)). Hinge loss is a commonly used penalty: if there are
no violations, there is no hinge loss; if there are violations, the hinge loss is
proportional to the distance of the violation.
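Concretely, since the margin width is inversely proportional to ||w||, this objective can be written in one common form (writing the class label of the ith point as t_i ∈ {+1, −1}) as:

minimize over w, b:   (1/2)·||w||^2 + λ·∑_i max(0, 1 − t_i·(w^T x_i + b))

The term max(0, 1 − t_i·(w^T x_i + b)) is the hinge loss of the ith point: it is zero when the point lies on the correct side of the margin, and it grows linearly with the distance by which the point violates the margin.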
Till now, we were talking about linearly separable data (the group of blue balls
and the group of red balls are separable by a straight line/linear boundary). What do we do
if the data are not linearly separable?
Say our data is as shown in the figure above. SVM solves this by creating a new
variable using a kernel. We call a point on the line x_i, and we create a new variable
y_i as a function of its distance from the origin o. So if we plot this, we get something
like what is shown below.
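As an illustration of this idea (a minimal sketch; the specific points and the choice of y_i = x_i^2 as the function of distance from the origin are only for illustration):

import numpy as np

# One-dimensional data that is not linearly separable:
# the "red" points surround the "blue" points on the line.
x_red = np.array([-3.0, -2.5, 2.5, 3.0])
x_blue = np.array([-1.0, -0.5, 0.5, 1.0])

# Create a new variable y_i as a function of the distance from the origin;
# here we simply use the squared distance, y_i = x_i**2.
y_red = x_red ** 2
y_blue = x_blue ** 2

# In the new (x, y) plane the two classes can be separated by the
# horizontal line y = 2, even though no single threshold on x works.
print(np.all(y_red > 2), np.all(y_blue < 2))   # True True

Plotting (x_i, y_i) for each point gives the kind of picture described above: the two classes become separable by a straight line in the new space.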
Consider a binary classification problem with two classes, labeled as +1 and -1.
We have a training dataset consisting of input feature vectors X and their
corresponding class labels Y.
The equation for the linear hyperplane can be written as:
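w^T x + b = 0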
The vector w represents the normal vector to the hyperplane, i.e. the direction
perpendicular to the hyperplane. The parameter b in the equation represents the
offset, or the distance of the hyperplane from the origin along the normal vector w.
The distance between a data point x_i and the decision boundary can be
calculated as:
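d_i = (w^T x_i + b) / ||w||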
where ||w|| represents the Euclidean norm of the weight vector w, i.e. the Euclidean
norm of the normal vector to the hyperplane.
For the linear SVM classifier, the predicted label ŷ for an input x is:
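ŷ = 1 if w^T x + b ≥ 0, and ŷ = 0 if w^T x + b < 0

(This uses the 0/1 label convention referred to below, where y_i = 1 for positive instances and y_i = 0 for negative instances.)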
Optimization:
● For Hard margin linear SVM classifier:
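Maximizing the margin 2/||w|| is equivalent to minimizing (1/2)·||w||^2, so the hard-margin problem is

minimize over w, b:   (1/2)·||w||^2
subject to:   t_i·(w^T x_i + b) ≥ 1   for every training instance i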
The target variable or label for the ith training instance is denoted by the symbol
t_i in this statement, with t_i = −1 for negative instances (when y_i = 0) and
t_i = 1 for positive instances (when y_i = 1). This is because we require a decision
boundary that satisfies the constraint t_i·(w^T x_i + b) ≥ 1 for every training point.
● For Soft margin linear SVM classifier:
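minimize over w, b, ζ:   (1/2)·||w||^2 + C·∑_i ζ_i
subject to:   t_i·(w^T x_i + b) ≥ 1 − ζ_i  and  ζ_i ≥ 0   for all i

Here C is the regularization parameter that trades off a wide margin against margin violations, and the slack variables ζ_i measure how far each point falls inside or beyond the margin. In practice this problem is solved through its dual formulation: the Lagrange multipliers α_i are chosen to maximize the dual objective

∑_i α_i − (1/2)·∑_i ∑_j α_i α_j t_i t_j K(x_i, x_j)   subject to   ∑_i α_i t_i = 0  and  0 ≤ α_i ≤ C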
where,
● αi is the Lagrange multiplier associated with the ith training sample.
● K(xi, xj) is the kernel function that computes the similarity between two
samples xi and xj. It allows SVM to handle nonlinear classification problems by
implicitly mapping the samples into a higher-dimensional feature space.
● The term ∑αi represents the sum of all Lagrange multipliers.
Once the dual problem has been solved and the optimal Lagrange multipliers have been
found, the SVM decision boundary can be described in terms of these multipliers and
the support vectors. The support vectors are the training samples with α_i > 0, and the
decision function is given by:
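f(x) = ∑_{i ∈ S} α_i t_i K(x_i, x) + b

where S is the set of support vectors; a new point x is assigned to class +1 if f(x) ≥ 0 and to class −1 otherwise.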
Based on the nature of the decision boundary, Support Vector Machines (SVMs)
can be divided into two main types:
● Linear SVM: Linear SVMs use a linear decision boundary to separate the data
points of different classes. When the data can be precisely linearly separated,
linear SVMs are very suitable. This means that a single straight line (in 2D) or
a hyperplane (in higher dimensions) can entirely divide the data points into
their respective classes. A hyperplane that maximizes the margin between the
classes is the decision boundary.
● Non-Linear SVM: Non-Linear SVM can be used to classify data when it cannot
be separated into two classes by a straight line (in the case of 2D). By using
kernel functions, nonlinear SVMs can handle nonlinearly separable data. The
original input data is transformed by these kernel functions into a
higher-dimensional feature space, where the data points can be linearly
separated. A linear SVM is then used in this transformed space, and the linear
boundary it finds corresponds to a nonlinear decision boundary in the original input space.
The SVM kernel is a function that takes a low-dimensional input space and
transforms it into a higher-dimensional space, i.e. it converts non-separable
problems into separable problems. It is mostly useful in non-linear separation
problems. Simply put, the kernel performs some extremely complex data
transformations and then finds the process to separate the data based on the
labels or outputs defined.
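As a minimal sketch of this in practice (the toy data set and the parameter values below are illustrative only, not taken from these notes), a non-linear SVM with an RBF kernel can be trained with scikit-learn as follows:

import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Toy data: one class clustered near the origin, the other on a surrounding ring,
# so no straight line in the 2D plane separates the two classes.
X_inner = rng.normal(scale=0.5, size=(50, 2))
angles = rng.uniform(0, 2 * np.pi, size=50)
X_outer = np.column_stack([3 * np.cos(angles), 3 * np.sin(angles)]) + rng.normal(scale=0.2, size=(50, 2))
X = np.vstack([X_inner, X_outer])
y = np.array([0] * 50 + [1] * 50)

# The RBF (Gaussian) kernel implicitly maps the points into a higher-dimensional
# space where a linear separator exists; C is the soft-margin penalty parameter.
clf = SVC(kernel="rbf", C=1.0, gamma="scale")
clf.fit(X, y)

print("training accuracy:", clf.score(X, y))
print("support vectors per class:", clf.n_support_)

Choosing the kernel (RBF, polynomial, sigmoid, or a custom one) is what lets the same linear-margin machinery handle nonlinearly separable data.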
Advantages of SVM
● Effective in high-dimensional cases.
● It is memory efficient, as it uses only a subset of the training points (the
support vectors) in the decision function.
● Different kernel functions can be specified for the decision function, and it is
possible to specify custom kernels.