ML notes

Uploaded by

Hajra bibi


Introduction
Machine learning is a subfield of artificial intelligence,
which is broadly defined as the capability of a machine to
simulate intelligent human behavior and to perform complex
tasks in a way that is similar to how humans solve
problems.
To understand machine learning, you need to know the
algorithms that drive both its opportunities and its
limitations.
Machine learning algorithms are used in a wide range of
applications, such as fraud detection, computer vision,
autonomous vehicles, and predictive analytics, where it is
not feasible to develop conventional algorithms that meet
the real-time and predictive requirements of the work.
There are three basic functions of machine learning
algorithms –
Descriptive – Explaining with the help of data
Predictive – Predicting with the help of data
Prescriptive – Suggesting with the help of data

Types of Machine Learning Algorithms
In the field of machine learning, there are multiple
algorithms that help us reach descriptive, predictive, and
prescriptive results based on the parameters defined. The
following sections describe these algorithms.

Supervised Learning
Machines are taught by example in supervised learning.
An operator provides the machine learning algorithm with
a known dataset containing inputs and their desired
outputs, and the algorithm must determine how to arrive
at those outputs from the inputs. While the operator knows
the correct answers to the problem, the algorithm
identifies patterns in the data, makes predictions based on
its observations, and learns from them. The algorithm
makes predictions and is corrected by the operator until
it achieves a high level of accuracy/performance.
Under the umbrella of supervised learning fall:
Classification: In classification tasks, observed values are
used to draw conclusions about new observations and
determine which category they belong to. For example,
when a program filters emails as ‘spam’ or ‘not spam’, it
must analyze existing observational data to decide which
of the two classes a new email belongs to.
Regression: In regression tasks, the learning machine
must estimate and understand the relationships between
variables in a system, focusing on one dependent variable
and a number of other variables that change. Regression
analysis is particularly useful for forecasting and
prediction.

10. Confusion Matrix

The confusion matrix is a tool used to evaluate the performance of classification models. It provides
a detailed breakdown of prediction results, including:

True Positives (TP): Correct positive predictions.

True Negatives (TN): Correct negative predictions.

False Positives (FP): Negative cases incorrectly predicted as positive.

False Negatives (FN): Positive cases incorrectly predicted as negative.

Metrics like accuracy, precision, recall, and F1-score are derived from the confusion matrix to assess
a model’s performance.
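
As a minimal sketch of how these metrics follow from the four counts, the example below tallies TP, TN, FP, and FN for a binary problem and derives accuracy, precision, recall, and F1-score from them. The label lists and the function names are made up for illustration and are not part of the original notes.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, TN, FP, FN for a binary classification problem."""
    tp = tn = fp = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1   # predicted positive, actually positive
            else:
                fp += 1   # predicted positive, actually negative
        else:
            if actual == positive:
                fn += 1   # predicted negative, actually positive
            else:
                tn += 1   # predicted negative, actually negative
    return tp, tn, fp, fn


def metrics(tp, tn, fp, fn):
    """Derive the usual metrics; a zero denominator yields 0.0."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical true labels and model predictions (1 = positive class)
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
scores = metrics(tp, tn, fp, fn)
print(tp, tn, fp, fn)  # prints: 3 3 1 1
print(scores)
```

With these toy labels all four metrics come out to 0.75, because there is exactly one false positive and one false negative.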

These topics represent the foundational elements of machine learning. Understanding them equips
individuals with the knowledge to build models that address real-world challenges and contribute to
advancements in various domains.

Unsupervised Learning
In unsupervised learning, the machine learning algorithm
identifies patterns without an answer key or an operator
to provide instructions. Instead, the machine analyzes the
available data in order to determine correlations and
relationships. It is left up to the algorithm to interpret
large data sets and address them accordingly. The
algorithm tries to organize the data in a manner that
describes the data’s structure. The data might be grouped
into clusters or arranged in a more organized manner.
As it assesses more data, its ability to make decisions on
that data gradually improves and becomes more refined.
The following fall under the unsupervised learning
category:
Clustering: A clustering technique groups similar data
(based on defined criteria). It is useful for segmenting
data and finding patterns in each group.

Association rule: Discovering relationships between
seemingly independent databases or other data
repositories through association rules.

Decision Tree Algorithms

A decision tree builds a model of decisions based on the
attribute values of the data. The decisions keep branching
out until a prediction is made for the given record based
on its attribute values.
Decision trees are commonly used for classification and
regression problems because they are generally fast to
train and often accurate.
Some of the commonly used decision tree algorithms are:
Classification and Regression Tree (CART)
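
To illustrate how a single branching decision can be chosen, here is a rough sketch of the kind of split search a CART-style classification tree performs at one node: it tries candidate thresholds on one attribute and keeps the one with the lowest weighted Gini impurity. The attribute (`ages`), the labels (`buys`), and the helper names are hypothetical, and a full tree would repeat this search recursively on each branch.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 means pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    impurity = 1.0
    for c in set(labels):
        p = labels.count(c) / n
        impurity -= p * p
    return impurity


def best_split(values, labels):
    """Return (threshold, weighted_gini) for the best 'value <= threshold' split."""
    best = (None, float("inf"))
    n = len(labels)
    candidates = sorted(set(values))
    # Candidate thresholds are midpoints between consecutive distinct values
    for lo, hi in zip(candidates, candidates[1:]):
        threshold = (lo + hi) / 2
        left = [l for v, l in zip(values, labels) if v <= threshold]
        right = [l for v, l in zip(values, labels) if v > threshold]
        score = len(left) / n * gini(left) + len(right) / n * gini(right)
        if score < best[1]:
            best = (threshold, score)
    return best


# Hypothetical records: customer age and whether they bought a product
ages = [22, 25, 30, 35, 40, 52]
buys = [0, 0, 0, 1, 1, 1]

threshold, score = best_split(ages, buys)
print(threshold, score)  # prints: 32.5 0.0
```

Here the split "age <= 32.5" separates the two classes perfectly, so both branches are pure and the weighted impurity is zero.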

Clustering based Algorithms

Clustering is a machine learning technique that assigns
each data point in a dataset to a specific group. To group
a data set in this way, we need to use a clustering
algorithm. A clustering algorithm falls under the umbrella
of unsupervised learning algorithms and rests on the
assumption that data points that belong to the same group
have similar properties.
Some commonly used Clustering Algorithms are:
1. K-means clustering
2. Mean-shift clustering
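
As an illustration of the first algorithm in the list, the sketch below implements the basic K-means loop in plain Python: assign each point to its nearest centroid, then move each centroid to the mean of its assigned points, and repeat until nothing changes. The 2-D points and the fixed starting centroids are invented for the example; practical implementations usually pick the starting centroids randomly or with a smarter seeding scheme.

```python
def kmeans(points, centroids, iterations=10):
    """Basic K-means on 2-D points, starting from the given centroids."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2
                         for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its cluster
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                new_centroids.append((
                    sum(p[0] for p in cluster) / len(cluster),
                    sum(p[1] for p in cluster) / len(cluster),
                ))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return centroids, clusters


# Hypothetical data: two visibly separate groups of points
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(1, 1), (8, 8)])
print(centroids)
print(clusters)
```

On this toy data the loop converges after two passes, with three points in each cluster.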

Association Rule
Association rule learning is a rule-based machine learning
method that is useful for discovering relationships
between different features in a large dataset.
It finds patterns in data such as:

{Bread} => [Milk] | [Jam]

{Soda} => [Chips]

As shown in the above figure, given the sale of items, a
customer who buys bread also tends to buy milk or jam.
The same happens with soda, where the customer buys
chips along with it.
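
Rules like these are usually judged by their support (how often the items appear together) and confidence (how often the right-hand side appears when the left-hand side does). The notes do not define these measures, so the sketch below is an added illustration using a made-up list of shopping baskets.

```python
# Hypothetical shopping baskets, one set of items per transaction
transactions = [
    {"bread", "milk"},
    {"bread", "jam"},
    {"bread", "milk", "jam"},
    {"soda", "chips"},
    {"soda", "chips", "bread"},
    {"milk"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Of the transactions containing antecedent, the fraction that also contain consequent."""
    return (support(antecedent | consequent, transactions)
            / support(antecedent, transactions))


bread_milk = confidence({"bread"}, {"milk"}, transactions)
soda_chips = confidence({"soda"}, {"chips"}, transactions)
print(round(bread_milk, 2), round(soda_chips, 2))  # prints: 0.5 1.0
```

In this toy data every basket with soda also contains chips, so that rule has confidence 1.0, while only half of the bread baskets contain milk.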

Regression Algorithms
As the name implies, regression analysis is designed to
estimate the relationship between one or more independent
variables (features) and a dependent variable (label).
Linear regression is the most widely used method in
regression analysis.
Some of the most commonly used Regression Algorithms
are:
1. Linear Regression
2. Logistic Regression (despite its name, used mainly for classification)
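
For the simplest case of one feature and one label, linear regression can be fitted with the ordinary least-squares formulas for slope and intercept. The sketch below shows that calculation in plain Python; the data points are invented so that they lie exactly on the line y = 2x + 1.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical feature (xs) and label (ys) values
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

slope, intercept = fit_line(xs, ys)
print(slope, intercept)  # prints: 2.0 1.0

# Use the fitted line to predict the label for a new feature value
prediction = slope * 6 + intercept
```

Once the line is fitted, predicting for a new input is a single multiplication and addition, which is why regression is convenient for forecasting.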

Support Vector Machine

A Support Vector Machine (SVM) is a supervised machine
learning algorithm that can be used for both classification and regression
tasks, although it is best suited for classification. The primary objective
of the SVM algorithm is to identify the optimal hyperplane in an
N-dimensional space that separates data points into different classes in
the feature space. The algorithm maximizes the margin between the
hyperplane and the closest points of each class, which are known as
support vectors.
The dimension of the hyperplane depends on the number of features. For
instance, if there are two input features, the hyperplane is simply a line,
and if there are three input features, the hyperplane becomes a 2-D plane.
As the number of features increases beyond three, the complexity of
visualizing the hyperplane also increases.

Consider two independent variables, x1 and x2, and one dependent
variable represented as either a blue circle or a red circle.
• In this scenario, the hyperplane is a line because we are working with
two features (x1 and x2).
• There are multiple lines (or hyperplanes) that can separate the data
points.
• The challenge is to determine the best hyperplane that maximizes the
separation margin between the red and blue circles.

From the figure above it is clear that there are multiple lines (our
hyperplane here is a line because we are considering only two input
features, x1 and x2) that segregate our data points, that is, classify
them as red or blue circles.
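
To make the idea of "the best line" concrete, the sketch below compares two candidate separating lines by their margin, meaning the distance from the line to the closest data point. It does not train an SVM; it only shows the quantity an SVM tries to maximize. The points, the labels (+1 and -1 standing in for the two colors), and both candidate lines are made up for illustration.

```python
import math


def geometric_margin(w, b, points, labels):
    """Smallest signed distance from the line w.x + b = 0 to any point.

    The result is positive only if every point lies on its correct side.
    """
    norm = math.hypot(w[0], w[1])
    return min(
        y * (w[0] * x1 + w[1] * x2 + b) / norm
        for (x1, x2), y in zip(points, labels)
    )


# Hypothetical data: label -1 for one color, +1 for the other
points = [(1, 1), (2, 1), (1, 2), (4, 4), (5, 4), (4, 5)]
labels = [-1, -1, -1, 1, 1, 1]

# Two candidate lines that both separate the classes
margin_a = geometric_margin((1.0, 1.0), -5.5, points, labels)  # x1 + x2 = 5.5
margin_b = geometric_margin((1.0, 0.0), -3.0, points, labels)  # x1 = 3

print(round(margin_a, 3), round(margin_b, 3))  # prints: 1.768 1.0
```

Both lines classify every point correctly, but the first leaves more room on either side, so a margin-maximizing method such as SVM would prefer it over the second.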

What is Classification in Machine Learning?

Classification is a supervised machine learning method where the model tries to
predict the correct label of a given input. In classification, the model is first
trained using the training data, and then it is evaluated on test data before being
used to make predictions on new, unseen data.

For instance, an algorithm can learn to predict whether a given email is spam or ham
(not spam), as shown below.

KNN
KNN (K-nearest neighbors) is a simple, supervised machine learning (ML)
algorithm that can be used for classification or regression tasks, and is
also frequently used in missing value imputation. It is based on the idea
that the observations closest to a given data point are the most "similar"
observations in a data set, and we can therefore classify unseen points
based on the values of the closest existing points. By choosing K, the user
selects the number of nearby observations to use in the algorithm.

Here, we will show you how to implement the KNN algorithm for
classification.

Example
Start by visualizing some data points:

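import matplotlib.pyplot as plt

# Feature values and class labels (0 or 1) for ten sample points
x = [4, 5, 10, 4, 3, 11, 14, 8, 10, 12]
y = [21, 19, 24, 17, 16, 25, 24, 22, 21, 21]
classes = [0, 0, 1, 0, 0, 1, 1, 0, 1, 1]

# Draw the points, colored by class
plt.scatter(x, y, c=classes)
plt.show()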
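
The classification step itself can be sketched in plain Python using the same data points: measure the distance from a new point to every known point, take the K closest, and let them vote. This is only an illustrative version; the function name `knn_predict` and the two query points are assumptions, and a library implementation may break distance ties differently.

```python
import math
from collections import Counter

# Same sample data as in the scatter plot example
x = [4, 5, 10, 4, 3, 11, 14, 8, 10, 12]
y = [21, 19, 24, 17, 16, 25, 24, 22, 21, 21]
classes = [0, 0, 1, 0, 0, 1, 1, 0, 1, 1]
points = list(zip(x, y))


def knn_predict(train_points, train_labels, query, k):
    """Predict the class of query by majority vote among its k nearest points."""
    # Pair each Euclidean distance with its label and sort nearest first
    distances = sorted(
        (math.dist(p, query), label)
        for p, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]


# Two hypothetical new points to classify with K = 3
pred_a = knn_predict(points, classes, (9, 23), k=3)
pred_b = knn_predict(points, classes, (5, 18), k=3)
print(pred_a, pred_b)  # prints: 1 0
```

The point (9, 23) sits among the class 1 observations and (5, 18) among the class 0 observations, so the votes follow the neighborhoods. Choosing an odd K for a two-class problem avoids tied votes.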
