
EVALUATION

Introduction
• Evaluation is the fifth stage in the AI project cycle.
• In the Modelling stage, we can build different types of models.
• How do we check if one is better than another?
• That’s where Evaluation comes into play.
• In the Evaluation stage, we will explore different
methods of evaluating an AI model.
• Model Evaluation is an integral part of the model
development process.
• It helps us find the model that best represents our
data and tells us how well the chosen model will work in
the future.
What is evaluation?
• Evaluation is the process of understanding
the reliability of an AI model: we feed a test
dataset into the model and compare its
outputs with the actual answers.
Evaluation
• There can be different Evaluation techniques,
depending on the type and purpose of the model.
• Remember that it is not recommended to use the
data we used to build the model to evaluate it.
• This is because our model will simply remember
the whole training set, and will therefore always
predict the correct label for any point in the
training set. This is known as overfitting.
• Let us go through various terms which are very
important to the evaluation process.
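The point above, that the model should be evaluated on data it has not seen during training, can be sketched as a simple train/test split. This is an illustrative example, not part of the original slides; the data here is a hypothetical list of 100 labelled observations.

```python
import random

def train_test_split(data, test_fraction=0.2, seed=42):
    """Shuffle the data and hold out a fraction for evaluation."""
    rng = random.Random(seed)
    shuffled = data[:]          # copy so the original list is untouched
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_fraction))
    return shuffled[:cut], shuffled[cut:]  # (training set, test set)

# 100 hypothetical labelled observations
observations = list(range(100))
train, test = train_test_split(observations)
print(len(train), len(test))  # 80 20
```

The model is built only on `train`; the held-out `test` portion is what the Evaluation stage feeds into the model.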
Model Evaluation Terminologies
• There are various new terminologies which come into
the picture when we evaluate our model. Let’s
explore them with the example of a forest fire scenario.
• The Scenario
Imagine that you have come up with an AI based
prediction model which has been deployed in a forest
which is prone to forest fires. Now, the objective of the
model is to predict whether a forest fire has broken out
in the forest or not. Now, to understand the efficiency of
this model, we need to check if the predictions which it
makes are correct or not. Thus, there exist two
conditions which we need to ponder upon: Prediction
and Reality. The prediction is the output which is given
by the machine and the reality is the real scenario in the
forest when the prediction has been made. Now let us
look at various combinations that we can have with these
two conditions.
Case 1: Is there a forest fire?
• Here, a forest fire has broken out in the forest.
The model predicts a Yes, which means there is a forest
fire. The Prediction matches the Reality. Hence, this
condition is termed a True Positive.
Case 2: Is there a forest fire?
• Here there is no fire in the forest, hence the reality is
No. In this case, the machine too has predicted it
correctly as a No. Therefore, this condition is termed
a True Negative.
Case 3: Is there a forest fire?
• Here the reality is that there is no forest fire, but the
machine has incorrectly predicted that there is a
forest fire. This case is termed a False Positive.
Case 4: Is there a forest fire?
• Here, a forest fire has broken out in the forest, because of
which the Reality is Yes, but the machine has incorrectly
predicted it as a No, which means the machine predicts
that there is no forest fire. Therefore, this case becomes
a False Negative.
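The four cases above can be captured in a small function. This is a minimal sketch, not from the original slides, using "Yes"/"No" strings for Prediction and Reality as in the forest fire example.

```python
def outcome(prediction, reality):
    """Classify one prediction against reality ("Yes"/"No")."""
    if prediction == "Yes" and reality == "Yes":
        return "True Positive"   # Case 1: fire predicted, fire real
    if prediction == "No" and reality == "No":
        return "True Negative"   # Case 2: no fire predicted, none real
    if prediction == "Yes" and reality == "No":
        return "False Positive"  # Case 3: false alarm
    return "False Negative"      # Case 4: missed fire

print(outcome("No", "Yes"))  # False Negative
```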
Confusion matrix
• The result of comparison between the prediction
and reality can be recorded in what we call the
confusion matrix.
• The confusion matrix allows us to understand
the prediction results.
• Note that the confusion matrix is not an
evaluation metric itself but a record which can
help in evaluation.
Confusion matrix
Let us once again take a look at the four conditions
that we went through in the Forest Fire example,
arranged as a confusion matrix:

                 Prediction: Yes    Prediction: No
Reality: Yes     True Positive      False Negative
Reality: No      False Positive     True Negative

Prediction and Reality can be easily mapped
together with the help of this confusion matrix.
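Building such a record from lists of predictions and realities can be sketched as follows. This example is illustrative and not part of the original slides; the two four-element lists are hypothetical.

```python
from collections import Counter

def confusion_matrix(predictions, realities):
    """Count TP, TN, FP and FN from parallel lists of "Yes"/"No" labels."""
    counts = Counter()
    for p, r in zip(predictions, realities):
        if p == "Yes" and r == "Yes":
            counts["TP"] += 1
        elif p == "No" and r == "No":
            counts["TN"] += 1
        elif p == "Yes" and r == "No":
            counts["FP"] += 1
        else:  # p == "No" and r == "Yes"
            counts["FN"] += 1
    return counts

preds = ["Yes", "No", "Yes", "No"]
reals = ["Yes", "No", "No", "Yes"]
print(confusion_matrix(preds, reals))  # TP=1, TN=1, FP=1, FN=1
```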
Evaluation Methods
• Now as we have gone through all the possible
combinations of Prediction and Reality, let us see
how we can use these conditions to evaluate the
model.
• Accuracy
Accuracy is defined as the percentage of correct
predictions out of all the observations. A prediction
can be said to be correct if it matches the reality.
Here, we have two conditions in which the
Prediction matches with the Reality: True Positive
and True Negative.
Formula for Accuracy of a model

Accuracy = (TP + TN) / (TP + TN + FP + FN) × 100%

• Here, total observations cover all the possible cases
of prediction: True Positive (TP), True
Negative (TN), False Positive (FP) and False
Negative (FN).
Accuracy as an evaluation technique
• As we can see, Accuracy tells us how often a model's
predictions are correct.
• Is high accuracy equivalent to good
performance?
• What percentage of accuracy is reasonable
to indicate good performance?
Analysing Accuracy
• Let us go back to the Forest Fire example.
• Assume that the model always predicts that
there is no fire.
• But in reality, there is a 2% chance of a forest fire
breaking out.
• In this case, the model will be right for 98 out of
100 cases, but for the 2 cases in which a forest
fire actually broke out, the model still predicted no fire.
• Here,
True Positives = 0
True Negatives = 98
False Positives = 0
False Negatives = 2
Total cases = 100
Therefore, accuracy becomes:
(98 + 0) / 100 = 98%

This is a fairly high accuracy for an AI model. But


this parameter is useless for us as the actual cases
where the fire broke out are not taken into account.
Hence, there is a need to look at another parameter
which takes account of such cases as well.
Accuracy alone is not enough
Precision
• Precision is defined as the percentage of true
positive cases out of all the cases where the
prediction is positive (Yes).
• That is, it takes into account the True Positives
and False Positives:

Precision = TP / (TP + FP) × 100%
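This definition can be sketched as a small function. The example numbers are illustrative, taken from the scenario discussed in these slides: a model that always predicts fire over 100 days with 2 real fires has TP = 2 and FP = 98.

```python
def precision(tp, fp):
    """Percentage of positive predictions that were actually correct."""
    return tp * 100 / (tp + fp)

# "Always predict fire" over 100 days with 2 real fires:
# TP = 2, FP = 98
print(precision(tp=2, fp=98))  # 2.0
```

A precision of 2% means that 98 out of every 100 alarms are false, the situation the firefighter example below describes.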
Precision as an evaluation technique
• Going back to the Forest Fire example, assume now
that the model always predicts that there is a forest fire,
irrespective of the reality.
• In this case, all the Positive conditions are taken into
account, that is, True Positive (Prediction = Yes and Reality =
Yes) and False Positive (Prediction = Yes and Reality = No).
The firefighters will check for a fire every time
to see whether the alarm was true or false.
• You might recall the story of the boy who cried wolf: he
falsely cried out that wolves had come so many times that
when the wolves actually arrived, no one came to his rescue.
• Similarly, here if the Precision is low (which means there are
more false alarms than actual fires), then the firefighters
would get complacent and might not go and check every time,
considering it could be a false alarm.
Precision as an evaluation technique
• This makes Precision an important evaluation
criterion. If Precision is high, most positive
predictions are True Positives, giving fewer false
alarms.
• But again, is good Precision equivalent to good
model performance? Why?
Drawback of precision
• Let us consider that a model
has 100% precision, which
means that whenever the
machine says there’s a fire,
there actually is a fire (True
Positive).
• In the same model, there can
be a rare exceptional case
where there was an actual fire but
the system could not detect it.
This is a False
Negative condition.
• But the precision value would
not be affected by it, because
precision does not take FN into account.
Is precision then a good
parameter for model
performance? We will look at
a few more metrics and
decide.
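The drawback above can be made concrete with a small hypothetical comparison, not from the original slides: two models with the same True Positives and no false alarms, one of which misses several real fires. Precision cannot tell them apart because False Negatives do not appear in its formula.

```python
def precision(tp, fp):
    """Percentage of positive predictions that were actually correct."""
    return tp * 100 / (tp + fp)

# Model A: 10 correct alarms, 0 false alarms, 0 missed fires (FN = 0)
# Model B: 10 correct alarms, 0 false alarms, 5 missed fires (FN = 5)
# FN never enters the formula, so both models score the same:
print(precision(tp=10, fp=0))  # 100.0 for both A and B
```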
Difference between Accuracy and Precision
• Accuracy is how close a value is to its true value.
An example is how close an arrow gets to the
bull's-eye center. Precision is how repeatable a
measurement is. An example is how close a
second arrow is to the first one (regardless of
whether either is near the mark).

• In a set of measurements, accuracy is closeness


of the measurements to a specific value, while
precision is the closeness of the measurements
to each other.
