Evaluation 1
Evaluation 1
INTRODUCTION
Introduction
• Evaluation is the fifth stage in the AI project cycle.
• In modelling, we can make different types of models
• How do we check if one is better than the other?
• That’s where Evaluation comes into play.
• In the Evaluation stage, we will explore different
methods of evaluating an AI model.
• Model Evaluation is an integral part of the model
development process.
• It helps to find the best model that represents our
data and how well the chosen model will work in the
future
What is evaluation?
• Evaluation is the process of understanding
the reliability of any AI model, based on
outputs by feeding test dataset into the
model and comparing with actual
answers.
Evaluation
• There can be different Evaluation techniques,
depending of the type and purpose of the model.
• Remember that it’s not recommended to use the
data we used to build the model to evaluate it.
• This is because our model will simply remember
the whole training set, and will therefore always
predict the correct label for any point in the
training set. This is known as overfitting.
• Let us go through various terms which are very
important to the evaluation process
Model Evaluation Terminologies
• There are various new terminologies which come into
the picture when we work on evaluating our model. Let’s
explore them with an example of the Forest fire scenario.
• The Scenario
Imagine that you have come up with an AI based
prediction model which has been deployed in a forest
which is prone to forest fires. Now, the objective of the
model is to predict whether a forest fire has broken out
in the forest or not. Now, to understand the efficiency of
this model, we need to check if the predictions which it
makes are correct or not. Thus, there exist two
conditions which we need to ponder upon: Prediction
and Reality. The prediction is the output which is given
by the machine and the reality is the real scenario in the
forest when the prediction has been made. Now let us
look at various combinations that we can have with these
two conditions.
Case 1: Is there a forest fire?