Skip to content

Question: CH04 - Is the test set part of the train set? #16

@dominicdill

Description

@dominicdill

I am confused by the use of train and test sets for these time series models.

Here you setup your entire dataset and split it into test and train sets.
image
image

It seems like you would want to evaluate your model by seeing how well it performs on the test set using only the train set. But in the notebook the test set is used when evaluating the model.

image

image

To actually see how well the model predicts into the future, wouldn't we instead want to set the df parameter of the rolling_forcast function to train and then add lines to that function that extend the supplied df with the predictions?

Currently, because df_dff (which holds both train and test data) is used, you're not truly seeing how well your models predict into the future.

Am I misunderstanding? Is this what we want to be doing?

I changed the code to do as I think I'm expecting:

image

And I see this:
image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions