Rolling Cross-validation for Time-series

To properly handle time-series (and time-dependent data in general), we should implement a Rolling Cross-validation to add to our existing CV & TrainTest modes.

We are currently merging various time-series functionality from the internal repo to this repo via https://github.com/dotnet/machinelearning/pull/977 _"Port Time Series"_. This PR does not include a rolling cross-validation, used heavily in time-series tasks.

Rolling CV is better for time dependent datasets by always testing on data which is _newer_ than the training data. Standard CV leaks future data in to the training set. Other names of Rolling CV include { walk-forward / roll-forward / rolling origin / window } CV.

Background on method:
http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.TimeSeriesSplit.html
https://otexts.org/fpp2/accuracy.html#time-series-cross-validation
https://stats.stackexchange.com/questions/14099/using-k-fold-cross-validation-for-time-series-model-selection
https://robjhyndman.com/hyndsight/tscv/
https://www.kaggle.com/c/recruit-restaurant-visitor-forecasting/discussion/46602
https://towardsdatascience.com/time-series-nested-cross-validation-76adba623eb9


To further investigate missing time-series components, the [Azure ML Forecasting Toolkit](https://docs.microsoft.com/en-us/python/api/azuremlftk/) is a good package listing components needed for this task:  
* Metrics: [MAPE, MASE_single_grain, SMAPE](https://docs.microsoft.com/en-us/python/api/azuremlftk/ftk.metrics.metrics?view=azure-forecasting-py)
* Models: [ARIMA, etc](https://docs.microsoft.com/en-us/python/api/azuremlftk/ftk.models?view=azure-forecasting-py)
* etc



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Rolling Cross-validation for Time-series #1026

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Rolling Cross-validation for Time-series #1026

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions