Feature selection/preprocessing before EBM fitting to speed up fitting?

I’m using tsfresh to generate tabular data from my time series. I have 3 channels per time series, and it generates 775 features each, so I have 2325 features total.

Fitting an EBM on my dataset (300 samples and 2325 features) takes almost an hour, which makes hyperparameter optimization nearly infeasible. Default EBM performance on my data is poor (70% accuracy), so I feel like hyperparameter optimization is probably necessary.

I assume that feature reduction is probably not necessary and may be discouraged with EBMs, but I’m considering using it just to speed up the time. Explainability is important to me, so I don’t want to use PCA. I’m considering using SelectKBest as a pre-processing step, but I’m not sure if there are other better ways to speed things up.

What can I do?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature selection/preprocessing before EBM fitting to speed up fitting? #566

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature selection/preprocessing before EBM fitting to speed up fitting? #566

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions