CrossValidation results suffering from oversampling/augmentation #97

dominikmn · 2021-04-21T14:15:35Z

Problem

In our current GridSearch approach we train the models on the oversampled/augmented train set.
On the same set, we do perform the cross-validation.
This is a problem as the model sees samples in the validation-split that it already saw in the train-split.
Hence, models that overfit will be favored by the GridSearch.

Resources

https://imbalanced-learn.org/dev/miscellaneous.html#custom-samplers

dominikmn added bug Something isn't working enhancement New feature or request labels Apr 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CrossValidation results suffering from oversampling/augmentation #97

CrossValidation results suffering from oversampling/augmentation #97

dominikmn commented Apr 21, 2021

CrossValidation results suffering from oversampling/augmentation #97

CrossValidation results suffering from oversampling/augmentation #97

Comments

dominikmn commented Apr 21, 2021

Problem

Resources