Super Learning with Repeated Cross Validation

Krzysztof Mnich; Aneta Polewko-Klim; Agnieszka Kitlas Golińska; Wojciech Lesiński; Witold R. Rudnicki

doi:10.1109/icdmw51313.2020.00089

Super Learning with Repeated Cross Validation

Krzysztof Mnich(University of Białystok), Aneta Polewko-Klim(Institute of Computer Science), Agnieszka Kitlas Golińska(University of Białystok), Wojciech Lesiński(University of Białystok), Witold R. Rudnicki(Institute of Computer Science)

Unknown

November 1, 2020

10.1109/icdmw51313.2020.00089

Cited by 9

Abstract

Super learner algorithm was created to combine results of multiple base learners with the use of cross validation. However, in many cases it does not outperform significantly a simple average of the base results. We propose to apply multiple repeats of cross validation to improve the performance of super learning. Two approaches to application of repeated cross validation were tested on artificial data sets and on real-life, biomedical data sets. One of the approaches, MEAN OUTPUT strategy, proved to significantly improve the results. To reduce the computational complexity of the algorithm, we suggest the use of 3-fold, rather than the previously recommended 10-fold validation. The tests showed, that this simplification does not affect the super learning results.

Related Papers

No related papers found

Powered by citation graph analysis