k-fold Cross Validation in Classification Learner

James on 22 Aug 2018
Edited: James on 22 Aug 2018
Hi there, I'm just wondering how data is partitioned in k-fold cross-validation in Classification Learner. Is the data partitioned into k folds according to class/label, or completely at random? That is, is the data split into n subsets according to the n classes, partitioned within each class subset, and then regrouped into the folds used to train/test?

Answers (1)

ahmed nebli on 22 Aug 2018
In k-fold CV, the data is split into k folds: k-1 are used for training and 1 for testing. For example, if you have 100 subjects and use 10-fold CV, the data is split into 90 subjects for training and 10 subjects for testing, and you run k iterations, each time with a different 10 subjects held out for testing.
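The 100-subject, 10-fold example can be sketched in a few lines. This is a minimal illustration in plain Python, not what Classification Learner runs internally; the helper name `kfold_indices` and the fixed seed are assumptions made for the example.

```python
import random

def kfold_indices(n_samples, k, seed=0):
    """Shuffle the sample indices once, then deal them into k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th index starting at position i.
    return [indices[i::k] for i in range(k)]

folds = kfold_indices(100, 10)

splits = []
for i, test_idx in enumerate(folds):
    # Every fold except fold i is used for training in iteration i.
    train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
    splits.append((train_idx, test_idx))
    print(f"iteration {i + 1}: {len(train_idx)} train, {len(test_idx)} test")
```

Each of the 10 iterations trains on 90 subjects and tests on the remaining 10, and every subject is held out exactly once.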
  1 comment
James on 22 Aug 2018
Edited: James on 22 Aug 2018
Yes, but is the data as a whole split in one step, or are the class subsets split and then merged? In your example you mentioned subjects, which I presume equate to data instances, but each instance is labelled. My question is whether the data is split according to labels. For example, if there are 4 classes and 100 instances, with 25 instances in each class, is the data split as a whole, or is it split within the class subsets and then merged to maintain the class proportions?
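To make the two alternatives concrete, here is a small sketch of both partitioning strategies on the 4-class, 25-instances-per-class example. It is plain Python and only illustrates the distinction; it does not show which strategy Classification Learner uses. The class names, the choice of 5 folds, and the function names are all assumptions made for the example.

```python
import random
from collections import Counter

# Hypothetical data set: 4 classes (A-D) with 25 instances each.
labels = [c for c in "ABCD" for _ in range(25)]

def random_folds(labels, k, seed=0):
    """Split the data as a whole: shuffle all indices, ignoring labels."""
    idx = list(range(len(labels)))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def stratified_folds(labels, k, seed=0):
    """Split within each class subset, then merge the pieces fold by fold."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        members = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(members)
        # Deal this class's instances round-robin across the folds.
        for pos, i in enumerate(members):
            folds[pos % k].append(i)
    return folds

def class_counts(folds, labels):
    """Count how many instances of each class land in each fold."""
    return [dict(sorted(Counter(labels[i] for i in fold).items())) for fold in folds]

random_counts = class_counts(random_folds(labels, 5), labels)
stratified_counts = class_counts(stratified_folds(labels, 5), labels)

print("random:    ", random_counts)
print("stratified:", stratified_counts)
```

With the stratified split, every fold holds 5 instances of each class, so the 25/25/25/25 proportions are preserved. With the purely random split, every fold still has 20 instances, but the per-class counts are free to vary from fold to fold.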

