Is the validation set being used for training in a NN?
I'm using the Neural Network Toolbox and the "divideind" function. I split the whole set into training and validation sets to use the early stopping criterion:
net.divideFcn = 'divideind';
[net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(10000,1:7000,7001:10000);
The thing is, I thought the training was performed with just the training set and the validation performance was computed each epoch, but I realised the NN is being influenced by the whole set (training + validation). I really don't know how, and I'd like to change it! I tried:
[net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(10000,1:7000,7001:10000);
[net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(7000,1:7000);
and they give different results (for the same number of epochs). Using the whole set for training, the results are more similar, but still different:
[net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd] = divideind(10000,1:10000);
Because of this, I'm computing the test error separately, to be sure the test set is not being used for training!
Do you know what is happening? Do you think I can do it as I need, i.e. 7000 examples just for training and 3000 just for validation? Thank you!
0 comments
Accepted Answer
Greg Heath
on 26 Sep 2012
% The short answer is no.
> I'm using the Neural Network toolbox and the "divideind" function. I
> split the all set into train and validation sets to use the early
> stopping criteria:
> net.divideFcn='divideind';
> [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd]
> = divideind(10000,1:7000,7001:10000);
% If you are trying to debug, use a very small data sample and omit
% command-ending semicolons
>> net = fitnet;
net.divideFcn='divideind';
[net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd]
= divideind(10,1:7,8:10) % Small data set & No semicolon
ans = []
= divideind(10,1:7,8:10)
|
Error: The expression to the left of the equals sign is not a valid target for an assignment.
% I cannot get your syntax to work. What version do you have?
% On the other hand
>> net = fitnet;
net.divideFcn='divideind';
[trainInd,valInd,testInd] = divideind(10,1:7,8:10)
trainInd = 1 2 3 4 5 6 7
valInd = 8 9 10
testInd = []
% These can be assigned to the net separately
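% For example (a minimal sketch; x and t below are placeholders for
% your own input and target matrices):
[trainInd,valInd,testInd] = divideind(10,1:7,8:10);
net = fitnet;
net.divideFcn = 'divideind';
net.divideParam.trainInd = trainInd;
net.divideParam.valInd = valInd;
net.divideParam.testInd = testInd;
[net,tr] = train(net,x,t);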
> The thing is I thought the training was performed just with the training
> set, and each epoch the validation performance was computed, but I
> realised the NN is being influenced by the all set (training+validation).
% You are mistaken.
% The validation set affects ONLY the stopping epoch.
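% You can check this yourself: train twice from the same random seed,
% once with the validation set and once without, and compare tr.perf
% epoch by epoch. A sketch, assuming x and t hold your 10000 cases and
% with an arbitrary choice of 10 hidden units:
rng(0) % fixed seed -> identical initial weights
net = fitnet(10);
net.divideFcn = 'divideind';
net.divideParam.trainInd = 1:7000;
net.divideParam.valInd = 7001:10000;
net.divideParam.testInd = [];
[net1,tr1] = train(net,x,t);
rng(0) % restart from the identical state
net = fitnet(10);
net.divideFcn = 'divideind';
net.divideParam.trainInd = 1:7000;
net.divideParam.valInd = [];
net.divideParam.testInd = [];
[net2,tr2] = train(net,x,t);
% tr1.perf and tr2.perf should agree epoch for epoch up to tr1's
% validation stop; only the stopping epoch differs.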
> I really don't know how and I'd like to change it! I tried:
> [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd]
> = divideind(10000,1:7000,7001:10000);
> [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd]
> = divideind(7000,1:7000);
> and they give different results (for the same epochs).
% That is because you did not use the same random number seed (help rng)
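% For reproducible comparisons, fix the seed immediately before each run
% (a sketch; the architecture is an arbitrary choice):
rng(0) % same state -> same initial weights drawn inside train
net = fitnet(10);
[net,tr] = train(net,x,t);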
> Using the all set for training the results are more similar but yet different:
> [net.divideParam.trainInd,net.divideParam.valInd,net.divideParam.testInd]
> = divideind(10000,1:10000); And because of this I'm computing the test
> error separately to be sure the test set is not being used to training!
> Do you know what is happening? Do you think I can do it as I need, i.e.
> 7000 just for training and 3000 just for validation?
% You can safely include all three subsets in the division. They work as they should.
% Use the structure tr when training and, after training, omit the semicolon to reveal its contents.
[net,tr] = train(net,input,output);
tr = tr
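% For example, the training record stores the data division and the
% per-subset errors (standard tr fields; x and t are again your own data):
tr.best_epoch % epoch with the lowest validation error
tr.perf       % training error per epoch
tr.vperf      % validation error per epoch
tr.tperf      % test error per epoch
% and the test error can be recomputed from the stored indices:
ytst = net(x(:,tr.testInd));
etst = perform(net,t(:,tr.testInd),ytst)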
Hope this helps.
Thank you for formally accepting my answer.
Greg
3 comments
Greg Heath
on 27 Sep 2012
> I disagree. I made this simple experiment: I considered 7000 examples,
> and 3000 more different examples which, together with the 7000, lead to
> bad learning. I considered:
> 1. As input I considered 7000, all for training.
> 2. As input I considered 10000, 7000 for training and 3000 for validation.
> 3. As input I considered 10000, all for training.
> All the results are different.
I'm not surprised.
> but 2 and 3 are much worse than 1. If the training is not considering
> the 3000 examples that I consider for validation, then the training
> performance should be similar for 1 and 2. And it is not. Thank you very
> much. Ana
> Sizes of trn/val/tst matrices?
What are you using as an unbiased test set? Do you have more than 10,000 cases?
Oh! You are using the combined val/tst set as a BIASED test set?
Did you check the training window to see why each experiment was terminated?
Assuming you specified the same random number seed to create the initial weights:
1&3. Successful training proceeds to training-set convergence or the maximum number of epochs.
2. Successful training proceeds until one of the two conditions above is reached OR the validation error hits a minimum (beyond which the next 6 validation errors are monotonically increasing).
In the real world,
1. Convergence may be due to a nonglobal local minimum.
2. Training may terminate for other reasons, e.g.,
a. maximum mu reached
b. minimum gradient reached
(the stopping reason is recorded in tr.stop; see the sketch after this list).
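% A quick way to see why a given run stopped (a sketch; the message
% strings are examples of what tr.stop may contain):
[net,tr] = train(net,x,t);
tr.stop       % e.g. 'Validation stop.' or 'Minimum gradient reached.'
tr.num_epochs % epoch at which training stopped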
Hope this helps.
Greg
P.S. I always train 10 nets and look at the combined tabulations. Search for examples using the keywords heath close clear Ntrials
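A minimal version of that loop might look like this (a sketch; the 10 hidden units and the use of tr.best_vperf as the score are illustrative choices, and x and t stand for your own data):
Ntrials = 10;
score = zeros(Ntrials,1);
for i = 1:Ntrials
    rng(i) % a different, reproducible seed for each net
    net = fitnet(10);
    net.divideFcn = 'divideind';
    net.divideParam.trainInd = 1:7000;
    net.divideParam.valInd = 7001:10000;
    net.divideParam.testInd = [];
    [net,tr] = train(net,x,t);
    score(i) = tr.best_vperf; % validation error at the best epoch
end
score % tabulate and compare the 10 runs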
More Answers (0)