CNN training progress plots - validation accuracy jumps at the last iteration

Dear colleagues,
I'm training a CNN in MATLAB and noticed the behavior shown in the figures below: the validation accuracy jumps at the very last iteration, regardless of the number of epochs used in training. I find this confusing. What could be the reason for it?
Thank you.
[Training progress plot, #Epochs = 5]
[Training progress plot, #Epochs = 10]
[Training progress plot, another trial with #Epochs = 10]
2 comments
Don Mathis on 19 Feb 2019
Can you post your network layers and training options?
Mariam Ahmed on 19 Feb 2019
Yes, here it is. (inputSize1, inputSize2, numCH, f, ff, ConvOUT, Xtest and Ytest are defined earlier in the script and not shown.)
NumClasses = 2;
nor = batchNormalizationLayer('Name','BN'); % Batch normalization layer
act1 = leakyReluLayer('Name','LeakyRELU'); % Non-linear activation layer
p = averagePooling2dLayer([2 2],'Name','POOL');
outputLayer = classificationLayer('Name','Classify');
s1 = softmaxLayer('Name','Softmax');
%---Dropout layer
Dlayer = dropoutLayer(0.35,'Name','drop1');
%%----------------------------------------------------------------------
%---Define the layers' activation functions
%%----------------------------------------------------------------------
actCONV = act1;
actFC1 = act1;
%%----------------------------------------------------------------------
%---Start creating the CNN structure
%%----------------------------------------------------------------------
%----Number of filters in Conv layer
n_f = 6;
%----Number of units in FC1 layer
f1_numO_CH = 7;
%---Input Layer
inputLayer = imageInputLayer([inputSize1 inputSize2 numCH],'Name','Input');
%---1st Layer "Conv1"
c1_numF = n_f;
c1 = convolution2dLayer([f ff],c1_numF,'WeightL2Factor',0.7,'Name','Conv1');
c1.Weights = randn([f ff numCH c1_numF]) * 0.01;
c1.Bias = zeros([1 1 c1_numF]);
%---FC1
f1_numO = f1_numO_CH;
numFinal = c1_numF*ConvOUT; % FC1 input size: filters times spatial positions in the Conv1 output
f1 = fullyConnectedLayer(f1_numO,'Name','FC1');
f1.Weights = randn([f1_numO numFinal]) * 0.01;
f1.Bias = zeros([f1_numO 1]);
%---FC2
f2_numO = NumClasses;
f2 = fullyConnectedLayer(f2_numO,'Name','FC2');
f2.Weights = randn([f2_numO f1_numO]) * 0.01;
f2.Bias = zeros([f2_numO 1]);
%%----------------------------------------------------------------------
%---Define the ConvNet structure
%%----------------------------------------------------------------------
convnet = [inputLayer; c1; nor; actCONV; Dlayer; f1; actFC1; f2; s1; outputLayer]; % note: the pooling layer p defined above is not used here
opts = trainingOptions('adam', ...
    'MaxEpochs',3, ...
    'MiniBatchSize',2^8, ...
    'Shuffle','every-epoch', ...
    'InitialLearnRate',0.05, ...
    'LearnRateSchedule','piecewise', ...
    'LearnRateDropPeriod',1, ...
    'LearnRateDropFactor',0.68, ... % LR per epoch: 0.05, then 0.05*0.68 = 0.034, then 0.05*0.68^2 ~ 0.0231
    'ValidationData',{Xtest,Ytest}, ...
    'ValidationPatience',Inf, ...
    'Verbose',false, ...
    'Plots','training-progress');
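For completeness, the network would then be trained with a call along these lines (a sketch; Xtrain and Ytrain, the training images and labels, are not shown in this post):
[trainedNet, info] = trainNetwork(Xtrain, Ytrain, convnet, opts);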
Thank you.


Accepted Answer

Kenta on 16 Jul 2020
If your network includes a batch normalization layer, the final validation accuracy and the accuracy reported during training can differ. During training, the validation metric is computed with the layer's running (moving-average) statistics; when training finishes, MATLAB finalizes the batch normalization statistics over the training data and evaluates the validation set one last time with the finalized network, which is why the accuracy jumps at the very last iteration. Hope it helps!
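One way to check this is to compare the last validation point logged during training with the accuracy of the network that trainNetwork returns: the returned network carries the finalized batch normalization statistics, so its accuracy should match the jumped final value. A rough sketch, assuming the convnet and opts from the question plus training data Xtrain/Ytrain:
[net, info] = trainNetwork(Xtrain, Ytrain, convnet, opts);
% Validation accuracy logged during training; NaN marks iterations
% where no validation pass was run.
valAcc = info.ValidationAccuracy;
lastLogged = valAcc(find(~isnan(valAcc), 1, 'last'));
% Accuracy of the returned network, which uses the finalized
% batch normalization statistics.
YPred = classify(net, Xtest);
finalAcc = 100 * mean(YPred == Ytest);
fprintf('Last logged validation accuracy: %.2f%%\n', lastLogged);
fprintf('Returned network accuracy:       %.2f%%\n', finalAcc);
In recent releases (R2021a and later), trainingOptions also accepts 'BatchNormalizationStatistics','moving', which keeps the running estimates instead of recomputing them at the end of training, so the final jump should not appear.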

More Answers (0)
