Convolution Neural Network Checkpoints with Batch Normalization

4 visualizaciones (últimos 30 días)
Joseph Rivera
Joseph Rivera el 15 de Oct. de 2019
Comentada: Sam Leeney el 15 de Dic. de 2022
The main problem that I'm addressing is that checkpoints from trainNetwork() are broken if they have Batch Normalization layers. I think this is a known issue.
I was thinking of running trainNetwork() again on each checkpoint for a single epoch on a small subset of my training images with realmin learning rate just to finish the training without affecting the network too much.
The issue i'm having is setting the average image property in the imageInputLayer to be static. I already have the zerocenter mean value for my entire training dataset and that will be overwritten with the stats from the smaller dataset which is read-only. Do i really have to use the same training set of over 1 mil images at a realmin learning rate to make my checkpoints usable?
Is there another approach i should try?
  1 comentario
Sam Leeney
Sam Leeney el 15 de Dic. de 2022
Infuriating that a product people actually pay for has left a fundamental bug like this unfixed for so long. The checkpoint function is rendered useless. Using MATLAB is becoming more infuriating every day.

Iniciar sesión para comentar.

Respuestas (0)

Categorías

Más información sobre Image Data Workflows en Help Center y File Exchange.

Productos


Versión

R2019b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by