sgdmupdate() is a single step or multiple steps until convergence?

2 visualizaciones (últimos 30 días)
robinho robinho
robinho robinho el 12 de Jul. de 2022
Comentada: Ben el 18 de Sept. de 2023
Is calling sgdmupdate() once training a net over a single step or multiple steps until convergence? If the latter, how is convergence judged?

Respuestas (1)

Akash
Akash el 15 de Sept. de 2023
Editada: Akash el 15 de Sept. de 2023
Hi Robinho,
I understand that you are seeking clarification on whether calling "sgdmupdate" function during training involves a single step or multiple steps until convergence. Additionally, you would like to know how convergence is determined, specifically the criteria used for judging convergence.
The "sgdmupdate" function utilizes multiple steps or epochs to achieve convergence during training. Each step represents a single update of the parameters using "Stochastic gradient descent with momentum (SGDM)".
To determine convergence, you can specify the total number of training epochs using the "numEpochs" parameter. This parameter indicates the number of times the model goes through the entire training dataset. Convergence is typically judged based on the desired level of accuracy or a predefined stopping criterion.
Additionally, the convergence can also be determined by monitoring the "Stop" property of the "TrainingProgressMonitor" object. If the "Stop" property is set to true, it indicates that the training process should be stopped. This property can be triggered, for example, by clicking the "Stop" button during training.
To gain more information about "sgdmupdate" function, including its usage and convergence criteria, I recommend referring to the documentation provided at the below provided link.
Thanks,
Akash.
  2 comentarios
feynman feynman
feynman feynman el 17 de Sept. de 2023
thanks so much. So sgdmupdate runs multiple steps instead of 1?
Ben
Ben el 18 de Sept. de 2023
Each call to sgdmupdate performs one step of the update algorithm specified at the bottom of the documentation page: https://uk.mathworks.com/help/deeplearning/ref/sgdmupdate.html
Typically you use sgdmupdate in custom training loops, which will typically end up calling sgdmupdate multiple times as the loop progresses, and control of convergence is up to the user to manage in the custom loop.

Iniciar sesión para comentar.

Categorías

Más información sobre Graphics Performance en Help Center y File Exchange.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by