I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

Question

Maha Mosalam el 19 de Dic. de 2021

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit

Respondida: Yash el 23 de Dic. de 2024

for example online actor=10^-1 and target actor 10^-2...how I can do this in matlab?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Yash el 23 de Dic. de 2024

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/1614395-i-am-using-ddpg-if-there-are-four-network-to-algorithm-actor-target-actor-critic-target-crit#answer_1556284

Abrir en MATLAB Online

Yes, you can use different learning rates for Actor and Critic by specifying them individually when setting up your training options for DDPG agent. Here is a simple code snippet to achieve this:

actorOptimizerOptions = rlOptimizerOptions(LearnRate=1e-1)
criticOptimizerOptions = rlOptimizerOptions(LearnRate=1e-2)
opt = rlDDPGAgentOptions('ActorOptimizerOptions',actorOptimizerOptions,'CriticOptimizerOptions',criticOptimizerOptions)

Refer to this documentation page for more information on creating an object for DDPG agent: https://www.mathworks.com/help/reinforcement-learning/ref/rl.option.rlddpgagentoptions.html

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Community Treasure Hunt

I am using DDPG .If there are four network to algorithm (actor, target actor , critic , target critic) in algorithm, and if possible to use different learning rate to each?

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos