The definition of the Target update frequency in Reinforcement Learning Designer.

8 visualizaciones (últimos 30 días)
In DDPG Agent, there are four networks. Online policy, Target policy, Online Q and Target Q.
The [Target update frequency] is used to the Target policy and Target Q in Reinforcement Learning Designer.
Are the Update frequency of the Online policy and Online Q same as the [Target update frequency] ?

Respuesta aceptada

UDAYA PEDDIRAJU
UDAYA PEDDIRAJU el 12 de Mzo. de 2024
Hi Xian,
No, the update frequency of the Online Policy and Online Q networks is not the same as the Target Update Frequency. The Target Update Frequency specifically applies to how often the Target Policy and Target Q networks are updated, which is typically less frequent or managed differently to ensure stability in learning.
  1 comentario
Xian Zheng Hong
Xian Zheng Hong el 16 de Mzo. de 2024
Thanks for answering. Here is my another question.
Are the Online policy and Online Q updated at every time step in Reinforcement Learning Designer Toolbox?

Iniciar sesión para comentar.

Más respuestas (0)

Categorías

Más información sobre Deep Learning Toolbox en Help Center y File Exchange.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by