How GAE calculates in Reinforement Learning Toolbox(PPO)?

6 visualizaciones (últimos 30 días)
A difference between help center and reference[3] about TD error.
Why in Generalized Advantage Estimator?
https://ww2.mathworks.cn/help/reinforcement-learning/ug/ppo-agents.html

Respuesta aceptada

Emmanouil Tzorakoleftherakis
Emmanouil Tzorakoleftherakis el 16 de Feb. de 2021
Hello,
Thank you for catching this typo - it should be Gt = Dt+V. I have let the documentation team know.

Más respuestas (0)

Categorías

Más información sobre Specialized Power Systems en Help Center y File Exchange.

Etiquetas

Productos


Versión

R2020a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by