How to train DDPG episode reward more better?

4 visualizaciones (últimos 30 días)

hunson yang el 26 de Feb. de 2020

1
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/507677-how-to-train-ddpg-episode-reward-more-better

Comentada: Guoge Tan el 25 de Mayo de 2020

I'm training a DDPG agent from the Reinforcement Learning toolbox. But as you can see, my episode reward never change. I try so many way to fix this problem. Like change the netwoek, Gradient Threshold, Learning Rate. But the result will be the same. I check my reward funtion, if the situation is eligible I will give it some reward or penalty. But its reward is always be same.

Is my condtion have some problem? Or my results are not input into the model? I dont have anyway to do.

2 comentarios
Mostrar NingunoOcultar Ninguno

Emmanouil Tzorakoleftherakis el 28 de Feb. de 2020

How did you set the IsDone flag? This may lead to premature episode termination

Guoge Tan el 25 de Mayo de 2020

Hi, sorry to bother you, but I'd like to ask if your problem is solved or not? I‘m working on a path planning problem using the Reinforcement Learning toolbox on MATLAB R2020a and I also encountered a problem similar to yours.