Applying reinforcement learning with two continuous actions: during training, one action varies but the other stays virtually static.
Hello,
I am trying to train a DDPG agent to control a vehicle's (kinematic model) steering angle and velocity. The goal is to train the agent so the vehicle can move from an initial (x, y, theta) pose to a final (x, y, theta) pose, with a single agent performing both actions.
The action ranges are [-0.78, +0.78] and [-2.5, +2.5]. The actor network ends in a tanh layer followed by scaling by [0.78; 2.5]. During training I noticed that the steering angle does not change (it stays stuck at 0.78) while the velocity varies, and this hurts the training. What could be the reason for this? Is a single agent suitable for this task? I am still learning RL, so any suggestion would be helpful.
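For context, the actor's output stage described above might look like the sketch below (layer names and sizes are illustrative, not the original network). The point is that once the pre-tanh activation for one channel grows large, tanh saturates and the scaled action pins at the bound:

```matlab
% Sketch of the actor's output stage, assuming a tanh + scaling design.
% tanhLayer bounds outputs to [-1, 1]; scalingLayer maps each channel
% to its action range.
actionScale = [0.78; 2.5];   % [steering; velocity] half-ranges

outputLayers = [
    fullyConnectedLayer(2, 'Name', 'actor_out')
    tanhLayer('Name', 'tanh')
    scalingLayer('Name', 'scale', 'Scale', actionScale)];

% If the pre-tanh activation for the steering channel is large and
% positive, tanh saturates near 1 and the scaled action sits at +0.78,
% matching the "stuck at 0.78" symptom described above.
```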
Answers (1)
Emmanouil Tzorakoleftherakis
24 Jan 2023
You should be able to use a single agent for this task. Since you are using DDPG, the first thing I would check is whether the noise options are set properly for both inputs.
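A minimal sketch of per-channel noise settings, assuming the default Ornstein-Uhlenbeck noise of `rlDDPGAgentOptions` (the numeric values here are illustrative starting points, not recommendations from the answer):

```matlab
% With vector-valued noise settings, each action channel gets its own
% exploration scale, so steering is not under-explored relative to
% velocity, which has a much wider range.
opts = rlDDPGAgentOptions;

% One entry per action channel: [steering; velocity]. A common starting
% point is a standard deviation of a few percent of each action's range.
opts.NoiseOptions.StandardDeviation          = [0.08; 0.25];
opts.NoiseOptions.StandardDeviationDecayRate = 1e-5;
opts.NoiseOptions.MeanAttractionConstant     = 0.15;
```

If the steering channel's noise is left near zero while the velocity channel's is not, the agent would explore velocity but leave steering effectively frozen, which is consistent with the behavior described in the question.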
5 comments
Bay Jay
6 Feb 2023
Bay Jay
13 Feb 2023
Emmanouil Tzorakoleftherakis
13 Feb 2023
Sparse rewards are a bit more challenging because the agent needs to hit the exact triggering condition to receive them. By the way, what you have here is not a one-time reward unless you also use "distance < 0.5 and abs(theta_diff) < 0.5" as your IsDone signal. You could also play with the weight factor or increase the minimum distance threshold from 0.5.
By the way, there is a very similar example in Reinforcement Learning Toolbox; you can use that to get some ideas as well.
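The point about pairing the terminal bonus with the IsDone signal can be sketched as follows (function, variable names, and numeric values are hypothetical, not from the thread):

```matlab
% Hypothetical reward sketch: a dense shaping term plus a terminal bonus.
% The bonus fires only when the pose condition is met; using the same
% condition as the IsDone signal makes it a genuine one-time reward.
function [reward, isDone] = poseReward(distance, theta_diff)
    w = 0.1;                                  % weight factor to tune
    reward = -w * distance;                   % dense shaping term
    atGoal = distance < 0.5 && abs(theta_diff) < 0.5;
    if atGoal
        reward = reward + 10;                 % one-time terminal bonus
    end
    isDone = atGoal;                          % terminate on success
end
```

Without the `isDone` output tied to the same condition, the agent could keep collecting the bonus on every step after reaching the goal region, which changes the incentive entirely.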
Bay Jay
15 Feb 2023
Emmanouil Tzorakoleftherakis
15 Feb 2023
Edited the link