Understanding Action Dimension Formatting in MATLAB's DDPG with LSTM-Based Networks

Question

Abdolrazzagh el 12 de Mzo. de 2025

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/2175069-understanding-action-dimension-formatting-in-matlab-s-ddpg-with-lstm-based-networks

Respondida: Snehal el 24 de Mzo. de 2025

In MATLAB's Reinforcement Learning Toolbox, when using DDPG with LSTM-based actor and critic networks, the conversion of actions to dlarray is handled automatically. Since users do not have direct control over this process: Are actions formatted with a 'T' (time) dimension or a 'C' (channel) dimension when passed between the actor and critic networks? How does MATLAB structure these actions to ensure compatibility with recurrent layers, such as aligning sequences for LSTM time steps?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Snehal el 24 de Mzo. de 2025

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/2175069-understanding-action-dimension-formatting-in-matlab-s-ddpg-with-lstm-based-networks#answer_1562368

Hello @Abdolrazzagh,

I understand that you are trying to know how actions are formatted by the underlying mechanism when using MATLAB’s DDPG with LSTM-Based Networks.

Actions are formatted with a 'T' (time) dimension to ensure compatibility with LSTM layers.

The data is structured in the ‘CBT’ format to ensure that both actor and critic networks can process sequences effectively.

Therefore, MATLAB automatically handles the reshaping of actions to align them with the expected input format for LSTM layers.

For more insights, you may refer to the following documentation link:

https://www.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlddpgagent.html#mw_692a723c-8e84-4d86-b43d-248fb989e57c:~:text=Create%20DDPG%20Agent%20Using%20Custom%20Recurrent%20Neural%20Networks

Below is the link to a similar MATLAB question addressed previously:

https://www.mathworks.com/matlabcentral/answers/2174967-how-does-matlab-internally-format-actions-as-dlarray-in-ddpg-with-recurrent-networks-lstm?s_tid=answers_rc1-2_p2_MLT

Hope this helps.

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Understanding Action Dimension Formatting in MATLAB's DDPG with LSTM-Based Networks

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Community Treasure Hunt

Understanding Action Dimension Formatting in MATLAB's DDPG with LSTM-Based Networks

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos