Reinforcement Learning experience buffer length and parallelisation toolbox

Question

Tech Logg Ding el 2 de Dic. de 2020

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox

Editada: Emmanouil Tzorakoleftherakis el 3 de Dic. de 2020

Respuesta aceptada: Emmanouil Tzorakoleftherakis

When parallelisation is used when training a DDPG agent with the following settings:

trainOpts.UseParallel = true;
trainOpts.ParallelizationOptions.Mode = 'async';
trainOpts.ParallelizationOptions.StepsUntilDataIsSent = -1;
trainOpts.ParallelizationOptions.DataToSendFromWorkers = 'Experiences';

Does the the parallel simulations have their own experience buffer? This could take up more memory hence I am hoping that only one experience buffer is stored to update the critic network.

From the documentations, it seems like there will only be one experience buffer as the experiences are sent back to the host.

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Emmanouil Tzorakoleftherakis el 3 de Dic. de 2020

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/673448-reinforcement-learning-experience-buffer-length-and-parallelisation-toolbox#answer_564503

Editada: Emmanouil Tzorakoleftherakis el 3 de Dic. de 2020

Hello,

There is one big experience buffer on the host, the size of which you determine as usual in your agent options. Each worker has a much smaller buffer to collect experiences until you reach "StepsUntilDataIsSent".

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Reinforcement Learning experience buffer length and parallelisation toolbox

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Reinforcement Learning experience buffer length and parallelisation toolbox

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos