Reinforcement Learning Toolbox - When does algorithm train?

Hans-Joachim Steinort

17 Sept. 2019

1 Respuesta

Respuesta aceptada

Actualizado a las 26 Sept. 2019

3 Visualizaciones (30 días)

Iniciar sesión para responder a esta pregunta.

Follow Question

Iniciar sesión para responder a esta pregunta.

Follow Question

Mostrar comentarios más antiguos

0 votos

I am currently using the RL-Toolbox with a DQN-Agent built into a long-running process-simulation.

The maximum stepcount is currently 8000 steps per episode.

Unfortunately the documentation seems a little ambiguous to me, so here my question:

Doese the train-function of the RL-Toolbox train the agent at the end of an episode or during the episode when the step count exeeds the minibatch-size (like in the baseline algorithms)?

Thank you in advance.

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question

Respuesta aceptada

Emmanouil Tzorakoleftherakis el 25 de Sept. de 2019

0 votos

The implementation is based on the algorithm listed here.

Weights are being updated at each time step.

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Hans-Joachim Steinort el 26 de Sept. de 2019

"For each training time step" - that was the line I was looking for (yet looking into the source code lead me to the same conclusion).

After double-checking the baseline-algorithms I found that they do it the same way.

Thank you for your time!

Iniciar sesión para comentar.

Más respuestas (0)

Iniciar sesión para responder a esta pregunta.

Categorías

Más información sobre Reinforcement Learning en Centro de ayuda y File Exchange.

Productos

Versión

R2019a

Etiquetas

Preguntada:

Hans-Joachim Steinort

el 17 de Sept. de 2019

Comentada:

Hans-Joachim Steinort

el 26 de Sept. de 2019

Aceptada:

Emmanouil Tzorakoleftherakis

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

Reinforcement Learning Toolbox - When does algorithm train?

0 comentarios Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Respuesta aceptada

1 comentario Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos

Más respuestas (0)

Categorías

Productos

Versión

Etiquetas

Ver también

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguos Ocultar -1 comentarios más antiguos