Obtaining Information from Reinforcement Learning while Training

Question

Huzaifah Shamim el 4 de Ag. de 2020

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/575119-obtaining-information-from-reinforcement-learning-while-training

Respondida: Stephan el 16 de Ag. de 2020

While training my custom environment using a DQN, I want to be able to store the reward and another value at the end of each episode somewhere so that I may look at it at the end of training. How may I do that?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Stephan el 16 de Ag. de 2020

2
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/575119-obtaining-information-from-reinforcement-learning-while-training#answer_480651

The reward of each training episode is stored in the trainsStats struct which is the output argument of the train function. Inside this struct you find a bunch of informations regarding the training process. See here for what is strored int this struct.

If you want to store additional information you have to do some more effort. For example you could save additional informations inside a .mat-file always if the isDone flag is true in your step function.

The disadvantage of this apporach is, that loading a .mat-file, adding one or more values to it and saving it again can be a time consuming operation. Since you would do this only one time each episode (if the episode is over, indicated by the isDone flag), maybe this is an acceptable way to learn more about your agents behaviour during the training process.

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Obtaining Information from Reinforcement Learning while Training

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Obtaining Information from Reinforcement Learning while Training

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuestas (1)

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos