Resume training of a DQN agent. How to avoid Epsilon from being reset to max value?

Cecilia S. on 9 Jun 2021

Direct link to this question

https://la.mathworks.com/matlabcentral/answers/852205-resume-training-of-a-dqn-agent-how-to-avoid-epsilon-from-being-reset-to-max-value

Commented: Cecilia S. on 22 Jun 2021

Accepted Answer: Emmanouil Tzorakoleftherakis

When I want to resume training of an agent, I simply load it and set the "ResetExperienceBufferBeforeTraining" option to false, but this does not prevent the exploration (which depends on epsilon) from starting anew. Is there any way to make the agent resume from the exact point where it left off without manually setting the epsilon value?

Accepted Answer

Emmanouil Tzorakoleftherakis on 22 Jun 2021

Direct link to this answer

https://la.mathworks.com/matlabcentral/answers/852205-resume-training-of-a-dqn-agent-how-to-avoid-epsilon-from-being-reset-to-max-value#answer_730700

Hello,

This is currently not possible, but it is a great enhancement idea. I have informed the developers about your request and it will be considered for a future release.
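Until such an option exists, the manual route mentioned in the question could look roughly like the sketch below. It assumes the documented rlDQNAgentOptions behavior, where Epsilon is multiplied by (1 - EpsilonDecay) at each training step while it is above EpsilonMin. The file name, the variable name agent, and all numeric values are hypothetical placeholders, and the snippet assumes the loaded agent's options can be changed in place through AgentOptions.

```matlab
% Hypothetical values: replace with the settings used in the first training run.
epsilon0     = 1.0;    % Epsilon at the start of the first run
epsilonMin   = 0.01;   % EpsilonMin
epsilonDecay = 0.005;  % EpsilonDecay
stepsDone    = 500;    % agent steps already taken (assumed known from the earlier run)

% Closed-form approximation of the per-step rule Epsilon = Epsilon*(1 - EpsilonDecay),
% clamped at EpsilonMin (the toolbox stops decaying once Epsilon is no longer above it).
resumedEpsilon = max(epsilonMin, epsilon0 * (1 - epsilonDecay)^stepsDone);

% Apply the value to the previously saved agent (hypothetical file and variable name).
agentFile = 'savedAgent.mat';
if isfile(agentFile)
    s = load(agentFile, 'agent');
    agent = s.agent;
    agent.AgentOptions.EpsilonGreedyExploration.Epsilon = resumedEpsilon;
    agent.AgentOptions.ResetExperienceBufferBeforeTraining = false;
end
```

The agent prepared this way would then be passed to train as usual. The step count has to come from your own records of the earlier run, since the agent itself does not expose the decayed epsilon value.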

Cecilia S. on 22 Jun 2021

Excellent, thank you!!

Release: R2021a
