- You can use something like this. Unfortunately, we do not yet have any Reinforcement Learning Toolbox examples that show how to use it.
- If it's challenging to estimate what a good episode reward is, you can run a single training session for a good number of episodes (e.g. 5k episodes) to get some idea of how the agent is doing, and then use what you learn from the training plot to set the 'SaveAgent' options (SaveAgentCriteria, SaveAgentValue) as needed. Most of the time you will need to run multiple training sessions anyway to tweak parameters, rewards, etc., so just use the first one to get some intuition.
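The two-pass workflow described above could be sketched roughly as follows. This is only an illustration under several assumptions: the predefined double-integrator environment and the default DDPG agent stand in for your own setup, default agent creation from specifications assumes R2020b or later, and the step limit and the "fifth-highest reward" rule are placeholder choices, not a recommended recipe.

```matlab
% Sketch only: environment, agent, and all numeric values are placeholders.
env     = rlPredefinedEnv("DoubleIntegrator-Continuous"); % stand-in environment
obsInfo = getObservationInfo(env);
actInfo = getActionInfo(env);
agent   = rlDDPGAgent(obsInfo, actInfo);                  % default DDPG agent

% 1) Exploratory run: save nothing, just collect the reward history.
exploreOpts = rlTrainingOptions( ...
    "MaxEpisodes", 5000, ...
    "MaxStepsPerEpisode", 200, ...
    "StopTrainingCriteria", "EpisodeCount", ...
    "StopTrainingValue", 5000, ...
    "SaveAgentCriteria", "none");
stats = train(agent, env, exploreOpts);

% 2) Take the fifth-highest episode reward seen so far as the save threshold
%    (assumed heuristic for "roughly the top 5").
sortedRewards = sort(stats.EpisodeReward, "descend");
threshold     = sortedRewards(min(5, numel(sortedRewards)));

% 3) Next session: start from a fresh agent and save every agent whose
%    episode reward reaches the threshold.
agent    = rlDDPGAgent(obsInfo, actInfo);
saveOpts = rlTrainingOptions( ...
    "MaxEpisodes", 5000, ...
    "MaxStepsPerEpisode", 200, ...
    "StopTrainingCriteria", "EpisodeCount", ...
    "StopTrainingValue", 5000, ...
    "SaveAgentCriteria", "EpisodeReward", ...
    "SaveAgentValue", threshold, ...
    "SaveAgentDirectory", "savedAgents");
train(agent, env, saveOpts);
```

Because training is stochastic, a threshold taken from one session will not yield exactly five saved agents in the next; it only keeps the saved set small enough to compare by hand afterwards.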
Hyperparameter optimization and saving the best agents for Reinforcement Learning
laha_M
on 2 Dec 2020
Commented: Francisco Serra
on 23 Jan 2024
I am trying to train my RL agent (DDPG), but it's performing quite poorly. I think the problem may be the hyperparameter values, since I have not tuned them. I have two questions:
- Is there anything in MATLAB that can help with hyperparameter optimization, other than manual trial and error?
- How do I save the best-performing agent given that I don't know the critical values (i.e. I don't know the range of the reward)? Basically, I want to save the agent that achieves the maximum reward or, say, the top 5 highest-rewarding agents.
Thanks.
Accepted Answer
Emmanouil Tzorakoleftherakis
on 3 Dec 2020
Hello,