Training Reinforcement Learning Agents --> Use ResetFcn to delay the agent's behaviour in the environment

Question

Federico Toso el 22 de En. de 2024

0
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/2072781-training-reinforcement-learning-agents-use-resetfcn-to-delay-the-agent-s-behaviour-in-the-enviro

Comentada: Federico Toso el 26 de En. de 2024

Respuesta aceptada: Emmanouil Tzorakoleftherakis

I would like to train my RL Agent in an environment which is represented by an FMU block in Simulink.

Unfortunately whenever a simulation starts I experience some brief natural oscillations in the states before the system reaches the ideal stedy state for the training.

I would like to tell my agent to wait for the steady state to be reached every time, before starting any experience related to the training.

I know that ResetFcn can be called at the beginning of each simulation, but this is usually used to change parameters in the blocks before the simulation starts; is it possible to use it for my specific purposes instead, i.e. to let some time buffer between the beginning of the simulation and the beginning of my agent's action?

If this is not possible, are there other suitable ways to overcome this problem?

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Emmanouil Tzorakoleftherakis el 24 de En. de 2024

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/2072781-training-reinforcement-learning-agents-use-resetfcn-to-delay-the-agent-s-behaviour-in-the-enviro#answer_1396816

You can place the RL Agent block inside a triggered subsystem and set the agent's sample time to -1 (see e.g. here). Then set this subsystem to be executed whenever it makes sense for your problem.

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Federico Toso el 26 de En. de 2024

That did the trick!

Thank you very much

Iniciar sesión para comentar.

Training Reinforcement Learning Agents --> Use ResetFcn to delay the agent's behaviour in the environment

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Training Reinforcement Learning Agents --> Use ResetFcn to delay the agent's behaviour in the environment

0 comentarios Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

Respuesta aceptada

1 comentario Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos

Más respuestas (0)

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguosOcultar -2 comentarios más antiguos

1 comentario
Mostrar -1 comentarios más antiguosOcultar -1 comentarios más antiguos