
How to extract the weights of the actor network (inside the step function of the environment) while training the agent in DDPG RL

Hello Everyone,
I am building an LQR-type controller. I need to extract the weights of the actor network (which is essentially the feedback gain K) inside the step function of the environment during training. The reason I want to do this is that during training I want to see K (the actor weights) and impose a stability condition on the closed-loop system. My step function is as follows:
function [nextobs,rwd,isdone,loggedSignals] = step(this,action)
%% I want to extract K (the actor network weights) here
loggedSignals = [];
x = this.State;
% Integrate the plant dynamics over one sample period
tspan = 0:0.01:this.Ts;
[~,xk1] = ode15s(@NDAE_func_ode_RL,tspan,x,this.SYS.options1,action,this.SYS.d1,this);
this.State = xk1(end,:)';
nextobs = this.Cd*xk1(end,:)';
% Quadratic LQR-type reward
rwd = -x'*this.Qd*x - action'*this.Rd*action - 2*x'*this.Nd*action;
% Terminate if the solver stopped early or the state reached the goal
isdone = length(xk1(:,1))<length(tspan) || norm(x) < this.GoalThreshold;
end
Any guidance/suggestions would be highly appreciated.
Thanks,
Nadeem

Answers (1)

Harsha Vardhan on 17 Nov 2023
Edited: Harsha Vardhan on 17 Nov 2023
Hi,
I understand that you want to extract the weights of the actor network inside the step function of the environment during training in a DDPG reinforcement learning setup.
To extract the weights, you can follow the steps below:
  • Pass the 'agent' as an additional input argument to the 'step' function.
  • Inside the 'step' function, use the 'getActor' function to obtain the actor function approximator from the agent.
  • Use the 'getLearnableParameters' function to extract the actor's learnable parameters (weights).
Please check the modified code below:
function [nextobs,rwd,isdone,loggedSignals] = step(this,action,agent)
%% Extract K (the actor network weights)
% Obtain the actor function approximator from the agent
actor = getActor(agent);
% Obtain the learnable parameters (weights) from the actor
params = getLearnableParameters(actor);
loggedSignals = [];
x = this.State;
% Integrate the plant dynamics over one sample period
tspan = 0:0.01:this.Ts;
[~,xk1] = ode15s(@NDAE_func_ode_RL,tspan,x,this.SYS.options1,action,this.SYS.d1,this);
this.State = xk1(end,:)';
nextobs = this.Cd*xk1(end,:)';
% Quadratic LQR-type reward
rwd = -x'*this.Qd*x - action'*this.Rd*action - 2*x'*this.Nd*action;
% Terminate if the solver stopped early or the state reached the goal
isdone = length(xk1(:,1))<length(tspan) || norm(x) < this.GoalThreshold;
end
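If the actor is a single fully connected layer with no bias, so that the policy is pure linear state feedback u = K*x, the gain can be read directly from the first learnable parameter. The snippet below is a minimal sketch under that assumption; the indexing into 'params' and the 'this.SYS.A'/'this.SYS.B' fields are hypothetical and depend on your actual network and plant definitions:
params = getLearnableParameters(getActor(agent));
K = extractdata(params{1}); % weight matrix of the (assumed) single linear layer, so u = K*x
% Example closed-loop stability check, assuming linear dynamics xdot = A*x + B*u:
% isStable = all(real(eig(this.SYS.A + this.SYS.B*K)) < 0);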
For more details, please refer to the following documentation:
  1. Extract actor from reinforcement learning agent: https://www.mathworks.com/help/reinforcement-learning/ref/rl.agent.rlqagent.getactor.html
  2. Create Custom Environment Using Step and Reset: https://www.mathworks.com/help/reinforcement-learning/ug/create-matlab-environments-using-custom-functions.html
  3. Deep Deterministic Policy Gradient (DDPG) Agents – Creation and Training: https://www.mathworks.com/help/reinforcement-learning/ug/ddpg-agents.html
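One practical note: if you train with the built-in 'train' function, 'step' is invoked with only the action input, so the extra 'agent' argument will not be supplied automatically. A common workaround, sketched below under the assumption that your environment class defines an added 'Agent' property (a hypothetical name, not part of the standard template), is to store the agent in the environment before training. Because agent objects are handle objects, the stored reference reflects the current weights as training updates them:
% Hypothetical setup: the environment class defines a public Agent property
env.Agent = agent;                       % store a handle to the agent
trainStats = train(agent,env,trainOpts); % train as usual
% Then, inside step, no extra input argument is needed:
% actor = getActor(this.Agent);
% params = getLearnableParameters(actor);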
Hope this helps in resolving your query!
