Reinforcement Learning based quadrotor control using Soft Actor-Critic (the reward is not converging)

5 visualizaciones (últimos 30 días)
Hi, I am trying to control of a rotary wing UAV (quadrotor) by using Soft-Actor Critic methodology, but I have some problems, my reward is increasing continously after the point you see following image, what is the main problem, can you advice for this situation, I am sharing my files (Simulink and m-file). My max reward values should be zero as we define in reward function on Simulink file. This reward function indicates that the difference between desired trajectory and actual trajectory is about zero.
  2 comentarios
SALMAN IJAZ
SALMAN IJAZ el 27 de Nov. de 2024
Hello, can you share your simualtions for detailed understanding and discussion.
Unmanned Aerial and Space Systems
Unmanned Aerial and Space Systems el 3 de Ag. de 2025
Editada: Unmanned Aerial and Space Systems el 3 de Ag. de 2025
Hi Dear Salman, if I`m not wrong you have also put a comment under my other query.
If you work on same field, we can go through the problem, thanks.

Iniciar sesión para comentar.

Respuestas (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by