Wrong summing output in Simulink R2023a

Joshi Kumar V on 6 Apr 2024
Answered: Steven Lord on 6 Apr 2024
Greetings!
We are training a reinforcement learning agent in Simulink R2023a on Ubuntu 22.04 LTS. We built a reward block that should output a reward of -150 when the input is in a particular range. The inputs to the Sum block in the reward function are -100, -50, 0 and 0, which should add up to -150, but the output of the Sum block is 106.
When we tested with constant values, as shown in the attached image, the Sum block produced -150, as intended.
This has drastically affected the training of our agents.

Answers (1)

Steven Lord on 6 Apr 2024
I don't work with Simulink that much, but I'd check the data types of the various signals in your code.
106 - (-150)
ans = 256
These two numbers differ by 256, which is 2^8. That makes me suspicious that somewhere a signal is being converted to an 8-bit integer type and is wrapping on overflow.
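To illustrate that suspicion, here is a minimal sketch in Python (not Simulink) of what wrapping to an 8-bit integer does to the expected sum. It assumes the result is stored in an 8-bit type that wraps rather than saturates; the thread does not confirm this. The helper `wrap_to_int` is a hypothetical name introduced only for this example.

```python
def wrap_to_int(value, bits=8, signed=True):
    """Wrap an integer into a fixed-width range, as modular overflow would.

    Hypothetical helper for illustration; it is not a Simulink or MATLAB API.
    """
    modulus = 1 << bits                # 256 for 8 bits
    wrapped = value % modulus          # Python's % is non-negative for a positive modulus
    if signed and wrapped >= modulus // 2:
        wrapped -= modulus             # map the upper half onto negative values
    return wrapped


inputs = [-100, -50, 0, 0]             # values reported at the Sum block inputs
true_sum = sum(inputs)                 # -150, the intended reward

# Assumption: the sum ends up in an 8-bit integer that wraps on overflow.
as_int8 = wrap_to_int(true_sum, bits=8, signed=True)
as_uint8 = wrap_to_int(true_sum, bits=8, signed=False)

print(true_sum, as_int8, as_uint8)     # -150 106 106
```

Both the signed and the unsigned 8-bit interpretation give 106, which matches the value reported in the question. A 16-bit signed type would hold -150 unchanged, so if this is the cause, checking which signal in the reward path has an 8-bit data type would be the place to start.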
