Reinforcement Learning - PPO agent with hybrid action space
25 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Federico Toso
el 24 de Oct. de 2023
Comentada: Emmanouil Tzorakoleftherakis
el 30 de Oct. de 2023
I have a task which involves both discrete and continuous actions.
I would like to use PPO since it seems suitable in my case. I know that this algorithm support both discrete and continuous actions spaces, but it seems that current Mathworks implementation does not support both of them at the same time.
I was thinking about the following workaround:
- Use two PPO agents (one for discrete actions, the other one for continuous actions)
- Let them share the same critic network (this should be feasible, since they share the same observation space)
- Train them in parallel with Reinforcement Learning App Designer, with syncronous parameter updates
In this way I may be able to achieve a result that resembles what I would get with a single PPO "hybrid" agent.
My questions:
1) Are the above 3 steps possible with current Mathworks implementation? (I'm mostly concerned about possible limitations of Reinforcement Learning App Designer in this sense)
2) Is there any other workaround that would be advisable for my case? (PPO with hybrid action space)
Of course any reference to an existing example would be highly welcome
0 comentarios
Respuestas (1)
Emmanouil Tzorakoleftherakis
el 27 de Oct. de 2023
Hello,
The workaround you suggested makes sense to me. Unfortunately though, bullet #3 is not currently supported. You cannot do multi-agent training in the app (you would have to set the problem up programmatically) and you can also not do paralle multi-agent training (you would have to either train each agent in parallel sequentially, or do multi-agent training without parallel).
Can you provide more details on the application? It will help us prioritize support of hybrid PPO in a future release.
2 comentarios
Emmanouil Tzorakoleftherakis
el 30 de Oct. de 2023
Thank you for the reply. I will take the feedback to the development team.
Ver también
Categorías
Más información sobre Training and Simulation en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!