Fail to start parpool with workers more than 32

15 visualizaciones (últimos 30 días)
yanci zhang
yanci zhang el 26 de Dic. de 2023
Editada: Matthew el 7 de Mayo de 2024
I have been attempting to enable a parallel pool with multiple workers on my local machine. The parallel pool launches normally with 12 workers, but I encounter an error when I try to increase the number to 32. Here are some reference details: Local hardware configuration: CPU(s) = 336, available memory = 694GB MATLAB version: matlab2023b The error message for the 32 workers Parallel pool test is as follows:
Error Report: Failed to initialize the interactive session.
Caused by: Error using parallel.internal.pool.AbstractInteractiveClient>iThrowIfBadParallelJobStatus The interactive communicating job failed with no message.
Interactive client bound to URL: tcp://tcpnodelay=localhost:27370/protocol/catapult and port 27370 Session failed to start when creating InteractiveClient. Error: Error using parallel.internal.pool.AbstractInteractiveClient>iThrowWithCause Failed to initialize the interactive session.
Has anyone encountered a similar issue? I believe that local hardware resources should not be a limiting factor for utilizing more workers, but why am I unable to use more than 12 workers?

Respuestas (1)

Sachin
Sachin el 29 de Dic. de 2023
Hi Yanci,
To address the issue you are experiencing, I kindly suggest performing the following troubleshooting steps in the MATLAB Command Window:
>> restoredefaultpath
>> savepath
After completing these steps, I recommend restarting MATLAB. Once MATLAB has been restarted, attempt to initiate a parallel pool once more.
Should you encounter an error related to the inability to access the file located at C:\Program Files\MATLAB\R2023b\toolbox\local\pathdef.m, it may be necessary to remove the pathdef.m file.
Doing so will allow MATLAB to recreate it upon the next execution of the relevant command.
I trust that the provided solution will assist in resolving your query.
Thanks
  4 comentarios
yanci zhang
yanci zhang el 4 de En. de 2024
It should also be noted that the CPU I am currently using is an AMD EPYC 9634 84-Core Processor.
Matthew
Matthew el 7 de Mayo de 2024
Editada: Matthew el 7 de Mayo de 2024
I was having the same issue in CentOS with a 64-core machine. This is how I fixed it:

Iniciar sesión para comentar.

Categorías

Más información sobre Parallel Computing Fundamentals en Help Center y File Exchange.

Productos


Versión

R2023b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by