parallel code execution on MATLAB cluster

As I run a code on a cluster using spmd, sometimes a worker gets disconnected and the execution stops. In another instance, the job became 'queued' after running for multiple hours and then eventually the execution stopped. What could be potential reasons for these?

1 comentario

Are you using Linux? Could you cofirm the maximum process is sufficient?
ulimit -a

Iniciar sesión para comentar.

Respuestas (0)

Categorías

Más información sobre MATLAB Parallel Server en Centro de ayuda y File Exchange.

Etiquetas

Preguntada:

el 10 de En. de 2018

Comentada:

el 11 de En. de 2018

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by