why is mex parfor slower them mex for?

Josef Shrbeny

29 Ag. 2018

1 Respuesta

Respuesta aceptada

Actualizado a las 3 Sept. 2018

8 Visualizaciones (30 días)

Iniciar sesión para responder a esta pregunta.

Follow Question

Iniciar sesión para responder a esta pregunta.

Follow Question

Mostrar comentarios más antiguos

Abrir en MATLAB Online

0 votos

I am starting to work with the Parallel Computing Toolbox, and just constructed an FIR filter example to compare for and parfor

coefs =  [-0.00393617608745112 -5.95945405003999e-05...] length 1x10498
values = [30.3750000000000 30.3760000000000...] length 1x131000
tic; 
outVal = FIRMP(coefs,values);  
%outVal = FIRMP_mex(coefs,values);  
time = toc;

with function FIRMP

function [result] = FIRMP(coefs, values)
  coefLen = length(coefs);
  valLen = length(values);
  result = zeros(size(values));
  (par)for I = 1 : valLen - coefLen;
    suma = 0;
    for J = 1 : coefLen
      suma = suma + coefs(J)*values(I + J);
    end
    result(I) = suma;
  end
end

I used 4 threads and got this results

for   : time= 13.5s
parfor: time = 5.5s

It is OK, but if I create C++ mex (matlab CODER) and run again, the result has changed

for   : time = 3.1s
parfor: time = 4.3s

why is the 'parfor' in C++ mex slower than 'for'?

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Follow Question

Respuesta aceptada

Ryan Livingston el 29 de Ag. de 2018

Editada: Ryan Livingston el 29 de Ag. de 2018

Abrir en MATLAB Online

1 voto

When I try your example on Linux (Debian 9) using GCC I see a good speedup with parfor in generated MEX:

for    : time = 1.3s
parfor : time = 0.4s

On Windows 10 using Microsoft Visual Studio 2017, I see a much more modest speedup:

for    : time = 1.3s
parfor : time = 1.0s

What compiler and OS are you using?

One thing that may be happening for certain compilers is that each of the parfor loop iterations are very fast. When this is the case, the overhead of managing threads can dominate the loop execution time. This can ruin any possible parallelism gains.

The Coder documentation covers this in some detail:

https://www.mathworks.com/help/coder/ug/acceleration-of-matlab-algorithms-using-parallel-for-loops-parfor.html#btfitu2-2

as does the MATLAB parfor documentation:

https://www.mathworks.com/help/distcomp/decide-when-to-use-parfor.html

6 comentarios
Mostrar 4 comentarios más antiguos Ocultar 4 comentarios más antiguos

Josef Shrbeny el 2 de Sept. de 2018

Ryan, Using variables 1 x :n instead of 1 x :inf (both input and local) solved the problem

Now, the 'parfor mex C++' is about 3x faster than 'for mex C++'. (4 threads) Thank you for all your help. You gave me very useful tips and links.

Ryan Livingston el 3 de Sept. de 2018

You're welcome Josef. Glad to hear you found a solution.

Iniciar sesión para comentar.

Más respuestas (0)

Iniciar sesión para responder a esta pregunta.

Categorías

Más información sobre MATLAB Support for MinGW-w64 C/C++ Compiler en Centro de ayuda y File Exchange.

Productos

Versión

R2016b

Etiquetas

Preguntada:

Josef Shrbeny

el 29 de Ag. de 2018

Comentada:

Ryan Livingston

el 3 de Sept. de 2018

Aceptada:

Ryan Livingston

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

why is mex parfor slower them mex for?

0 comentarios Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

Respuesta aceptada

6 comentarios Mostrar 4 comentarios más antiguos Ocultar 4 comentarios más antiguos

Más respuestas (0)

Categorías

Productos

Versión

Etiquetas

Ver también

Community Treasure Hunt

0 comentarios
Mostrar -2 comentarios más antiguos Ocultar -2 comentarios más antiguos

6 comentarios
Mostrar 4 comentarios más antiguos Ocultar 4 comentarios más antiguos