How to parallelize the solution of sparse matrices using mldivide

Question

I am trying to parallelize the solution of

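x = A\B (mldivide).

My variables are: x = V (a vector of length I*J*K), A (an I*J*K x I*J*K sparse matrix), and the right-hand side vec = u (I x J x K) + V/constant + Bswitch*V, where Bswitch is I*J*K x I*J*K. (The third dimension K is written as L in the code below.)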

To do this without parallelization, my code currently does this:

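u_stacked = reshape(u,I*J*L,1);                          % stack u into a column vector
V_stacked = reshape(V,I*J*L,1);                          % stack V the same way
vec = u_stacked + V_stacked/Delta + Bswitch*V_stacked;   % right-hand side
V_stacked = A\vec;                                       % one sparse direct solve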

To parallelize, I have tried:

u_stacked = reshape(u,I*J,L);
V_stacked = reshape(V,I*J,L);
BswitchTimesVstacked = Bswitch*reshape(V,I*J*L,1);
BswitchTimesVstacked = reshape(BswitchTimesVstacked,I*J,L);
vec = u_stacked + V_stacked/Delta + BswitchTimesVstacked;
tic
parfor l = 1:L
    V_stacked(:,l) = A(:,:,l)\vec(:,l);
end

But as A is still I*J*L times I*J*L, it won't work. I am not sure (1) whether what I am doing so far is correct and (2) how to reshape B appropriately.

Mathematical info is here: http://www.princeton.edu/~moll/HACTproject/two_asset_kinked.pdf (section 4)

Any help is highly appreciated :-)

4 comments

Heiko Weichelt on 1 Mar 2019

Hi Emil

In order to give you the most helpful answer, let me ask you a few clarifying questions:

What is your motivation for trying to parallelize the sparse MLDIVIDE step?
What is your expected benefit?
What sizes will you use (smallest and largest)?
Do you have a GPU or a cluster available?

More specific to the code you provided:

Are x = V (I*J*K x I*J*K) and Bswitch (I*J*K x I*J*K) dense or sparse?
If V is I*J*K x I*J*K and you use V_stacked = reshape(V,I*J*L,1), I assume you meant V_stacked = reshape(V,(I*J*L)^2,1)?

In general, note that sparse MLDIVIDE already uses multi-threading, which means the algorithm itself performs some steps in parallel. Usually, as long as the matrix fits in memory and MLDIVIDE's additional memory usage (which can be large compared to the original sparse matrix due to fill-in) can be handled by the available RAM, it is hard to write a faster algorithm yourself.

Also see https://www.mathworks.com/matlabcentral/answers/95958-which-matlab-functions-benefit-from-multithreaded-computation for more information about multi-threading.
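To get a feel for the fill-in mentioned above, here is a minimal sketch that compares the number of nonzeros in a sparse LU factorization with that of the original matrix, and times a single solve. The test matrix (a 2-D Laplacian built with delsq and numgrid) and the size n are assumptions chosen only for illustration; they are not the matrices from the question, and mldivide may pick a different factorization internally.

```matlab
% Illustrative only: the matrix and its size are hypothetical.
n = 100;                               % grid points per side (assumed)
A = delsq(numgrid('S', n + 2));        % sparse n^2-by-n^2 2-D Laplacian
b = ones(size(A, 1), 1);

% Sparse LU with row and column permutations: P*A*Q = Lf*Uf
[Lf, Uf, P, Q] = lu(A);
fillRatio = (nnz(Lf) + nnz(Uf)) / nnz(A);   % > 1 means the factors are denser than A

% Time one direct solve and check the relative residual
tic;
x = A \ b;
tSolve = toc;
relRes = norm(A*x - b) / norm(b);

fprintf('threads: %d, fill ratio: %.1f, solve time: %.3g s, residual: %.2g\n', ...
        maxNumCompThreads, fillRatio, tSolve, relRes);
```

Running this for the sizes you actually use gives a rough idea of whether memory from fill-in, rather than compute time, is the limiting factor.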

Best,

Heiko

Matt J on 1 Mar 2019

V_stacked(:,l) = A(:,:,l)\vec(:,l);

If A is sparse, how in the above line did you reshape it into a 3D array so as to make it 3D indexable?

Emil Partsch on 1 Mar 2019

Edited: Emil Partsch on 1 Mar 2019

I didn't - I made a mistake in the last sentence, which I have just edited. It should have said "But as A is still I*J*L times I*J*L, it won't work."

The reason I'm doing it this way is that I am trying to replicate how it's done here, starting at line 120: https://github.com/ikarib/HANK/blob/master/mycode/HJBUpdate.m

Answer 1

Matt J on 4 Mar 2019

You can make a 3D stack of sparse matrices A(:,:,l) by converting them to the ndSparse type. Then the parfor construct will work. Alternatively, you can build a block-diagonal matrix whose diagonal blocks are the A(:,:,l). You can then solve all the systems simultaneously, which takes advantage of MATLAB's internal parallelization. I don't know which would be faster.
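A minimal sketch of the two approaches, under stated assumptions: the blocks here are small tridiagonal matrices invented for illustration (not the HJB matrices from the question), n stands in for I*J, and a cell array is used in place of ndSparse so that the example needs no File Exchange download. Without Parallel Computing Toolbox, the parfor loop simply runs serially.

```matlab
% Hypothetical sizes and blocks, for illustration only.
n = 50;                      % stands in for I*J (assumed)
L = 4;                       % number of independent systems (assumed)
e = ones(n, 1);

% One sparse n-by-n block per l; strictly diagonally dominant, so nonsingular.
Ablocks = cell(1, L);
for l = 1:L
    Ablocks{l} = spdiags([-e, (2 + l)*e, -e], -1:1, n, n);
end
vec = rand(n, L);            % column l is the right-hand side for block l

% Approach 1: assemble one sparse block-diagonal matrix and solve once.
Abig = blkdiag(Ablocks{:});              % sparse, (n*L)-by-(n*L)
V_block = reshape(Abig \ vec(:), n, L);

% Approach 2: solve each block separately in a parfor loop.
V_par = zeros(n, L);
parfor l = 1:L
    V_par(:, l) = Ablocks{l} \ vec(:, l);
end

% Both approaches should agree up to rounding error.
maxDiff = max(abs(V_block(:) - V_par(:)));
fprintf('max difference between the two approaches: %.2g\n', maxDiff);
```

Both variants only apply if A really decouples into L independent blocks; if the full I*J*L by I*J*L matrix has entries linking different l, neither splitting is valid.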

5 comments

Matt J on 4 Mar 2019

Was it faster when you used parfor? I would not expect so.

Emil Partsch on 5 Mar 2019

Edited: Emil Partsch on 5 Mar 2019

It depends on the sizes of I, J and K. For the sizes I originally used, it didn't really matter.

Answer 2

Matt J on 28 Feb 2019

Edited: Matt J on 28 Feb 2019

I recommend using

V_stacked = pagefun(@mldivide,gpuArray(A),gpuArray(vec));

1 comment

Emil Partsch on 1 Mar 2019

Doing that, I get the error that sparse gpuArrays are not supported for mldivide :-)
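Since the error comes from the sparse inputs, one possible workaround, only if each block is small enough to store densely, is to pass dense pages to pagefun. The sketch below is an assumption-laden illustration: the blocks and sizes are invented, canUseGPU requires R2019b or later, and the GPU branch needs Parallel Computing Toolbox and a supported GPU. When no GPU is available it falls back to a plain loop on the CPU.

```matlab
% Hypothetical dense blocks, for illustration only.
n = 20;                      % stands in for I*J (assumed)
L = 3;                       % number of pages (assumed)
e = ones(n, 1);

Adense = zeros(n, n, L);     % dense n-by-n-by-L stack, one page per system
for l = 1:L
    Adense(:, :, l) = full(spdiags([-e, (2 + l)*e, -e], -1:1, n, n));
end
vec = rand(n, L);
rhs = reshape(vec, n, 1, L); % one n-by-1 right-hand side per page

if canUseGPU()
    % Batched dense solves on the GPU, one per page
    Vpages = gather(pagefun(@mldivide, gpuArray(Adense), gpuArray(rhs)));
else
    % CPU fallback: same computation, page by page
    Vpages = zeros(n, 1, L);
    for l = 1:L
        Vpages(:, :, l) = Adense(:, :, l) \ rhs(:, :, l);
    end
end
V_stacked = reshape(Vpages, n, L);

% Largest residual over all pages
maxRes = 0;
for l = 1:L
    maxRes = max(maxRes, norm(Adense(:, :, l) * V_stacked(:, l) - vec(:, l)));
end
```

Dense storage costs n^2 entries per page, so this is unlikely to pay off for large I*J.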
