Why do Nan values greatly increase the cost of sparse multiplication

Question

Jan Neggers el 22 de Oct. de 2025 a las 6:16

4
Enlazar

Enlace directo a esta pregunta

https://la.mathworks.com/matlabcentral/answers/2180779-why-do-nan-values-greatly-increase-the-cost-of-sparse-multiplication

Comentada: Matt J hace alrededor de 7 horas

Consider this code:

clear; close all;
n = 1e6;
m = 1e3;
s = 1e-3;
p = round(s^2 * n * m);
I = randperm(n, p);
J = randperm(m, p);
phi = sparse(I, J, 1, n, m);
Gx = randn(n, 1);
for k = 1:3
    tic
    L = Gx .* phi;
    toc
end
Gx(1) = NaN;
for k = 1:3
    tic
    L = Gx .* phi;
    toc
end

Now consider the output on my macbook pro M1 running Matlab 2024a (so through rosetta).

Elapsed time is 0.001591 seconds.

Elapsed time is 0.000291 seconds.

Elapsed time is 0.000282 seconds.

Elapsed time is 0.573762 seconds.

Elapsed time is 0.548177 seconds.

Elapsed time is 0.560880 seconds.

Notice the very large difference in computation times between the no-NaN version of Gx and the version that contains a single NaN value.

The solution is quite obvious, just remove the NaN values with your favorite method (isnan, isfinite, etc.). So this is not my question.

What intregues me is:

why the computation time changes so drastically?

2 comentarios
Mostrar NingunoOcultar Ninguno

Dyuman Joshi el 22 de Oct. de 2025 a las 7:05

Editada: Dyuman Joshi el 22 de Oct. de 2025 a las 7:20

Abrir en MATLAB Online

Just putting the results using timeit here -

n = 1e6;
m = 1e3;
s = 1e-3;
p = round(s^2 * n * m);
I = randperm(n, p);
J = randperm(m, p);
phi = sparse(I, J, 1, n, m);
Gx = randn(n, 1);
timeit(@() Gx.*phi)
ans = 6.0282e-04
Gx(1) = NaN;
timeit(@() Gx.*phi)
ans = 0.6131
%The version glitch is still here
version
ans = '25.2.0.3007325 (R2025b) Update 1'

The difference is of the same order, 1e3.

Jan Neggers el 22 de Oct. de 2025 a las 12:16

interesting, it seems a mix between the binary singleton expantion and the sparse that causes the issue, because when I make phi full the .* multiplication costs the same with and without the NaN value.

phi = full(phi);

Gx = randn(n, 1);

timeit(@() Gx.*phi)

Gx(1) = NaN;

timeit(@() Gx.*phi)

ans =

2.5634

ans =

2.3063

Iniciar sesión para comentar.

Iniciar sesión para responder a esta pregunta.

Answer 1

Matt J el 22 de Oct. de 2025 a las 11:51

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/2180779-why-do-nan-values-greatly-increase-the-cost-of-sparse-multiplication#answer_1571324

Editada: Matt J el 22 de Oct. de 2025 a las 12:10

Abrir en MATLAB Online

I think you should report it as a bug. If you need a workaround, though, the following equivalent operation, which avoids implicit exapnsion, seems to work fine,

n = 1e6;
m = 1e3;
s = 1e-3;
p = round(s^2 * n * m);
I = randperm(n, p);
J = randperm(m, p);
phi = sparse(I, J, 1, n, m);
Gx = spdiags(randn(n, 1),0,n,n);
timeit(@() Gx*phi)
ans = 5.6075e-04
Gx(1) = NaN;
timeit(@() Gx*phi)
ans = 4.3332e-04

2 comentarios
Mostrar NingunoOcultar Ninguno

Jan Neggers el 22 de Oct. de 2025 a las 12:26

I'll report it as a bug.

Matt J hace alrededor de 7 horas

Abrir en MATLAB Online

Unfortuantely, the workaround I've suggested is affected by a bug as well (which I am investigating here).

n = 1e6;
m = 1e3;
s = 1e-3;
p = round(s^2 * n * m);
I = randperm(n, p);
J = randperm(m, p);
phi = sparse(I, J, 1, n, m);
v=randi(100,n, 1); 
v(1)=nan;
vDiag = spdiags(v,0,n,n); %i.e. vDiag=sparse(diag(v))
anynan(vDiag*phi)  %incorrect
ans = logical
   0
anynan(v.*phi) %correct
ans = logical
   1

Iniciar sesión para comentar.

Answer 2

Steven Lord el 22 de Oct. de 2025 a las 13:05

0
Enlazar

Enlace directo a esta respuesta

https://la.mathworks.com/matlabcentral/answers/2180779-why-do-nan-values-greatly-increase-the-cost-of-sparse-multiplication#answer_1571326

Abrir en MATLAB Online

Let's look at a sample sparse matrix.

S = sprand(1e4, 1e4, 0.1);
status = @(M) fprintf("The matrix has %d non-zero elements, a density of %g.\n", nnz(M)./[1, numel(M)]);
status(S)
The matrix has 9516072 non-zero elements, a density of 0.0951607.

Multiplying S by four different values

Finite non-zero value

If we multiply S by a finite value, the result has the same number of non-zero elements as S.

tic
A1 = S.*2;
toc
Elapsed time is 0.073805 seconds.
status(A1)
The matrix has 9516072 non-zero elements, a density of 0.0951607.

2*0 is a 0, and that doesn't need to get explicitly stored in A1.

anynan(A1) % false
ans = logical
   0

Non-finite values

What happens if we multiply S by a non-finite value? Let's first look at multiplying by NaN.

tic
A2 = S.*NaN;
toc
Elapsed time is 0.667660 seconds.
status(A2)
The matrix has 100000000 non-zero elements, a density of 1.

NaN*0 is NaN, and that does get explicitly stored in A2. In fact, all the elements in A2 are NaN.

anynan(A2)
ans = logical
   1
nnz(isnan(A2))
ans = 100000000

What if we multiply by Inf instead?

tic
A3 = S.*Inf;
toc
Elapsed time is 0.662202 seconds.
status(A3)
The matrix has 100000000 non-zero elements, a density of 1.

Inf*0 is also NaN, and that gets explicitly stored in A3. Not all the elements in A3 are NaN, just the ones that were 0 in S since Inf times a positive finite value is Inf.

anynan(A3)
ans = logical
   1
NaNInA3 = nnz(isnan(A3))
NaNInA3 = 90483928
InfInA3 = nnz(isinf(A3))
InfInA3 = 9516072
NaNInA3 + InfInA3 == numel(A3)
ans = logical
   1

We can see that all the locations where A3 is Inf rather than NaN are the locations corresponding to non-zero values in S.

WereAllInfInA3WhereSWasNonzero = all(isinf(A3) == (S~=0), "all")
WereAllInfInA3WhereSWasNonzero = sparse logical
   (1,1)      1

Zero

And if we multiply S by 0:

tic
A4 = S.*0;
toc
Elapsed time is 0.013630 seconds.
status(A4)
The matrix has 0 non-zero elements, a density of 0.

The result has no non-zero elements.

0 times anything finite (0 or non-zero) is 0, and those don't need to get explicitly stored in A4.

anynan(A4) % false 
ans = logical
   0

Computing A4 is quicker than even computing A1.

Memory and performance comparison

You can see this by looking at how much memory each matrix consumes. The A2 and A3 sparse matrices are not sparsely populated, but they are stored using sparse storage. One term for this is Big Sparse. They'll actually consume more memory than they would if you'd stored them as full!

F2 = full(A2);
F3 = full(A3);
whos S A1 A2 A3 A4 F2 F3
  Name          Size                    Bytes  Class     Attributes

  A1        10000x10000             152337160  double    sparse    
  A2        10000x10000            1600080008  double    sparse    
  A3        10000x10000            1600080008  double    sparse    
  A4        10000x10000                 80024  double    sparse    
  F2        10000x10000             800000000  double              
  F3        10000x10000             800000000  double              
  S         10000x10000             152337160  double    sparse    

Roughly, A2 and A3 take 10 times as much memory as either A1 or S. They also take roughly 10 times as long to compute as the computation of A1. And A4 has no non-zero elements so all the memory it consumes is from the stored index information.

4 comentarios
Mostrar 2 comentarios más antiguosOcultar 2 comentarios más antiguos

Dyuman Joshi el 22 de Oct. de 2025 a las 14:11

Editada: Dyuman Joshi el 22 de Oct. de 2025 a las 14:14

Abrir en MATLAB Online

There still the question of why there is a difference of order 1e3.

The matrices generated in OP's code are much sparser than the ones you have used in the code above. Also, the memory taken by L after the operation including NaN is still comparable to the previous, the change being 1x.

n = 1e6;
m = 1e3;
s = 1e-3;
p = round(s^2 * n * m);
I = randperm(n, p);
J = randperm(m, p);
phi = sparse(I, J, 1, n, m);
Gx = randn(n, 1);
for k = 1:3
    tic
    L = Gx .* phi;
    toc
end
Elapsed time is 0.003486 seconds.
Elapsed time is 0.000978 seconds.
Elapsed time is 0.001280 seconds.
whos Gx L
  Name            Size                Bytes  Class     Attributes

  Gx        1000000x1               8000000  double              
  L         1000000x1000              24008  double    sparse    
Gx(1) = NaN;
for k = 1:3
    tic
    L = Gx .* phi;
    toc
end
Elapsed time is 0.636900 seconds.
Elapsed time is 0.613049 seconds.
Elapsed time is 0.616713 seconds.
whos Gx L
  Name            Size                Bytes  Class     Attributes

  Gx        1000000x1               8000000  double              
  L         1000000x1000              40008  double    sparse    

I wonder how does CPU/multithreading come into the picture with spare operations.

Jan Neggers hace alrededor de 10 horas

So this post (by Steven) is more or less in-line with what the bugreport / help center responded after closing the ticket.

Steven's post is helpful, as it reminded me that zero times nan is nan and thus non-zero and thus increases the memory of a sparse object.

However, I still feel that adding a single NaN should not amplify the computation that much, which is precisely highlighted by the answer given by Matt which shows that you can do the computation with the NaN in nearly equivalent time by switching from mtimes to times.

It still feels to me that there is some bug / non-consitent behaviour somewhere, but I leave the subject here.

Matt J hace alrededor de 10 horas

It still feels to me that there is some bug / non-consitent behaviour somewhere

Of course there is. Did you reply to them with the evidence developed here that the slow performance is unrelated to memory?

Iniciar sesión para comentar.

Why do Nan values greatly increase the cost of sparse multiplication

2 comentarios
Mostrar NingunoOcultar Ninguno

Respuesta aceptada

2 comentarios
Mostrar NingunoOcultar Ninguno

Más respuestas (1)

4 comentarios
Mostrar 2 comentarios más antiguosOcultar 2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

Why do Nan values greatly increase the cost of sparse multiplication

2 comentarios Mostrar NingunoOcultar Ninguno

Respuesta aceptada

2 comentarios Mostrar NingunoOcultar Ninguno

Más respuestas (1)

4 comentarios Mostrar 2 comentarios más antiguosOcultar 2 comentarios más antiguos

Ver también

Categorías

Etiquetas

Productos

Versión

Community Treasure Hunt

2 comentarios
Mostrar NingunoOcultar Ninguno

2 comentarios
Mostrar NingunoOcultar Ninguno

4 comentarios
Mostrar 2 comentarios más antiguosOcultar 2 comentarios más antiguos