diag(sparse matrix) should be a sparse vector? #411

StefanKarpinski · 2017-03-16T23:17:42Z

Currently, when you take the diagonal of a sparse matrix, you get a dense vector:

julia> S = sprand(10, 10, 0.1)
10×10 SparseMatrixCSC{Float64,Int64} with 10 stored entries:
  [3 ,  1]  =  0.284124
  [2 ,  2]  =  0.627361
  [9 ,  5]  =  0.339561
  [5 ,  6]  =  0.848555
  [1 ,  8]  =  0.538286
  [3 ,  8]  =  0.703678
  [5 ,  8]  =  0.0960039
  [8 ,  8]  =  0.00592429
  [5 ,  9]  =  0.170894
  [1 , 10]  =  0.324079

julia> diag(S)
10-element Array{Float64,1}:
 0.0
 0.627361
 0.0
 0.0
 0.0
 0.0
 0.0
 0.00592429
 0.0
 0.0

This should arguably be a sparse vector since for a generic sparse matrix, the diagonal is sparse.

andreasnoack · 2017-03-16T23:34:08Z

As usual with sparse matrices, it depends on the sparsity pattern. Sparse matrices from physical simulations would typically have a banded structure such that the diagonal is dense. For sparse matrix representations of graphs, it is probably less likely. We could do a survey based on Tim Davis' sparse matrix collection.

martinholters · 2017-03-17T13:46:46Z

Putting dense data into a SparseVector is a waste of memory, but only by a smallish factor, while putting sparse data into a Vector may be a gigantic waste of memory (although contrary to the situation with matrices, it is less likely to exceed physical memory). So returning a sparse result would be the safer choice, wouldn't it?

andreasnoack · 2017-03-17T14:22:07Z

I guess that argument applies to all arrays, i.e. even dense arrays. The safe choice that minimizes the worst-case relative memory waste for a dense matrix is to always produce a sparse vector. If the vector happens to have no zeros, there is a 2x waste. Returning a Vector if all the entries are zero is Inf x (except for the metadata) waste. Still, we think A[:,1] should be dense for A::Matrix because we believe that the entries typically will be nonzeros. So there is a trade-off that depends on what you consider a typical sparsity pattern. I don't have strong opinions, though. Just wanted to point out that this might be expensive for a large and very common class of sparse matrices.

JeffBezanson · 2017-03-17T17:39:05Z

I think dense is a reasonable choice here. My impression is that the sparse world generally considers O(n) size small enough.

mbauman · 2017-03-17T17:48:28Z

Given that our matrices are CSC, we're already storing a dense list of size O(n). The story might be different with a COO format.

StefanKarpinski · 2017-03-17T19:30:06Z

A SparseVector result would not be O(n).

tkelman · 2017-03-17T20:59:00Z

off-diagonals using the 2-arg form aren't usually expected to be dense the way the main diagonal will more commonly be. so there's a pretty good argument for returning a sparse vector from diag(A::SparseMatrixCSC, k), and then it would be a bit strange for diag(A) and diag(A, 0) to return different types

fredrikekre · 2017-07-31T08:58:21Z

julia> diag(A)
5-element Array{Float64,1}:
 0.774844
 0.0     
 0.443253
 0.0     
 0.0     

julia> diag(A,0)
5-element SparseVector{Float64,Int64} with 2 stored entries:
  [1]  =  0.774844
  [3]  =  0.44325

I got hit by this inconsistency in JuliaLang/julia#22925

StefanKarpinski · 2017-07-31T15:46:40Z

Would be great if a decision could be made here about which direction to go for consistency and make the different forms of diag match. We need a bat signal for linalg people...

andreasnoack · 2017-07-31T16:13:57Z

The current situation is not completely unreasonable, though, since the diagonal will often be quite dense whereas many off-diagonals will be very sparse but I can also see that it would be nice if the two methods returned the same type.

Following up on @martinholters comment, I'd argue that the safest choice would be the one with the smallest absolute worst case memory usage. It is not the only concern or necessarily the main concern but I'd also argue that diag(A) is much more common than diag(A, k!=0).

Bottomline is that we disagree. Specifying a return type seems like a general problem. Maybe a solution could be to define spdiag?

mbauman · 2017-07-31T20:10:37Z

Possible alternative: make diag be always dense and make it easier to construct diagonal views either through an indexing syntax or some other function. That'd give an O(n) memory solution with fast access and an O(1) memory solution with (deferred) access complexity based on the source array.

tkelman · 2017-07-31T20:19:22Z

diagonal views would be a good idea, but indexing into that would be more expensive than indexing into an eager sparse or dense diagonal slice (copy)

StefanKarpinski · 2017-08-01T18:04:25Z

diag(A) should behave the same as diag(A,0) – I don't care if that's sparse or dense, but it should be consistent.

StefanKarpinski · 2017-08-03T19:45:02Z

@tkelman makes the argument that if we go with always sparse:

It will handle the typical off-diagonal case well;
The potential upside – ratio of dense memory used to sparse memory used – is unbounded;
The potential downside – the inverse ratio – is at most 2x, which is still not bad;
Preserves potentially significant information about structural storage pattern – there's no way to recover this easily if we default to returning dense diagonals.

When someone writes full(diag(A)) we could potentially eventually optimize away the creation of the sparse vector and just write entries directly to the dense output vector without changing any semantics, giving the best of both worlds. The optimization in the other direction is much harder.

Resolved: always return a sparse vector here.

andreasnoack · 2017-08-03T20:08:02Z

Might be worth browsing through the thumbnails at https://www.cise.ufl.edu/research/sparse/matrices/list_by_id.html to get an idea of typical density patterns of diagonals of typical sparse matrices.

StefanKarpinski · 2017-08-03T23:19:58Z

Yes, it's definitely true that the diagonal is often dense. Of course, if O(n) storage is ok for your data (as opposed to O(n^2)) then 2n is still ok, and that's the worst case for a dense diagonal. I would add that in the applications where I've used sparse matrices a lot, the diagonal is almost never dense (they're also usually not square, which may be related).

tkelman · 2017-08-10T17:20:13Z

Looks like diag(::SparseMatrixCSC, ::Integer) is currently dispatching to the AbstractArray method here https://github.com/JuliaLang/julia/blob/8c7f01e974de7ae779f22343086fdd400efe16f2/base/linalg/dense.jl#L279

so might need to generalize the SpDiagIterator here https://github.com/JuliaLang/julia/blob/8c7f01e974de7ae779f22343086fdd400efe16f2/base/sparse/sparsematrix.jl#L3385-L3402

to handle off-diagonals. Haven't looked at it very closely but I think that shouldn't be too complicated?

andreasnoack · 2017-08-10T17:28:44Z

Just tested this and deleting https://github.com/JuliaLang/julia/blob/8c7f01e974de7ae779f22343086fdd400efe16f2/base/sparse/sparsematrix.jl#L3415 seems to be all that is required.

tkelman · 2017-08-10T17:50:15Z

That would fix the return type, but the fallback is probably less efficient? Worth comparing.

ararslan added linear algebra sparse Sparse arrays labels Mar 17, 2017

JeffBezanson assigned fredrikekre Aug 10, 2017

fredrikekre mentioned this issue Aug 14, 2017

diag of SparseMatrixCSC should always return SparseVector JuliaLang/julia#23261

Merged

fredrikekre closed this as completed in JuliaLang/julia#23261 Aug 18, 2017

Sacha0 mentioned this issue Nov 26, 2024

diag specializations for structured matrices yield dense vectors #478

Closed

KristofferC transferred this issue from JuliaLang/julia Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

diag(sparse matrix) should be a sparse vector? #411

diag(sparse matrix) should be a sparse vector? #411

StefanKarpinski commented Mar 16, 2017

andreasnoack commented Mar 16, 2017

martinholters commented Mar 17, 2017

andreasnoack commented Mar 17, 2017

JeffBezanson commented Mar 17, 2017

mbauman commented Mar 17, 2017

StefanKarpinski commented Mar 17, 2017

tkelman commented Mar 17, 2017 •

edited

Loading

fredrikekre commented Jul 31, 2017

StefanKarpinski commented Jul 31, 2017

andreasnoack commented Jul 31, 2017

mbauman commented Jul 31, 2017

tkelman commented Jul 31, 2017

StefanKarpinski commented Aug 1, 2017

StefanKarpinski commented Aug 3, 2017 •

edited

Loading

andreasnoack commented Aug 3, 2017

StefanKarpinski commented Aug 3, 2017 •

edited

Loading

tkelman commented Aug 10, 2017 •

edited

Loading

andreasnoack commented Aug 10, 2017

tkelman commented Aug 10, 2017

diag(sparse matrix) should be a sparse vector? #411

diag(sparse matrix) should be a sparse vector? #411

Comments

StefanKarpinski commented Mar 16, 2017

andreasnoack commented Mar 16, 2017

martinholters commented Mar 17, 2017

andreasnoack commented Mar 17, 2017

JeffBezanson commented Mar 17, 2017

mbauman commented Mar 17, 2017

StefanKarpinski commented Mar 17, 2017

tkelman commented Mar 17, 2017 • edited Loading

fredrikekre commented Jul 31, 2017

StefanKarpinski commented Jul 31, 2017

andreasnoack commented Jul 31, 2017

mbauman commented Jul 31, 2017

tkelman commented Jul 31, 2017

StefanKarpinski commented Aug 1, 2017

StefanKarpinski commented Aug 3, 2017 • edited Loading

andreasnoack commented Aug 3, 2017

StefanKarpinski commented Aug 3, 2017 • edited Loading

tkelman commented Aug 10, 2017 • edited Loading

andreasnoack commented Aug 10, 2017

tkelman commented Aug 10, 2017

tkelman commented Mar 17, 2017 •

edited

Loading

StefanKarpinski commented Aug 3, 2017 •

edited

Loading

StefanKarpinski commented Aug 3, 2017 •

edited

Loading

tkelman commented Aug 10, 2017 •

edited

Loading