
sparse and transpose on sparse matrices could be faster #12998

Closed
KristofferC opened this issue Sep 7, 2015 · 10 comments
Labels: performance (Must go faster), potential benchmark (Could make a good benchmark in BaseBenchmarks), sparse (Sparse arrays)

Comments

@KristofferC
Member

I thought this was worth a separate issue.

In #12981 @mb1234 posted a short comparison script to benchmark a few methods on sparse matrices in Julia, Octave, and Matlab. The script that produced the table below can be found in #12981 (comment), except that I added an explicit call to gc() between method invocations. Each entry is the average of 5 runs, with the standard deviation in parentheses.

| Func  | Julia               | Octave              | Matlab (single thread) |
|-------|---------------------|---------------------|------------------------|
| sparse| 2.478101 (0.161879) | 1.233007 (0.098839) | 2.678221 (0.104867)    |
| 2*A   | 0.052062 (0.000805) | 0.120093 (0.042131) | 0.070358 (0.001547)    |
| A'    | 1.131908 (0.010331) | 0.810197 (0.075346) | 1.159815 (0.246967)    |
| A+B   | 0.246413 (0.001277) | 0.487021 (0.088988) | 0.202465 (0.000276)    |
| A*x   | 0.169672 (0.003784) | 0.340456 (0.009046) | 0.162381 (0.016590)    |
| A'*x  | 0.146576 (0.003835) | 1.191680 (0.099767) | 0.135291 (0.003267)    |

In general, we are doing really well! However, we can see that for sparse and transpose we still have some way to go to match Octave. Right now, sparse builds a CSR matrix and transposes it at the end. It should be possible to construct the CSC matrix directly and avoid the transpose.

For transpose, I looked at the code and I am not sure what could be improved. Does anyone have any ideas here?
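
For reference, transposing a CSC matrix is commonly done with a counting-sort style scatter over the columns. The sketch below is only an illustration of that idea on plain (colptr, rowval, nzval) arrays, not the actual Base implementation:

# Illustrative counting-sort transpose of an m-by-n CSC matrix stored as
# (colptr, rowval, nzval); sketch only, not Base's implementation.
function csc_transpose(m, n, colptr, rowval, nzval)
    nnzA = colptr[n + 1] - 1
    Tcolptr = zeros(Int, m + 1)
    Trowval = zeros(Int, nnzA)
    Tnzval  = zeros(eltype(nzval), nnzA)

    # Count the entries in each row of A, i.e. in each column of A'.
    for k in 1:nnzA
        Tcolptr[rowval[k] + 1] += 1
    end
    Tcolptr[1] = 1
    for i in 2:m + 1                  # prefix sum -> column pointers of A'
        Tcolptr[i] += Tcolptr[i - 1]
    end

    # Scatter every entry into its column of A'; `next` tracks the next free
    # slot of each output column. Iterating the columns of A in order keeps
    # the row indices of A' sorted without any extra sorting step.
    next = copy(Tcolptr)
    for j in 1:n, k in colptr[j]:(colptr[j + 1] - 1)
        r = rowval[k]
        p = next[r]
        Trowval[p] = j
        Tnzval[p]  = nzval[k]
        next[r] += 1
    end
    return Tcolptr, Trowval, Tnzval
end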

@tkelman added the performance (Must go faster) and sparse (Sparse arrays) labels Sep 7, 2015
@KristofferC
Member Author

Maybe Octave doesn't sum duplicates in its sparse?

@andreasnoack
Member

It appears that they do.

octave:1> sparse([1,1], [1,1], [1,1])
ans = Compressed Column Sparse (rows = 1, cols = 1, nnz = 1 [100%])

  (1, 1) ->  2
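
For comparison, Julia's sparse combines duplicate (i, j) entries in the same way, summing them by default:

A = sparse([1, 1], [1, 1], [1, 1])  # needs `using SparseArrays` on Julia >= 0.7; in Base at the time
A[1, 1]                             # 2 -- the duplicates were added together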

@ViralBShah
Member

The reason for creating a CSR matrix and transposing it is that we get everything in sorted order. For direct CSC construction, you need an explicit sorting step. Any idea what Octave does? Also, it may be worth testing a few different cases for the constructor.
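
To illustrate the trade-off, a direct CSC construction along these lines has to sort the triplets by (column, row) before the column pointers can be built. A minimal sketch (illustration only, without combining duplicates):

# Sketch of direct CSC construction with an explicit sort; n is the number
# of columns. Duplicate (i, j) entries are not combined here.
function sparse_sorted(I, J, V, n)
    # Sort the triplets by column first, then row, so the CSC arrays come out ordered.
    p = sortperm(collect(zip(J, I)))
    rowval = I[p]
    nzval  = V[p]
    # Build the column pointers from the per-column counts.
    colptr = zeros(Int, n + 1)
    colptr[1] = 1
    for j in J
        colptr[j + 1] += 1
    end
    for j in 2:n + 1
        colptr[j] += colptr[j - 1]
    end
    return colptr, rowval, nzval
end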

@ViralBShah
Member

Related: see #13400 for a pathological case caused by the CSR construction.

@KristofferC
Member Author

I took a stab at improving our sparse but failed. My approach was to use a bucket for each column and sort the rows into it, using a linear search through each bucket to insert each row at the proper place (a rough sketch of the idea is included after the timings below).

Here is my attempt: https://gist.github.com/KristofferC/554badaa7b9f494f45ee

And results:

n = 10^5
sparsity = 20 / n
accums = 5
nzs_1 = repmat(rand(1:n, Int(sparsity * n^2)), accums)
nzs_2 = repmat(rand(1:n, Int(sparsity * n^2)), accums)
vals = rand(Float64, length(nzs_1))

A2 = @time sparse2(nzs_1, nzs_2, vals)
#   5.090254 seconds (970.25 k allocations: 152.123 MB, 3.83% gc time)
A = @time sparse(nzs_1, nzs_2, vals)
#   1.284557 seconds (23 allocations: 186.918 MB, 3.46% gc time)
A == A2
# true

Maybe someone can find an obvious way to speed up my attempt.
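
For readers who don't want to open the gist, here is a rough reconstruction of the bucket-per-column idea (illustration only; the gist above has the actual code):

# One sorted vector of (row, value) pairs per column; rows are inserted with
# a linear search and duplicates are summed on the fly. n is the column count.
function sparse_buckets(I, J, V, n)
    buckets = [Tuple{Int, eltype(V)}[] for _ in 1:n]
    for k in eachindex(I)
        b, r, v = buckets[J[k]], I[k], V[k]
        pos = 1
        while pos <= length(b) && b[pos][1] < r   # linear search for the insertion slot
            pos += 1
        end
        if pos <= length(b) && b[pos][1] == r
            b[pos] = (r, b[pos][2] + v)           # sum duplicate entries
        else
            insert!(b, pos, (r, v))               # keep the bucket row-sorted
        end
    end
    # Assemble the CSC arrays from the buckets.
    colptr = ones(Int, n + 1)
    for j in 1:n
        colptr[j + 1] = colptr[j] + length(buckets[j])
    end
    rowval = [rv[1] for b in buckets for rv in b]
    nzval  = [rv[2] for b in buckets for rv in b]
    return colptr, rowval, nzval
end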

@mauro3
Contributor

mauro3 commented Oct 1, 2015

Here is a post on binary search vs. linear search: https://schani.wordpress.com/2010/04/30/linear-vs-binary-search/

@KristofferC
Member Author

Cool, I'll play around with some techniques there to see if I can get the search faster.

@JuliaLang JuliaLang locked and limited conversation to collaborators Oct 2, 2015
@JuliaLang JuliaLang unlocked this conversation Oct 2, 2015
@StefanKarpinski
Member

I have deleted @ScottPJones's comments and all of the back-and-forth that wasted everyone's time (@KristofferC's, primarily), and unlocked the issue so that we can make forward progress on it. @ScottPJones, you may not post here. If you do, you will be blocked from JuliaLang for a week.

@ViralBShah
Member

@KristofferC I used insertion-sort-based logic a long time back and switched to the current approach since it was faster.

@KristofferC
Member Author

We are pretty much on par with MATLAB now, so I don't think there is a need to keep this issue open.
