
Faster blockdiag for uniform input value-index types #36013

Merged · 7 commits · May 27, 2020

Conversation

@irhum (Contributor) commented May 24, 2020

The current blockdiag implementation in SparseArrays uses type promotion to determine the value and index types of the output sparse matrix.

This can be quite slow when blockdiag is called on many small sparse arrays (say, 2500 or more). If all of the input sparse arrays share the same value and index types, there is no need to compute those types via promotion, and we can proceed directly to constructing the new matrix.
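For instance (an illustrative example, not taken from the PR itself):

using SparseArrays
A = sprand(10, 10, 0.1)           # SparseMatrixCSC{Float64, Int64} on 64-bit systems
B = sprand(Float32, 10, 10, 0.1)  # SparseMatrixCSC{Float32, Int64}
blockdiag(A, A)  # uniform value/index types: the output types are known up front
blockdiag(A, B)  # heterogeneous types: the output value type must be found by promotion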

This can provide significant speedups; for M generated with

M = [sprand(rand(15:50), rand(15:50), 0.02) for i in 1:2500]

and benchmarking the existing implementation with BenchmarkTools

@benchmark output = blockdiag(M...)

we get

BenchmarkTools.Trial: 
  memory estimate:  192.35 MiB
  allocs estimate:  37956
  --------------
  minimum time:     1.404 s (1.30% GC)
  median time:      1.434 s (1.31% GC)
  mean time:        1.438 s (1.24% GC)
  maximum time:     1.482 s (0.89% GC)
  --------------
  samples:          4
  evals/sample:     1

Running the same benchmark, with the same M, against the new implementation, we get a speedup of nearly 1000x:

@benchmark output = blockdiag(M...)
  memory estimate:  3.00 MiB
  allocs estimate:  5019
  --------------
  minimum time:     1.072 ms (0.00% GC)
  median time:      1.162 ms (0.00% GC)
  mean time:        1.386 ms (11.86% GC)
  maximum time:     5.059 ms (60.41% GC)
  --------------
  samples:          3591
  evals/sample:     1

The blockdiag implementation is hence split into three methods:

blockdiag(X::AbstractSparseMatrixCSC...) # for the general case
blockdiag(X::AbstractSparseMatrixCSC{Tv, Ti}...) where {Tv, Ti <: Integer} # for increased speed when all input X have the same index and value types
blockdiag(::Type{Tv}, ::Type{Ti}, X::AbstractSparseMatrixCSC...) where {Tv, Ti <: Integer} # internally called by the two methods above, and available to users dealing with heterogeneous inputs who wish to specify the output index and value types manually for the same speed benefit
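As a rough sketch of how these methods could relate (illustrative only, not the merged code; the promotion expressions are assumed from the discussion further below):

# Uniform case: the value/index types are known from the signature, so no promotion is needed.
blockdiag(X::AbstractSparseMatrixCSC{Tv, Ti}...) where {Tv, Ti <: Integer} =
    blockdiag(Tv, Ti, X...)

# General case: compute the output types by promotion, then construct.
function blockdiag(X::AbstractSparseMatrixCSC...)
    Tv = promote_type(map(x -> eltype(nonzeros(x)), X)...)
    Ti = promote_type(map(x -> eltype(rowvals(x)), X)...)
    blockdiag(Tv, Ti, X...)
end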

@irhum (Contributor, Author) commented May 24, 2020

Added another method to handle the empty input case, blockdiag(); otherwise a call with no arguments dispatches to

blockdiag(X::AbstractSparseMatrixCSC{Tv, Ti}...) where {Tv, Ti <: Integer}

instead of

blockdiag(X::AbstractSparseMatrixCSC...)

resulting in a Tv not defined error, making an explicit definition necessary.
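A minimal analogy of the dispatch issue just described, using a hypothetical function f rather than blockdiag:

f(x::Vector{T}...) where {T} = T   # stands in for the {Tv, Ti} method
f(x::AbstractVector...) = Any      # stands in for the general method

f([1], [2])  # Int: the more specific first method wins
f()          # also hits the first method, but T is never bound,
             # so this throws UndefVarError: T not defined
f() = 0      # an explicit zero-argument method avoids the error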

I would suggest this pull request is important because the case where all inputs share the same value and index types is likely more common (and benefits from the increased performance) than the heterogeneous case.

@dkarrasch added the domain:arrays:sparse (Sparse arrays) label on May 24, 2020
@dkarrasch (Member) left a comment

Very nice contribution, @irhum, and welcome! I have one comment that I'm curious what other people think about.

stdlib/SparseArrays/src/sparsematrix.jl (inline review thread, outdated)
Co-authored-by: Daniel Karrasch <[email protected]>
@dkarrasch (Member) commented

You'll need to rename the function at the call sites as well.

@irhum (Contributor, Author) commented May 24, 2020

The function call sites have been changed to make the appropriate _blockdiag call.

Since we also need to account for the case where no inputs are given, a clean solution seems to be to pass it as an optional argument to one of the methods, specifically:

blockdiag(X::AbstractSparseMatrixCSC{Tv, Ti}...=spzeros(0,0)) where {Tv, Ti <: Integer}

Since we're explicitly handling it, isempty(X) in

Ti = isempty(X) ? Int : promote_type(map(x->eltype(rowvals(x)), X)...)

is always false, and hence can be replaced with

Ti = promote_type(map(x->eltype(rowvals(x)), X)...)

Opinions?

@irhum requested a review from dkarrasch on May 25, 2020, 21:34
@dkarrasch (Member) left a comment

Looks good in principle, but there is this subtle issue with empty arguments.

@@ -3318,16 +3318,23 @@ julia> blockdiag(sparse(2I, 3, 3), sparse(4I, 2, 2))
⋅ ⋅ ⋅ ⋅ 4
```
"""
function blockdiag(X::AbstractSparseMatrixCSC{Tv, Ti}...=spzeros(0,0)) where {Tv, Ti <: Integer}
Member commented:

This default value makes the function change behavior for empty arguments. Currently, we have

julia> A = blockdiag()
0×0 SparseMatrixCSC{Union{},Int64} with 0 stored entries

julia> B = spzeros(0,0)
0×0 SparseMatrixCSC{Float64,Int64} with 0 stored entries

Changing that behavior may be for the better, but I'm not sure. This definitely requires a broader discussion, I'd say.

@irhum (Contributor, Author) commented May 26, 2020

I would rather not change the behavior inside this pull request anyway; the goal here is to provide increased speed when dealing with CSC arrays that share the same index and value types. If needed, a separate issue can be opened on whether the default should be changed.

Created a new commit that fixes this (the output is now SparseMatrixCSC{Union{},Int64}) by explicitly spelling out the blockdiag() case, to make it clear what it does today and to avoid any confusion in the future.

@irhum requested a review from dkarrasch on May 26, 2020, 17:02
@@ -3318,16 +3318,25 @@ julia> blockdiag(sparse(2I, 3, 3), sparse(4I, 2, 2))
⋅ ⋅ ⋅ ⋅ 4
```
"""
blockdiag() = spzeros(Union{}, Int, 0, 0)
Member commented:

I'd suggest having

Suggested change:
- blockdiag() = spzeros(Union{}, Int, 0, 0)
+ blockdiag() = spzeros(promote_type(), Int, 0, 0)

to avoid hardcoding some "non-sense" type and let the ecosystem decide. Pinging @tkf for this nasty "reduction over empty tuples" issue.

@irhum (Contributor, Author) commented May 27, 2020

Thanks for pointing this out; it is more sensible this way. I believe that resolves the issue for now of speeding up this function while making sure the blockdiag() output remains unchanged. I'd appreciate hearing what @tkf has to say about the reduction-over-empty-tuples issue.

Member commented:

I think Union{} makes sense (i.e., it's the identity element of promote_type as a monoid). Though promote_type() also sounds fine. It's just a LISPy way of saying that we want the identity of promote_type.
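For reference (a quick REPL illustration, not part of the original thread):

julia> promote_type()
Union{}

julia> promote_type(Union{}, Float64)  # Union{} acts as the identity element
Float64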

Member commented:

Thanks, @tkf. I just had no idea what might be subject to potential future design changes and what is "set in stone". I'll go ahead and merge this as is then.

@dkarrasch added the performance (Must go faster) label on May 27, 2020
@dkarrasch merged commit d0b2be1 into JuliaLang:master on May 27, 2020
simeonschaub pushed a commit to simeonschaub/julia that referenced this pull request Aug 11, 2020
Labels: domain:arrays:sparse (Sparse arrays), performance (Must go faster)
3 participants