Enable find on SparseVectors #15747

pranavtbhat · 2016-04-02T19:29:03Z

There seem's to be a discrepancy in the Sparse Matrix interface. Splicing an entire row/column of a sparse matrix yields a sparse vector, and the "find" method isn't defined on sparse vectors. So basically I can't do something like this:

a = sprand(10, 10, 0.3)
find(a[:,1])

But this does work with "full" arrays. Defining find to return the nzind field fixes this problem.

tkelman · 2016-04-02T19:32:09Z

The equivalent function for sparse matrices checks that the nzval is nonzero first.

julia/base/sparse/sparsematrix.jl

Line 833 in 8c17d60

if S.nzval[k] != 0

pranavtbhat · 2016-04-02T19:38:22Z

Oh. So I'll have to iterate through nzval to make sure that the values are non-zero?

nalimilan · 2016-04-02T20:06:30Z

You can use filter for that.

pranavtbhat · 2016-04-03T05:59:31Z

filter turns out to be quite slow. I just modified the SparseMatrix findn to handle SparseVectors. This is significantly faster:

function find{Tv,Ti}(x::SparseVector{Tv,Ti})
    numnz = nnz(x)
    I = Array(Ti, numnz)

    count = 1
    @inbounds for i = 1 : numnz
        if x.nzval[i] != 0
            I[count] = x.nzind[i]
            count += 1
        end
    end

    count -= 1
    if numnz != count
        deleteat!(I, (count+1):numnz)
    end

    return I
end

nalimilan · 2016-04-03T08:45:10Z

You're right that since we expect to find few zero values, filter will be less efficient than your custom version. Though I think you can write this much more simply by creating an empty vector I, calling sizehint!(I, numnz), and then using push! to add the indices of non-zero elements. That way you don't need count and the deleteat! step.

pranavtbhat · 2016-04-03T09:54:16Z

Using the deleteat step was almost twice as fast as using the sizehint step. I'm not sure why. Also, since the findn method for SparseMatrices uses count and deleteat, I guess we can just stick to it for the sake of uniformity. I'll rebase and push soon.

pranavtbhat · 2016-04-03T10:01:51Z

Should I add a separate test for this as well? (One that constructs a sparse vector using a nzval array containing zeros)

nalimilan · 2016-04-03T10:21:20Z

That's fine then (even if it's a bit surprising to me), thanks for checking. Keeping the code consistent with sparse matrices is also a good idea to help maintaining it.

This code path should definitely be tested.

ararslan · 2016-05-05T05:34:05Z

Is this addressed by the now-merged #16110?

pranavtbhat · 2016-05-05T05:40:19Z

Yup I think this is addressed.

tkelman · 2016-05-05T05:42:11Z

It would still be good to have a specialized implementation, since enumerate is likely quite slow on a sparse input

tkelman · 2016-05-05T05:44:35Z

#16110 makes no difference here because these were already AbstractArrays.

ararslan · 2016-05-05T05:49:50Z

Ah right, sorry :|

ViralBShah · 2016-08-16T08:18:02Z

Let's have findnz also. This implementation is significantly faster than the generic one.

ViralBShah · 2016-08-16T08:32:03Z

@pranavtbhat and I were trying to bring this up to date, but some git snafu closed this. A new PR is coming.

pranavtbhat force-pushed the master branch from f102a9f to 05b0d54 Compare April 3, 2016 09:57

pranavtbhat force-pushed the master branch from 05b0d54 to 877c997 Compare April 3, 2016 13:40

ViralBShah added the domain:arrays:sparse Sparse arrays label Apr 4, 2016

pranavtbhat closed this May 5, 2016

tkelman reopened this May 5, 2016

pranavtbhat closed this Aug 16, 2016

pranavtbhat force-pushed the master branch from 877c997 to f964e10 Compare August 16, 2016 08:25

pranavtbhat mentioned this pull request Aug 16, 2016

Define find and findnz for SparseVector #18049

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable find on SparseVectors #15747

Enable find on SparseVectors #15747

pranavtbhat commented Apr 2, 2016

tkelman commented Apr 2, 2016

pranavtbhat commented Apr 2, 2016

nalimilan commented Apr 2, 2016

pranavtbhat commented Apr 3, 2016

nalimilan commented Apr 3, 2016

pranavtbhat commented Apr 3, 2016

pranavtbhat commented Apr 3, 2016

nalimilan commented Apr 3, 2016

ararslan commented May 5, 2016

pranavtbhat commented May 5, 2016

tkelman commented May 5, 2016

tkelman commented May 5, 2016 •

edited

Loading

ararslan commented May 5, 2016

ViralBShah commented Aug 16, 2016

ViralBShah commented Aug 16, 2016

Enable find on SparseVectors #15747

Enable find on SparseVectors #15747

Conversation

pranavtbhat commented Apr 2, 2016

tkelman commented Apr 2, 2016

pranavtbhat commented Apr 2, 2016

nalimilan commented Apr 2, 2016

pranavtbhat commented Apr 3, 2016

nalimilan commented Apr 3, 2016

pranavtbhat commented Apr 3, 2016

pranavtbhat commented Apr 3, 2016

nalimilan commented Apr 3, 2016

ararslan commented May 5, 2016

pranavtbhat commented May 5, 2016

tkelman commented May 5, 2016

tkelman commented May 5, 2016 • edited Loading

ararslan commented May 5, 2016

ViralBShah commented Aug 16, 2016

ViralBShah commented Aug 16, 2016

tkelman commented May 5, 2016 •

edited

Loading