implement `count` using `mapreduce` #34048

stev47 · 2019-12-08T21:02:13Z

This creates the same calling interface for count as for e.g. sum,
namely allowing the dims keyword.
The implementation is also shorter than before without sacrificing
performance.
mapreduce with add_sum may even yield performance benefits through
chunking, though this was not observed in simple tests.

stev47 · 2019-12-09T12:43:57Z

buildbot/tester_win32 fail is unrelated

base/reducedim.jl

stev47 · 2019-12-12T20:15:45Z

added compat annotations and tests.
Question: Do we really want count(Any[]) == 0 as count(Any[1]) throws?

stev47 · 2019-12-14T16:05:44Z

buildbot failures are unrelated

StefanKarpinski · 2020-02-03T19:10:05Z

Would it make sense for @tkf to review since he overhauled the mapreduce machinery recently?

tkf · 2020-02-03T20:56:20Z

Actually, I tweaked only mapfoldl so far. But this patch LGTM anyway.

base/reducedim.jl

stev47 · 2020-04-06T06:54:05Z

this seems ready, freebsd test failure looks unrelated

tkf · 2020-05-01T01:40:20Z

base/reducedim.jl

+count(A::AbstractArray; dims=:) = count(identity, A, dims=dims)
+count(f, A::AbstractArray; dims=:) = mapreduce(_bool(f), add_sum, A, dims=dims, init=0)


It'd be nice to use A::AbstractArrayOrBroadcasted instead (with some tests).

thanks, I changed it. You already added a test for Broadcasted and count() in 2f90dde.
Do you still feel we need more?

base/reducedim.jl

test/reducedim.jl

This creates the same calling interface for `count` as for other mapreduce-type functions like e.g. `sum`, namely allowing the `dims` keyword. The implementation itself is shorter than before without sacrificing performance. More detailed documentation for `count` was added too.

stev47 · 2020-05-01T13:21:48Z

test fail on linuxaarch64 is worrying but seems unrelated

tkf

LGTM. Re-running failed CI: https://build.julialang.org/#/builders/28/builds/10356

Jutho · 2020-05-06T11:28:56Z

I am not quite sure how this is possible, given the minimal code change of this PR, but git bisect points to this commit as the culprit for the following change in type inference:

using LinearAlgebra, Test
X = [randn(n,n) for n = 1:5];
f(iter, init) = mapreduce(sum, +, iter; init = init);
g(iter, init) = mapreduce(norm, +, iter; init = init);
@inferred f(X, 0.)
@inferred g(X, 0.)

f is always inferred before and after this commit, g fails to infer after this commit. However, type inference acts very strangely. In particular, it is important that you run this on a fresh Julia session. In the slightly more elaborate example:

using LinearAlgebra, Test
X = [randn(n,n) for n = 1:5]

f1(iter, init) = reduce(+, sum(b) for b in iter; init = init);
f2(iter, init) = mapreduce(sum, +, iter; init = init);
g1(iter, init) = reduce(+, norm(b) for b in iter; init = init);
g2(iter, init) = mapreduce(norm, +, iter; init = init);
h1(iter, p, init) = reduce(+, norm(b,p) for b in iter; init = init);
h2(iter, p, init) = mapreduce(b->norm(b,p), +, iter; init = init);

@inferred f1(X, 0.)
@inferred f2(X, 0.)
@inferred g1(X, 0.)
@inferred g2(X, 0.)
@inferred h1(X, 2, 0.)
@inferred h2(X, 2, 0.)

f1 and f2 are always inferred, but out of g1, g2, h1 and h2, the one that is ran first is not inferred, and then the others are.

So I guess the norm implementation also plays some role in this, but I am still trying to understand the actual reason for this happening.

Jutho · 2020-05-06T11:39:31Z

I expect the lines

elseif p == 0
        return typeof(float(norm(first(itr))))(count(!iszero, itr))

in norm in LinearAlgebra/src/generic.jl have something to do with this, though norm itself with arbitrary p is perfectly inferred and type stable.

stev47 · 2020-05-06T12:00:59Z

It seems to work if you add type assertions:

        return typeof(float(norm(first(itr))))(count(!iszero, itr))::typeof(float(norm(first(itr))))

Also for me only g1 is failing, h1 works fine. Maybe the compiler is giving up on inference because stuff is getting too deeply nested with the mapreduce count being called within a mapreduce operation, I'm not sure though.

Jutho · 2020-05-06T12:21:09Z

Regarding failing and working, it is the one which is inferred firstly that fails. So if you change the order of the @inferred statements in my code snippet, such that h1 is ran/inferred first in a fresh Julia session, that one fails, but then g1 works fine. That's what I find extremely surprising/disturbing?

tkf · 2020-05-06T22:01:17Z

It seems to work if you add type assertions:

Maybe we should use convert? There is less guarantee for the type constructor to return the actual type.

That's what I find extremely surprising/disturbing?

Yeah, I agree. Maybe some kind of caching bug? I think it is worth opening a dedicated issue.

JeffBezanson reviewed Dec 9, 2019

View reviewed changes

base/reducedim.jl Outdated Show resolved Hide resolved

stev47 force-pushed the feature/count-reduce branch from 43b93ac to 1786182 Compare December 9, 2019 23:38

ararslan added needs compat annotation Add !!! compat "Julia x.y" to the docstring needs tests Unit tests are required for this change labels Dec 11, 2019

stev47 force-pushed the feature/count-reduce branch from 1786182 to 90f7561 Compare December 12, 2019 20:10

stev47 force-pushed the feature/count-reduce branch from 90f7561 to 304116d Compare December 12, 2019 20:30

stev47 force-pushed the feature/count-reduce branch 2 times, most recently from 88cb5df to f976b9a Compare February 2, 2020 11:21

tkf reviewed Feb 4, 2020

View reviewed changes

base/reducedim.jl Outdated Show resolved Hide resolved

stev47 force-pushed the feature/count-reduce branch from f976b9a to 658f631 Compare February 4, 2020 13:50

stev47 force-pushed the feature/count-reduce branch from 658f631 to d3dd296 Compare February 12, 2020 09:14

stev47 force-pushed the feature/count-reduce branch from d3dd296 to f0156c5 Compare March 11, 2020 19:51

stev47 force-pushed the feature/count-reduce branch from f0156c5 to 3a3b4f4 Compare April 23, 2020 21:34

stev47 requested a review from JeffBezanson April 24, 2020 07:38

stev47 force-pushed the feature/count-reduce branch from 3a3b4f4 to 5fcc4cd Compare April 30, 2020 07:44

tkf reviewed May 1, 2020

View reviewed changes

base/reducedim.jl Outdated Show resolved Hide resolved

tkf reviewed May 1, 2020

View reviewed changes

test/reducedim.jl Outdated Show resolved Hide resolved

tkf reviewed May 1, 2020

View reviewed changes

test/reducedim.jl Outdated Show resolved Hide resolved

tkf removed the needs compat annotation Add !!! compat "Julia x.y" to the docstring label May 1, 2020

stev47 force-pushed the feature/count-reduce branch from d453af0 to 9439134 Compare May 1, 2020 09:44

tkf removed the needs tests Unit tests are required for this change label May 1, 2020

tkf approved these changes May 1, 2020

View reviewed changes

tkf added the status:forget me not PRs that one wants to make sure aren't forgotten label May 1, 2020

tkf merged commit 301db97 into JuliaLang:master May 2, 2020

tkf removed the status:forget me not PRs that one wants to make sure aren't forgotten label May 2, 2020

stev47 mentioned this pull request May 8, 2020

problem with the unreliable approximation of Core.Compiler.return_type #35800

Open

kcajf mentioned this pull request May 9, 2020

Added missing kwargs to mapreduce mcabbott/AxisKeys.jl#14

Merged

JeffBezanson mentioned this pull request Aug 4, 2020

apparent inference hang in makie plot #36897

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement `count` using `mapreduce` #34048

implement `count` using `mapreduce` #34048

stev47 commented Dec 8, 2019

stev47 commented Dec 9, 2019

stev47 commented Dec 12, 2019

stev47 commented Dec 14, 2019

StefanKarpinski commented Feb 3, 2020

tkf commented Feb 3, 2020

stev47 commented Apr 6, 2020

tkf May 1, 2020

stev47 May 1, 2020

stev47 commented May 1, 2020

tkf left a comment

Jutho commented May 6, 2020

Jutho commented May 6, 2020 •

edited

Loading

stev47 commented May 6, 2020

Jutho commented May 6, 2020 •

edited

Loading

tkf commented May 6, 2020

		count(A::AbstractArray; dims=:) = count(identity, A, dims=dims)
		count(f, A::AbstractArray; dims=:) = mapreduce(_bool(f), add_sum, A, dims=dims, init=0)

implement count using mapreduce #34048

implement count using mapreduce #34048

Conversation

stev47 commented Dec 8, 2019

stev47 commented Dec 9, 2019

stev47 commented Dec 12, 2019

stev47 commented Dec 14, 2019

StefanKarpinski commented Feb 3, 2020

tkf commented Feb 3, 2020

stev47 commented Apr 6, 2020

tkf May 1, 2020

Choose a reason for hiding this comment

stev47 May 1, 2020

Choose a reason for hiding this comment

stev47 commented May 1, 2020

tkf left a comment

Choose a reason for hiding this comment

Jutho commented May 6, 2020

Jutho commented May 6, 2020 • edited Loading

stev47 commented May 6, 2020

Jutho commented May 6, 2020 • edited Loading

tkf commented May 6, 2020

implement `count` using `mapreduce` #34048

implement `count` using `mapreduce` #34048

Jutho commented May 6, 2020 •

edited

Loading

Jutho commented May 6, 2020 •

edited

Loading