Performance/type inference regression from #12327 #12476

simonster · 2015-08-05T21:43:59Z

It occurs to me that, in the heterogeneous key/value type case, #12327/#12261 causes us to lose type information. Consider basically any code that uses the for (k, v) in dict idiom, e.g.:

function f(a)
    x = 0
    for (k, v) in a
        x += v
    end
    x
end

Now consider what happens if the key and value types aren't the same. Previously, because we do fancy things for tuple destructuring, we could infer that k is of the key type K and v is of the value type V. But we don't do these fancy things for Pair destructuring. code_warntype(f, (Dict{Float64,Int},)) shows that now we can only determine that k and v are both of type Union{K,V}. For the benchmark:

d = Dict([rand() => rand(Int) for i = 1:10000000]);
@time f(d)

On 6c34bc7 I get:

  0.476170 seconds (30.00 M allocations: 457.764 MB, 6.70% gc time)

On the 0.3 branch I get:

elapsed time: 0.127657036 seconds (96 bytes allocated)

The text was updated successfully, but these errors were encountered:

carnaval · 2015-08-05T21:47:25Z

As a short term solution you could add an indexed_next thing for pairs, as well as the corresponding hack in inference.jl.

I have some ideas on how to solve this more generally (this is the same problem as the getindex(::Fact,::Symbol) non leaf return types for matrix factorizations) but it's not gonna happen soon :-)

StefanKarpinski · 2015-08-06T01:50:10Z

Would it help of Pair were defined this way instead:

immutable Pair{A,B}
    pair::Tuple{A,B}
end

That way the Pair type might inherit some of the fancy destructuring for tuples.

simonster · 2015-08-06T02:28:34Z

That might work, but it might not be great for non-isbits since it's an extra object and pointer indirection.

Fix #12476

simonster added performance Must go faster regression Regression in behavior compared to a previous version labels Aug 5, 2015

simonster added a commit that referenced this issue Aug 6, 2015

Fix #12476

240e271

simonster closed this as completed in b3dd473 Aug 9, 2015

simonster added a commit that referenced this issue Aug 9, 2015

Merge pull request #12493 from JuliaLang/sjk/12476

ef949a5

Fix #12476

jrevels mentioned this issue Nov 6, 2015

CI Performance Tracking for v0.5 #13893

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance/type inference regression from #12327 #12476

Performance/type inference regression from #12327 #12476

simonster commented Aug 5, 2015

carnaval commented Aug 5, 2015

StefanKarpinski commented Aug 6, 2015

simonster commented Aug 6, 2015

Performance/type inference regression from #12327 #12476

Performance/type inference regression from #12327 #12476

Comments

simonster commented Aug 5, 2015

carnaval commented Aug 5, 2015

StefanKarpinski commented Aug 6, 2015

simonster commented Aug 6, 2015