
RFC: move Vararg specType rewriting into dispatch #16159

Closed
vtjnash wants to merge 32 commits

Conversation


@vtjnash commented May 2, 2016

As a start to addressing the need for a specSig representation for vararg calls, this moves the responsibility for forming the varargs argument tuple type from the callee to the caller (e.g. dispatch or specsig). My thought here is that instead of implementing this logic in codegen as a special case, this representation will make it easier to share existing code for allocating and passing around tuples. This means we would now have length(specTypes) == length(argnames). I think this representation (where specTypes maps directly to argnames instead of to the callee's argument tuple) is cleaner than having each consumer (inference, codegen, generic functions for #12783) effectively re-implement this each time.

this implements #12783 along the way
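
A rough sketch of the shape of this change (hypothetical method and types, not the actual internal representation):

```julia
# Hypothetical method and call, only to illustrate the shape of the change
# (the real internal signature also includes the function's own type).
f(a::Int, xs...) = length(xs)

f(1, 2.0, "three")

# Callee-side rewriting (status quo): specTypes holds the flattened call types,
# so there is no single entry corresponding to `xs`:
#     Tuple{Int64, Float64, String}
# Caller-side rewriting (this PR): dispatch forms the trailing tuple type up
# front, so each entry lines up with one argument name / slot:
#     Tuple{Int64, Tuple{Float64, String}}   # length(specTypes) == length(argnames)
```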

timholy and others added 30 commits April 29, 2016 17:58
This also:
- adds an enum classification of Vararg types
- implements a few convenience functions
- switches some checks from jl_is_va_tuple to jl_va_tuple_kind(t) == JL_VARARG_UNBOUND
- updates jl_wrap_vararg to take two inputs
- displays such Varargs properly in the REPL
This leaves the old one in place and compares the results of the two.
This way we can catch bugs immediately, rather than trying to deduce
what went wrong from the indirect consequences.
This also makes tweaks to the algorithm to prevent matches like
Array{Tuple{Int}} <: Array{NTuple}
This also turns debugging output back on in preparation for the final implementation
The cmdlineargs test fails, but only due to the extra output generated.
Matching the symbols turns out to be really dangerous; it's necessary to match pointers.
Needed for NTuple{Integer} declarations like those found in sparsematrix.jl.
On current master, this gave true but
   Type{Tuple{}} <: Type{Tuple{Vararg}}
gave false. Now they both give true.
c.f. "Instantiate Tuple{Vararg{Int,3}} as Tuple{Int,Int,Int}"
this ensures that type-inference gets the same method signature from type-intersection
as will be used for the actual call

timholy commented May 2, 2016

Will this cause the allocation of a tuple for each call to a varargs function?


vtjnash commented May 2, 2016

no, that's what we do now (where the callee is expected to make the tuple). this instead modifies it such that dispatch is expected to create the tuple (which for specsig, would be inlined into the caller)
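
Roughly, the difference being described (pseudo-code, not actual generated code):

```julia
# Pseudo-Julia, illustrative only; neither version is literal generated code.
f(xs...) = sum(xs)

# Today (callee forms the tuple): the specialized body receives the raw
# trailing arguments and packs them into `xs` itself:
#     f_body(a, b, c) = (xs = (a, b, c); sum(xs))
#
# With this PR (dispatch/specsig forms the tuple): the caller side builds the
# tuple, and the body receives it already assembled:
#     xs = (a, b, c)        # built by dispatch, or inlined into the caller
#     f_body(xs)            # the body only ever sees the finished tuple
```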

@JeffBezanson

To me that seems to involve tuples in more of the calling process, making them harder to remove. Of course, in the specsig case the caller already has full knowledge of the callee, so it doesn't make much difference. But outside that case the caller doesn't know it's calling a varargs function, so it doesn't quite seem right for it to be the caller's responsibility to handle it.


vtjnash commented May 2, 2016

right, the "caller" is dispatch (or specsig, after inlining a specialization). this change is intended to make it so that passes that consume a LambdaInfo don't generally need to consider whether a signature is varargs – the types of the arguments simply line up with their names (and slot numbers).

@tkelman closed this May 2, 2016
@tkelman reopened this May 2, 2016

tkelman commented May 2, 2016

CI is failing to trigger here... oh, it's a PR into a PR, never mind.

@vtjnash force-pushed the teh/ntuple branch 2 times, most recently from bceb51b to cd910cf on May 2, 2016 at 22:44

timholy commented May 3, 2016

Does this include a rebase? It's got the whole #11242 set of patches, too, despite the fact that it's against #11242.

Speaking for just myself, I'm fine with you merging this whenever you feel ready to do so.


JeffBezanson commented May 3, 2016

I don't really see why this is needed for #11242.

I'm also not convinced this is a good idea in general. The core issue is that, currently, every argument tuple type we have refers to the caller's arguments. We need to form these tuple types for dispatch. However, once the arguments are bound to names in the callee, there isn't any need for a tuple type of those. Sure, you can represent any N types with a tuple type, but there is no need for this tuple type itself, which makes it suspect to me.

One symptom is that the result of ml_matches can no longer be understood as a type intersection. It's type intersection plus some rewriting to form a vararg tuple type.

Another symptom is, imagine we want to specialize f(x...) for 3 Int arguments by compiling it the same way we'd compile f(::Int, ::Int, ::Int) today. If we did that, there would never be any tuple involved, so there should be no corresponding tuple type. This is a decision entirely about the compilation of a single method, that should not be exposed outside it (until you get to the very last step of calling it with a specialized signature). So it seems like the wrong abstraction to me to say that you need to form NTuple{3,Int} as part of looking up this method.
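
A hypothetical illustration of the specialization strategy described above:

```julia
# Hypothetical illustration of the alternative compilation strategy described above.
f(x...) = x[1] + x[2] + x[3]

# Specializing `f` for three Int arguments could, in principle, be compiled the
# same way this explicit three-argument method is compiled today:
g(x1::Int, x2::Int, x3::Int) = x1 + x2 + x3
# Under that strategy no tuple is ever materialized, which is the basis of the
# argument that NTuple{3, Int} is a detail of one method's compilation rather
# than something dispatch should have to construct.
```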


vtjnash commented May 3, 2016

Changing specTypes isn't essential, since we can unambiguously recreate this svec at any point from sig + nreq. However, after dispatch, we also generally don't need to know sig, we need to know the types of the arguments. And so this change would make that information more readily available.
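
For example, a reconstruction along these lines would work (hypothetical helper, not part of Base, shown only to illustrate the sig + nreq point):

```julia
# Hypothetical helper (not Base API) showing that the per-argument types can be
# recovered from the call signature plus the number of required arguments.
function argument_types(sig::Type{<:Tuple}, nreq::Int)
    ps = collect(sig.parameters)
    fixed = ps[1:nreq]                   # the required (non-vararg) arguments
    tail  = Tuple{ps[nreq+1:end]...}     # the trailing arguments as one tuple type
    return Core.svec(fixed..., tail)
end

argument_types(Tuple{Int, Float64, String}, 1)
# => svec(Int64, Tuple{Float64, String})
```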

Another symptom is, imagine we want to specialize f(x...) for 3 Int arguments by compiling it the same way we'd compile f(::Int, ::Int, ::Int) today. If we did that, there would never be any tuple involved, so there should be no corresponding tuple type

where did x go? you can't simply delete a variable and then claim that there was never a tuple involved. The type of x is still NTuple{3, Int}, even after you've sunk it into the function. This argument is also completely unrelated, since what codegen decides to do with ::Int, ::Int, ::Int is not defined by a particular transform on specTypes, although the general idea is of course to pick it such that it does reflect the expected usage. Since we often need to forward or iterate this argument, I believe that passing it as NTuple{3, Int} is likely to be far more efficient (iteration generally requires spilling to memory) and also generalizes better to NTuple{N, Int} where N is either large or unknown (and is thus passed separately, but the ntuple can still be unboxed).
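
An illustrative sketch of the forwarding/iteration point (hypothetical functions):

```julia
# Illustrative only: why passing the varargs as one tuple can pay off.
inner(xs...) = sum(xs)           # iterates its vararg argument
outer(xs...) = inner(xs...)      # forwards its vararg argument unchanged

outer(1, 2, 3)
# If the specialized calling convention passes `xs` as a single NTuple{3, Int},
# both the forwarding call and the iteration in `sum` can reuse that one value;
# if the three Ints arrive as separate scalar arguments instead, the callee has
# to reassemble (and typically spill) a tuple before it can iterate.
```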

One symptom is that the result of ml_matches

yeah, I'm not convinced of the value of that transformation's timing myself either. It would probably be just as sensible for the caller to continue making that transform if it cares (e.g. how it happens in type-inference now, which I deleted after moving the transformation into ml_matches).

@JeffBezanson

where did x go?

It's just that x only exists inside the function. Indeed, it would be possible to transform the function such that there were 3 arguments x1, x2, and x3 instead of x, and the caller would not know the difference. I don't think we'll actually do this, but it clarifies the ownership of this information.


vtjnash commented May 3, 2016

but it clarifies the ownership of this information

except that it doesn't: specTypes is consumed by the callee, not the caller. I see no reason that f(x...) and f(x,y,z) need to have the same calling convention, when I think it is likely more efficient for them to use different calling conventions. (And as you pointed out, there's also no particular reason for specTypes to be a tupletype; it could equally well just be an svec, since all the consumers really care about is the types of the argument variables.)


timholy commented May 3, 2016

Jeff's strategy is the one I took in my own attempt at this, #11242 (comment).

@JeffBezanson

I don't necessarily advocate doing that transformation, I just feel like this increases coupling. Actually I really like the idea of renaming specTypes to argTypes and making it an svec. That greatly clarifies the intended meaning here. From there we just need to be a bit careful about whose responsibility it is to initialize that field. ml_matches is probably too early.

As an interesting data point, length(Tuple.name.cache) is ~26000 on master and ~39000 on this branch (the cache grows by 1.5x so this doesn't reflect the actual number of types). That itself is not really a problem, but it does indicate that this introduces a fair number of unnecessary tuple types. Another reason to switch to an svec.
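
An illustrative comparison of the two representations (not actual internals):

```julia
# Illustrative comparison of the two representations under discussion:
Tuple{Int64, Tuple{Float64, String}}       # tuple type: interned in Tuple.name.cache
Core.svec(Int64, Tuple{Float64, String})   # svec: same per-argument info, no new tuple type
# Every specialization rewritten as a fresh tuple type adds an entry to the
# tuple type cache, which is presumably where the ~26000 -> ~39000 growth comes from.
```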

@JeffBezanson

Actually, we could store these types in slottypes and remove the specTypes field entirely.


vtjnash commented May 3, 2016

Actually, we could store these types in slottypes and remove the specTypes field entirely.

we would need to change lowering such that if an argument is reused as a variable it gets two slots, but I think that's certainly feasible
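
An illustrative example of such an argument (hypothetical function):

```julia
# Illustrative: an argument that is reused as a (reassigned) variable.
function f(x)
    x = x + 1      # `x` is reassigned inside the body
    return x
end
# If argument types live in slottypes, the slot for the incoming `x` needs to keep
# the dispatched argument type, so lowering would give the reassigned `x` a second
# slot rather than letting both roles share one slot type.
```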

@JeffBezanson

Yes, it might be better to explicitly introduce new slots for assigned arguments, since codegen basically has to implement that anyway.

@JeffBezanson

#16165 looks like a case where it would have been more convenient for lowering to have done that transformation.
