Allow external CodeInstances to be added to the execution engine #36400

Keno · 2020-06-23T18:58:53Z

As of #35831, we've had the ability to cache CodeInstances in an
external cache and to replace that cache during jl_generate_native
(e.g. for GPU compilation). This extends the same capability to
CodeInstances that are added to the execution engine.

vtjnash

It seems like this needs to create a separate instance of our internal JIT, customized on the cg_params, to avoid polluting the normal cache. As it is, I wanted to hide this method from users and eventually reduce it in scope and capability. It currently exists to help handle the transitional boundary between the internal statefulness of the JIT and the external statelessness of our callable/interpreter semantics, so this function felt like an awkward mix of policies and implementations, which is possibly just working around deficiencies in both right now.

Keno · 2020-06-26T14:28:26Z

Which internal cache are you thinking of that would be polluted? The CodeInstances I'm imagining passing here would not be part of the regular cache structure. Is there another cache I should be looking at?

Keno · 2020-06-30T22:34:13Z

@vtjnash ping. Could you elaborate on which caches you mean?

Keno · 2020-07-02T00:19:07Z

Rebased.

vtjnash · 2020-07-02T01:13:45Z

This just seems like it could greatly complicate and/or invalidate various other promises in the system through negative interactions with the expectations of this cache (such as incremental compilation and the compile=all flag). There's a reason the external entry points try to work with MethodInstances and hide these objects. Even their presence as an argument here I think might be a bit of an accident: we optionally pass it along to avoid duplicate work (whether a cache re-lookup or re-inference), but it's not entirely straightforward whether we promise to preserve that information. We sometimes also may discover it necessary to inject the result back into the normal cache under a different key. Currently we just ignore bugs that could arise from that situation. This method also sometimes fails for other reasons, which the caller must handle.

Keno · 2020-07-02T01:18:37Z

Can you clarify which cache you're talking about? Are you talking about the various fields inside the CodeInstance?

vtjnash · 2020-07-02T01:48:53Z

No, mostly we consider the fields to be write-once so that there are no mutation issues. But that is why we may want to be able to interact with the various other caches in the system if the result can't fit into the input.

Keno · 2020-07-02T01:52:34Z

Could you please be more precise as to which caches you're thinking of? I was under the impression that at this point the only relevant caches are those inside the code instance objects (potentially returned via the lookup function). If there are any other caches, I obviously need to address those. I really want to be able to have a totally separate set of caches that I can manually manage, so I can do horrible things to the code without worrying about corrupting anything in the main cache.
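
For readers following along, here is a toy model of the separation being asked for, written in Python rather than against Julia's runtime. Every name in it (`CodeInstance`, `MAIN_CACHE`, `add_to_engine`, the `lookup` callback) is a hypothetical stand-in, not the actual API from #35831 or this PR. It only illustrates the idea that the caller supplies the lookup, so a hand-managed cache is consulted without being merged into the main one.

```python
from typing import Callable, Dict, Optional


class CodeInstance:
    """Toy stand-in for a compiled-code record; not Julia's real struct."""

    def __init__(self, method: str, body: str) -> None:
        self.method = method
        self.body = body


# Toy "main" cache that ordinary compilation would read and write.
MAIN_CACHE: Dict[str, CodeInstance] = {}


def main_lookup(method: str) -> Optional[CodeInstance]:
    """Default lookup: consult only the main cache."""
    return MAIN_CACHE.get(method)


def add_to_engine(
    method: str,
    lookup: Callable[[str], Optional[CodeInstance]] = main_lookup,
) -> str:
    """Hypothetical entry point: fetch code through the caller's lookup and 'emit' it.

    Nothing here writes to MAIN_CACHE, so an external cache stays external.
    """
    ci = lookup(method)
    if ci is None:
        raise KeyError(f"no code instance for {method}")
    return f"emitted {ci.method}: {ci.body}"


# The regular entry for "f", plus a hand-managed external cache holding a
# different, experimental version of the same method.
MAIN_CACHE["f"] = CodeInstance("f", "regular body")
external_cache = {"f": CodeInstance("f", "experimental body")}

result_external = add_to_engine("f", external_cache.get)
result_main = add_to_engine("f")
```

In this model the two calls resolve the same method name to different code, and the main cache entry is untouched afterwards. Whether the real system has other caches that break this picture is exactly the open question in this thread.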

vtjnash · 2020-07-02T18:52:15Z

Like Method, MethodInstance, and Module, while they in theory can be made to exist outside of the normal structure, some parts of the system are designed to expect that they are not permitted to be.

That said, upon reflection, we do have an existing mechanism for that purpose. We essentially define that eval is the entry point for generally doing horrible things, and then pass it a thunk (basically a CodeInstance wrapped in some metadata) and hope that it knows horrible things are going on and to try not to get in the way too much of that, but to just trust that it'll work out okay eventually.

Keno · 2020-07-02T22:00:36Z

I was really hoping for a specific example of what would go wrong with the CodeInstance. I agree that this isn't a suitable long-term solution (for all sorts of reasons, e.g. it doesn't precompile properly), but it's useful for experimentation to just completely take a CodeInstance out of the cache hierarchy and be able to operate on it independently. I'm thinking that in the long term, these will probably just go back into the regular method cache, keyed by some context a la Cassette, perhaps with a way to attach specific cgparams to a CodeInfo, but that's a mechanism I don't want to design until I have more of an idea of all the things it needs to do, so these kinds of experimental hooks are useful until then.

vtjnash reviewed Jun 23, 2020

Keno force-pushed the kf/hooks3 branch from 90f1b6c to c74262c on July 1, 2020 22:33
