Disable pathologically expensive `SimplifySelectOps` optimization #14

staticfloat · 2023-03-28T00:46:24Z

`SimplifySelectOps` is a late optimization in LLVM that attempts to translate `select(C, load(A), load(B))` into `load(select(C, A, B))`. However, in order to do this optimization, it needs to check that `C` does not depend on the result of `load(A)` or `load(B)`. Unfortunately (unlike Julia and LLVM at the IR level), LLVM does not have a topological order of statements computed at this stage of the compiler, so LLVM needs to iterate through all statements in the function in order to perform this legality check. For large functions, this is extremely expensive, accounting for the majority of all compilation time for such functions. On the other hand, the optimization itself is minor, allowing at most the elision of one additional load (and it doesn't fire particularly often, because the middle end can perform similar optimizations). Until there is a proper solution in LLVM, simply disable this optimization, making LLVM several orders of magnitude faster on real-world benchmarks. X-ref: llvm#60132
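To illustrate the legality check described above, here is a minimal toy sketch in Python. It is not LLVM's implementation: the `Node` class and the `depends_on` and `can_fold_select_of_loads` helpers are hypothetical names invented for this example. The idea it models is that once the loads are folded, the single remaining load takes its address from a select on `C`, so `C` must not itself be computed from either original load. Without a precomputed topological order, answering that question means walking the operand graph, which in the worst case visits every node in the function.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)  # eq=False keeps identity-based comparison and hashing
class Node:
    """Hypothetical stand-in for one operation in a dataflow graph."""
    op: str
    operands: list = field(default_factory=list)


def depends_on(node, targets):
    """Return True if `node` is, or transitively uses, any node in `targets`.

    With no topological order available, this is a plain graph walk that
    may touch every node reachable from `node`.
    """
    seen = set()
    stack = [node]
    while stack:
        cur = stack.pop()
        if id(cur) in seen:
            continue
        seen.add(id(cur))
        if any(cur is t for t in targets):
            return True
        stack.extend(cur.operands)
    return False


def can_fold_select_of_loads(cond, load_a, load_b):
    """Toy legality check: folding is allowed only if the condition
    does not depend on the result of either load."""
    return not depends_on(cond, [load_a, load_b])


# Two loads from unrelated addresses.
load_a = Node("load", [Node("addr_a")])
load_b = Node("load", [Node("addr_b")])

# A condition computed from a function argument: independent of both loads.
independent_cond = Node("cmp", [Node("arg")])

# A condition computed from the result of load(A): folding would be illegal.
dependent_cond = Node("cmp", [load_a])

fold_ok = can_fold_select_of_loads(independent_cond, load_a, load_b)
fold_blocked = can_fold_select_of_loads(dependent_cond, load_a, load_b)
```

In this sketch `fold_ok` is true and `fold_blocked` is false. The cost concern in the description corresponds to the `while` loop: for a very large function the walk can cover most of the graph each time the check runs.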

vchuravy · 2023-04-16T14:02:48Z

Merged into 14.x as 5c82f53

staticfloat force-pushed the sf/llvm_sso_patch branch from 77b871f to 2105735 Compare March 28, 2023 00:55

staticfloat mentioned this pull request Mar 28, 2023

Carry patch to avoid excessive LLVM time JuliaLang/julia#48681

Closed

vchuravy merged commit 6783826 into julia-release/15.x Apr 16, 2023

vchuravy deleted the sf/llvm_sso_patch branch April 16, 2023 14:02
