optimize: revise inlining costs #51599

vtjnash · 2023-10-05T15:23:02Z

Add a bonus for Intrinsics called with mostly constant arguments. We know that simple expressions like x*1 + 0 will get optimized later by LLVM, and also likely fold into other expressions, so try to reflect that in the cost estimated earlier. Additionally rebalance some of the other costs to more accurately reflect what they take in assembly.

Add a bonus for Intrinsics called with mostly constant arguments. We know that simple expressions like `x*1 + 0` will get optimized later by LLVM, and also likely fold into other expressions, so try to reflect that in the cost estimated earlier. Additionally rebalance some of the other costs to more accurately reflect what they take in assembly.

aviatesk

Looks good to me.

aviatesk · 2023-10-05T23:57:18Z

base/compiler/optimize.jl

+ (f === Intrinsics.cglobal || f === Intrinsics.llvmcall) # these hold malformed IR, so argextype will crash on them
+ return cost
+ end
+ aty2 = widenconditional(argextype(ex.args[2], src, sptypes))


widenconditional should be unnecessary at this phase.

I was unsure if some external package might try to request inlining costs before we called this on everything, and I wanted to still give a consistent answer then. I think it is no-cost to be here?

aviatesk · 2023-10-06T00:07:58Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2023-10-06T07:31:36Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

vtjnash · 2023-10-06T13:26:41Z

The benchmark change here is whether we inline the call or dispatch the signature to call

broadcast(::typeof(+), ::Int64, ::NTuple{10, Float64}, ::Int64, ::Vararg{Any})

The penalty for making that choice is higher than it should be, and poorly reflected in the way we compute the inlining cost heuristics, but that is not relevant to this PR.

maleadt · 2023-10-26T11:30:20Z

This change broke GPUCompiler's always_inline mode. MWE:

@inline @generated function sink(i::T) where {T}
    llvmcall_str = """%slot = alloca i64
                     store volatile i64 %0, i64* %slot
                     %value = load volatile i64, i64* %slot
                     ret i64 %value"""
    return :(Base.llvmcall($llvmcall_str, T, Tuple{T}, i))
end

@eval f(x) = $(foldl((e, _) -> :($sink($e) + $sink(x)), 1:100; init=:x))
g() = f(10)

include("newinterp.jl")
@newinterp AlwaysInline
Core.Compiler.OptimizationParams(::AlwaysInline) =
    Core.Compiler.OptimizationParams(; inline_cost_threshold=typemax(Int))
Base.code_ircode(g, Tuple{}; interp=AlwaysInline())

@vtjnash Are we doing something fishy here, or why was this affected by this PR?

aviatesk · 2023-10-26T12:11:17Z

Currently OptimizationParams(; inline_cost_threshold=typemax(Int)) doesn't allow for "always inline", given we cap the inlining cost at typemax(UInt16) (, which is very confusing). We're gonna need #48257 to make this work.

vtjnash added performance Must go faster compiler:optimizer Optimization passes (mostly in base/compiler/ssair/) needs nanosoldier run This PR should have benchmarks run on it labels Oct 5, 2023

aviatesk reviewed Oct 6, 2023

View reviewed changes

Merge branch 'master' into jn/inlining-cost-update-consts

5e6ab7a

vtjnash removed the needs nanosoldier run This PR should have benchmarks run on it label Oct 6, 2023

vtjnash merged commit 0ab032a into master Oct 6, 2023
7 of 8 checks passed

vtjnash deleted the jn/inlining-cost-update-consts branch October 6, 2023 13:24

maleadt mentioned this pull request Oct 26, 2023

always_inline is broken on 1.11 JuliaGPU/GPUCompiler.jl#527

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize: revise inlining costs #51599

optimize: revise inlining costs #51599

vtjnash commented Oct 5, 2023

aviatesk left a comment

aviatesk Oct 5, 2023

vtjnash Oct 6, 2023

aviatesk commented Oct 6, 2023

nanosoldier commented Oct 6, 2023

vtjnash commented Oct 6, 2023

maleadt commented Oct 26, 2023

aviatesk commented Oct 26, 2023

optimize: revise inlining costs #51599

optimize: revise inlining costs #51599

Conversation

vtjnash commented Oct 5, 2023

aviatesk left a comment

Choose a reason for hiding this comment

aviatesk Oct 5, 2023

Choose a reason for hiding this comment

vtjnash Oct 6, 2023

Choose a reason for hiding this comment

aviatesk commented Oct 6, 2023

nanosoldier commented Oct 6, 2023

vtjnash commented Oct 6, 2023

maleadt commented Oct 26, 2023

aviatesk commented Oct 26, 2023