optimizer: determine inlineability at callsite of `is_inlineable` #48257

aviatesk · 2023-01-12T18:46:27Z

Now src::CodeInfo stores an inlining cost that is computed by inlining_cost function no matter if it is lower than or higher than the default inline_cost_threshold. This allows AbstractInterpreters to determine the inlineability of src on the fly.

This PR completes #45378.

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-01-12T19:42:12Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

Keno

LGTM overall

aviatesk · 2023-01-13T04:30:53Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-01-13T06:05:51Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2023-01-13T08:48:21Z

@nanosoldier runbenchmarks("inference", vs=":master")

aviatesk · 2023-01-13T09:00:17Z

@nanosoldier runbenchmarks(!"scalar", vs=":master")

nanosoldier · 2023-01-13T09:43:47Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

nanosoldier · 2023-01-13T15:13:40Z

Your job failed.

vtjnash · 2023-01-13T15:37:59Z

      From worker 3:      19fe4202d72 upload report for BenchmarkJob JuliaLang/julia@ec47740 vs. JuliaLang/julia@1ee253d                                  
      From worker 3:                                                                                                                                      
      From worker 3:    If you want to keep it by creating a new branch, this may be a good time                                                          
      From worker 3:    to do so with:                                                                                                                    
      From worker 3:                                                                                                                                            From worker 3:     git branch <new-branch-name> 19fe4202d72                                                                                               From worker 3:                                                                                                                                      
      From worker 3:    Switched to branch 'master'                                                                                                       
      From worker 3:    Your branch is up to date with 'origin/master'.                                                                                   
      From worker 3:    Fetching origin                                                                                                                   
      From worker 3:    From https://github.com/JuliaCI/NanosoldierReports                                                                                
      From worker 3:       45ea81d6d5d..4dc519d6eb9  master     -> origin/master                                                                          
      From worker 3:       2cf9410941a..f19f563e468  gh-pages   -> origin/gh-pages                                                                        
      From worker 3:    HEAD is now at 4dc519d6eb9 upload report for PkgEvalJob JuliaLang/julia@b07484c [2023-01-12]                                      
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/data.tar.zst                                                                    
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/1ee253d3291728fc74051aff7cd6a5ed02b9f77d_against.out                       
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/1ee253d3291728fc74051aff7cd6a5ed02b9f77d_build.err                         
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/1ee253d3291728fc74051aff7cd6a5ed02b9f77d_build.out                         
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/ec4774049d7a30228cdea28a921f0fcb0bbb2d46_build.err                         
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/ec4774049d7a30228cdea28a921f0fcb0bbb2d46_build.out                         
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/logs/ec4774049d7a30228cdea28a921f0fcb0bbb2d46_primary.out                       
      From worker 3:    Auto-merging benchmark/by_hash/ec47740_vs_1ee253d/report.md                                                                       
      From worker 3:    The previous cherry-pick is now empty, possibly due to conflict resolution.                                                       
      From worker 3:    If you wish to commit it anyway, use:                                                                                             
      From worker 3:                                                                                                                                      
      From worker 3:        git commit --allow-empty                                                                                                      
      From worker 3:                                                                                                                                      
      From worker 3:    Otherwise, please use 'git cherry-pick --skip'                                                                                    
      From worker 3:    On branch master                                                                                                                  
      From worker 3:    Your branch is up to date with 'origin/master'.                                                                                   
      From worker 3:                                                                                                                                      
      From worker 3:    You are currently cherry-picking commit 19fe4202d72.                                                                              
      From worker 3:      (all conflicts fixed: run "git cherry-pick --continue")                                                                         
      From worker 3:      (use "git cherry-pick --skip" to skip this patch)                                                                               
      From worker 3:      (use "git cherry-pick --abort" to cancel the cherry-pick operation)                                                             
      From worker 3:                                                                                                                                      
      From worker 3:    nothing to commit, working tree clean                                                                                             
┌ Info: [Node 3 | 2023-01-13T10:13:41.368]: failed job: BenchmarkJob JuliaLang/julia@3c6c967 vs. JuliaLang/julia@1ee253d                                  
│ On worker 3:                                                               
│ NanosoldierError: error when preparing/pushing to report repo: failed process: Process(setenv(`/home/nanosoldier/.julia/artifacts/33c5e3a13ad6427f86436f

@maleadt I needed to fix this manually

rm -r benchmark/by_hash/ec47740_vs_1ee253d/
git add -u
git commit -m temporary
git cherry-pick -n <>
git commit --am -C <>

JeffBezanson · 2023-01-13T19:53:55Z

Will this affect checks like

                (jl_ir_inlining_cost((jl_array_t*)inferred) == UINT16_MAX)) {

in precompile_utils.c?
I guess the query is no longer well-formed, since a function might be inlined in some cases but not others? We could just drop this heuristic but it is pretty effective.

aviatesk · 2023-01-15T08:28:50Z

Will this affect checks like
                (jl_ir_inlining_cost((jl_array_t*)inferred) == UINT16_MAX)) {
in precompile_utils.c? I guess the query is no longer well-formed, since a function might be inlined in some cases but not others? We could just drop this heuristic but it is pretty effective.

Yes, and I think we can safely replace that condition with jl_isa_compileable_sig((jl_tupletype_t*)mi->specTypes, mi->sparam_vals, mi->def.method) since we should have discarded inlineable sources already on Julia-level at:

julia/base/compiler/typeinfer.jl

Lines 349 to 352 in 7bad950

    
           if may_discard_trees(interp) 
        
               cache_the_tree = ci.inferred && ( 
        
                   is_inlineable(interp, ci, InlineeMetaInfo(ci.rettype, mi)) || 
        
                   isa_compileable_sig(mi.specTypes, mi.sparam_vals, def))

(I'm not sure why we don't want to discard non-inlineable sources that are isa_compileable_sig though)

EDIT: we can't use the isa_compileable_sig since there may be a source that is inlineable and isa_compileable_sig.

base/compiler/typeinfer.jl

src/precompile_utils.c

aviatesk · 2023-01-16T10:01:18Z

@nanosoldier runbenchmarks("inference", vs=":master")

nanosoldier · 2023-01-16T10:56:40Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2023-01-22T08:02:48Z

src/precompile_utils.c

            if (inferred &&
                inferred != jl_nothing &&
                jl_ir_flag_inferred((jl_array_t*)inferred) &&
-                (jl_ir_inlining_cost((jl_array_t*)inferred) == UINT16_MAX)) {


@vchuravy Could you please elaborate why we have this check and want to precompile sources that aren't inlineable?

cc: @vtjnash and @timholy

My expectation would be that we want a warm cache for inference even for sources we inlined.

@timholy or @vtjnash, could you please provide some insights on why we have this check and whether it's still effective nowadays? The equivalent check is implemented in this commit (3a20e6a), but if we can remove the logic, it would simplify this PR very much.

Okay, so it's pretty effective, since we hit this condition when the isa_compileable_sig(linfo.specTypes, linfo.sparam_vals, def) case in this line holds

julia/base/compiler/typeinfer.jl

Line 344 in 93df7e2

cache_the_tree = ci.inferred && (is_inlineable(ci) || isa_compileable_sig(linfo.specTypes, linfo.sparam_vals, def))

, which happens pretty frequently.

aviatesk · 2023-04-01T16:56:31Z

@nanosoldier runbenchmarks(!"scalar", vs=":master")

nanosoldier · 2023-04-01T22:36:39Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

aviatesk · 2023-04-04T15:10:34Z

@nanosoldier runbenchmarks(!"scalar", vs=":master")

nanosoldier · 2023-04-05T01:17:31Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here.

Now `src::CodeInfo` stores an inlining cost that is computed by `inlining_cost` function no matter if it is lower than or higher than the default `inline_cost_threshold`. This allows `AbstractInterpreter`s to determine the inlineability of `src` on the fly.

Keno approved these changes Jan 13, 2023

View reviewed changes

Base automatically changed from avi/inline-decl to master January 13, 2023 03:58

aviatesk force-pushed the avi/inlining_cost branch from c4f2391 to c4b5274 Compare January 13, 2023 04:00

aviatesk force-pushed the avi/inlining_cost branch from c4b5274 to 3c6c967 Compare January 13, 2023 07:55

aviatesk force-pushed the avi/inlining_cost branch from 3c6c967 to 7bad950 Compare January 13, 2023 13:51

aviatesk commented Jan 15, 2023

View reviewed changes

base/compiler/typeinfer.jl Outdated Show resolved Hide resolved

aviatesk force-pushed the avi/inlining_cost branch from 7bad950 to 1120df3 Compare January 16, 2023 08:33

aviatesk commented Jan 16, 2023

View reviewed changes

src/precompile_utils.c Outdated Show resolved Hide resolved

aviatesk force-pushed the avi/inlining_cost branch from 1120df3 to af51073 Compare January 16, 2023 08:36

maleadt mentioned this pull request Jan 17, 2023

Report URL should be unique JuliaCI/Nanosoldier.jl#83

Open

aviatesk force-pushed the avi/inlining_cost branch 3 times, most recently from 5b5e6ec to 3bebd67 Compare January 22, 2023 08:01

aviatesk commented Jan 22, 2023

View reviewed changes

aviatesk force-pushed the avi/inlining_cost branch 2 times, most recently from c9c9c95 to 14454bf Compare April 1, 2023 16:55

aviatesk force-pushed the avi/inlining_cost branch 4 times, most recently from c49de5a to 7803de2 Compare April 4, 2023 14:48

aviatesk force-pushed the avi/inlining_cost branch from 7803de2 to ad06a5c Compare May 11, 2023 09:05

aviatesk mentioned this pull request Oct 26, 2023

optimize: revise inlining costs #51599

Merged

aviatesk mentioned this pull request Nov 15, 2025

use CC.MAX_INLINE_COST instead of typemax(Int) JuliaGPU/GPUCompiler.jl#741

Merged

Uh oh!

optimizer: determine inlineability at callsite of is_inlineable #48257

Are you sure you want to change the base?

optimizer: determine inlineability at callsite of is_inlineable #48257

Uh oh!

Conversation

aviatesk commented Jan 12, 2023

Uh oh!

nanosoldier commented Jan 12, 2023

Uh oh!

Keno left a comment

Choose a reason for hiding this comment

Uh oh!

aviatesk commented Jan 13, 2023

Uh oh!

nanosoldier commented Jan 13, 2023

Uh oh!

aviatesk commented Jan 13, 2023

Uh oh!

aviatesk commented Jan 13, 2023

Uh oh!

nanosoldier commented Jan 13, 2023

Uh oh!

nanosoldier commented Jan 13, 2023

Uh oh!

vtjnash commented Jan 13, 2023

Uh oh!

JeffBezanson commented Jan 13, 2023

Uh oh!

aviatesk commented Jan 15, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aviatesk commented Jan 16, 2023

Uh oh!

nanosoldier commented Jan 16, 2023

Uh oh!

aviatesk Jan 22, 2023

Choose a reason for hiding this comment

Uh oh!

vchuravy Jan 22, 2023

Choose a reason for hiding this comment

Uh oh!

aviatesk Apr 2, 2023

Choose a reason for hiding this comment

Uh oh!

aviatesk Apr 4, 2023

Choose a reason for hiding this comment

Uh oh!

aviatesk commented Apr 1, 2023

Uh oh!

nanosoldier commented Apr 1, 2023

Uh oh!

aviatesk commented Apr 4, 2023

Uh oh!

nanosoldier commented Apr 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

optimizer: determine inlineability at callsite of `is_inlineable` #48257

optimizer: determine inlineability at callsite of `is_inlineable` #48257

aviatesk commented Jan 15, 2023 •

edited

Loading