Suboptimal invoke-related codegen for default trait impls #43150

alexcrichton · 2017-07-10T17:26:23Z

It looks like default methods in traits will always use invoke whereas defined methods will hit the optimization to not use invoke:

// bar.rs
#![crate_type = "rlib"]
pub fn bar() {}

// foo.rs
#![crate_type = "rlib"]
extern crate bar;

pub trait A: Sized {
    fn foo1(self) {
        bar::bar();
    }
    fn foo2(self);
}

impl A for i32 {
    fn foo2(self) {
        bar::bar();
    }
}

fn main() {
    0i32.foo1();
    0i32.foo2();
}

$ rustc bar.rs
$ rustc foo.rs -L . --emit llvm-ir
$ cat foo.ll

will yield this IR, notably:

; foo::A::foo1
; Function Attrs: uwtable
define internal void @_ZN3foo1A4foo117hcfa03453b669bd94E(i32) unnamed_addr #0 personality i32 (i32, i32, i64, %"unwind::libunwind::_Unwind_Exception"*, %"unwind::libunwind::_Unwind_Context"*)* @rust_eh_personality {
start:
  %personalityslot = alloca { i8*, i32 }
; invoke bar::bar
  invoke void @_ZN3bar3bar17h21868f3a4452d05bE()
          to label %bb3 unwind label %cleanup

; ...
}


; <i32 as foo::A>::foo2
; Function Attrs: uwtable
define void @"_ZN30_$LT$i32$u20$as$u20$foo..A$GT$4foo217h68eec0bc305998eeE"(i32) unnamed_addr #0 {
start:
; call bar::bar
  call void @_ZN3bar3bar17h21868f3a4452d05bE()
  br label %bb1

bb1:                                              ; preds = %start
  ret void
}

Note that <i32 as A>::foo1 uses an invoke instruction whereas <i32 as A>::foo2 does not.

The text was updated successfully, but these errors were encountered:

est31 · 2017-07-10T17:52:23Z

Related: #40254 and 8f581cc

arielb1 · 2017-07-11T10:34:58Z

This happens because we don't run MIR optimizations after monomorphization. In the "default trait impl" case, we don't know that Self: Copy, so we have to emit a landing pad that contains a destructor for Self, while in the "fn" case we do know that u32: Copy and don't emit a landing pad with a destructor call.

trans actually knows this landing pad is a no-op, but trans does not, and probably should not, perform that sort of optimization.

alexcrichton · 2017-07-11T13:56:20Z

@arielb1 I'm a little confused, is this not a bug? I'd personally at least consider this a compile time performance issue? We've had a lot of issues in the past with invoke instructions taking much longer to codege in LLVM than call instructions.

arielb1 · 2017-07-12T14:52:57Z

I'm a little confused, is this not a bug? I'd personally at least consider this a compile time performance issue? We've had a lot of issues in the past with invoke instructions taking much longer to codege in LLVM than call instructions.

The "bug" is that we are not doing MIR optimizations on monomorphized code. This is far from the only way that can cause LLVM slowness. The LLVM problem on MSVC is indeed a (different) bug.

alexcrichton · 2017-07-12T17:24:39Z

@arielb1 is there another bug to reference here? This behavior is actively causing bugs so I just want to make sure we don't forget about this.

alexcrichton · 2017-07-17T21:31:27Z

ping @arielb1, just wanted to make sure this wasn't lost, is there a way to make sure we don't forget about this?

arielb1 · 2017-11-23T12:38:13Z

@alexcrichton

i can't imagine us implementing post-monomorphization MIR optimizations in a way that does not fix this, and I find it hard to imagine us special-case-fixing this.

mati865 · 2019-01-08T11:21:58Z

i can't imagine us implementing post-monomorphization MIR optimizations in a way that does not fix this, and I find it hard to imagine us special-case-fixing this.

Is there tracking issue for that?

alexcrichton added A-codegen Area: Code generation T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jul 10, 2017

alexcrichton mentioned this issue Jul 10, 2017

A number of tests fails with undefined references to rust_eh_unwind_resume when codegen-units > 1 #43095

Closed

alexcrichton added O-windows-gnu Toolchain: GNU, Operating system: Windows and removed O-windows-gnu Toolchain: GNU, Operating system: Windows labels Jul 10, 2017

This was referenced Jul 10, 2017

Empty landing pads aren't optimized away on x86_64-pc-windows-gnu #43151

Closed

Undefined references to rust_eh_unwind_resume on x86_64-pc-windows-gnu rust-lang/compiler-builtins#177

Closed

arielb1 closed this as completed Jul 11, 2017

Aaron1011 mentioned this issue Dec 2, 2019

Post-monomorphization MIR optimizations #66969

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Suboptimal invoke-related codegen for default trait impls #43150

Suboptimal invoke-related codegen for default trait impls #43150

alexcrichton commented Jul 10, 2017

est31 commented Jul 10, 2017

arielb1 commented Jul 11, 2017

alexcrichton commented Jul 11, 2017

arielb1 commented Jul 12, 2017 •

edited

Loading

alexcrichton commented Jul 12, 2017

alexcrichton commented Jul 17, 2017

arielb1 commented Nov 23, 2017 •

edited

Loading

mati865 commented Jan 8, 2019

Suboptimal invoke-related codegen for default trait impls #43150

Suboptimal invoke-related codegen for default trait impls #43150

Comments

alexcrichton commented Jul 10, 2017

est31 commented Jul 10, 2017

arielb1 commented Jul 11, 2017

alexcrichton commented Jul 11, 2017

arielb1 commented Jul 12, 2017 • edited Loading

alexcrichton commented Jul 12, 2017

alexcrichton commented Jul 17, 2017

arielb1 commented Nov 23, 2017 • edited Loading

mati865 commented Jan 8, 2019

arielb1 commented Jul 12, 2017 •

edited

Loading

arielb1 commented Nov 23, 2017 •

edited

Loading