Regression: Missed fold for x * 0 #50630

jeremy-rifkin · 2021-07-31T01:24:50Z

Extended Description

Clang trunk is not generating optimal code for a (very poor) multiplication routine https://godbolt.org/z/GrjMhsTcv.

int mul(int x, int y) {
    int p = 0;
    while(x--) p += y;
    return p;
}

Should compile to:

mul(int, int):
        mov     eax, edi
        imul    eax, esi
        ret

And it does in clang 12 but not in clang trunk.

If I'm not mistaken it appears a select i1 (icmp eq i32 %x, 0), i32 0, i32 (mul i32 %x, %y) -> mul i32 %x, %y fold is not firing.

The text was updated successfully, but these errors were encountered:

jeremy-rifkin · 2021-07-31T01:24:50Z

assigned to @nerh

ghost · 2021-08-15T14:59:00Z

Aforementioned code snipped is no longer folded into single multiplication instruction after changes in "simplifyWithOpReplaced".

If there are no objections I'd like to take this issue fix the optimization.

RKSimon · 2021-08-15T15:20:58Z

SGTM

Current IR:

define i32 @mul(i32 %0, i32 %1) {
  %3 = icmp eq i32 %0, 0
  %4 = mul i32 %1, %0
  %5 = select i1 %3, i32 0, i32 %4
  ret i32 %5
}

To get rid of that select we will need freeze: https://alive2.llvm.org/ce/z/maX0k-

ghost · 2021-08-21T10:18:47Z

rotateright · 2021-09-15T13:10:31Z

We have the expected codegen again (just a multiply, no cmov) after:
https://reviews.llvm.org/rGf5d89523567b

RKSimon mentioned this issue Apr 12, 2021

[Meta] Missed combines using freeze #49274

Open

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 11, 2021

This issue was closed.