-
Notifications
You must be signed in to change notification settings - Fork 13.4k
amdgcn failing to select llvm.vector.reduce.fmaximum.v2f32(<2 x float> intrinsic #67815
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@llvm/issue-subscribers-bug
```
LLVM ERROR: Cannot select: 0x1743efdb048: f32 = fmaximum # D:1 0x1743efe0e58, 0x1743efe0ec8
```
Detailed dump here More context on iree-org/iree#15064 |
@llvm/issue-subscribers-backend-amdgpu
```
LLVM ERROR: Cannot select: 0x1743efdb048: f32 = fmaximum # D:1 0x1743efe0e58, 0x1743efe0ec8
```
Detailed dump here More context on iree-org/iree#15064 |
debug dump with debug symbols here |
@jayfoad Could you take a look or help us find someone familiar with this part of the codebase? |
I have not looked in any detail, but IR has 3 different kinds of floating point max with different rules for nans and signed zeros: fmaxnum, fmaxnum_ieee and fmaximum. The backend does not natively support fmaximum. I'm not sure whether sdag should have lowered it to something that the backend does support. Is it possible you could switch to using a different flavor of max in the first place? Or do you really need the fmaximum semantics? |
@arsenm is probably the most familiar with this. |
on the mlir side this is the change that introduced this semantics It seems to have done after some discussions @kuhar any thoughts how to unblock amdgpu on this? I do believe that the correct thing to do is that the backend should support thse |
Somehow nobody ever implemented legalization for maximum/minimum. There's nothing AMDGPU specific about this |
I think this is just missing SD node selection on the AMDGPU side, for example the NVPTX backend selects it here
|
Looks like #67301 might fix this. |
Detailed dump here
More context on iree-org/iree#15064
The text was updated successfully, but these errors were encountered: