replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type #1350

shawnl · 2018-08-07T16:32:10Z

`@mul(comptime T: type, a: T, b: T) @inttype(t.is_signed, T.bit_count * 2)

some architectures, such as mips[1] and sh4[2] do not have overflowing multiplication. Instead they return a multiplication result that is twice as big as the input types. This is how multiplication is done in hardware, the result. For 128X128-,>256 we will need a new u256 type and upstream work (also for 256X256->512[3] and 512X512->1024[4])

[1] https://stackoverflow.com/questions/16050338/mips-integer-multiplication-and-division
[2] http://www.shared-ptr.com/sh_insns.
[3] https://stackoverflow.com/questions/34234407/is-there-hardware-support-for-128bit-integers-in-modern-processors/50776753#50776753 The x86/x64, however, is a superscalar CPU, and the registers you know of are merely the architectural registers. Behind the scenes, there are a lot more registers that help optimize the CPU pipeline to perform out of order instructions using multiple ALUs. While the x64 may not be a 128-bit CPU, SSE/SSE2 introduced native 128-bit math, AVX introduced 256-bit native integer math, and AVX2 introduced 512-bit integer math. When returning from functions you will return the value in the 128-bit XMM0 SSE/SSE2 register, 256-bit AVX results in YMM0, and 512-bit AVX2 results in ZMM0; these, however, are add-ons to the x86/x64, not the primary architecture and support is entirely compiler and release platform (such as Python) dependent.
[4] https://en.wikipedia.org/wiki/AVX-512

andrewrk · 2018-08-07T17:33:43Z

Please elaborate:

explanation of use case
proposed API
example test case that should pass if this is implemented

I'll re-open when you provide these things.

shawnl · 2018-08-07T22:05:36Z

Sorry I was on my phone

andrewrk · 2018-08-07T22:14:30Z

Thanks! No worries, just wanted to throw the ball back in your court

shawnl · 2018-08-07T22:36:09Z

can you add the optimization tag, because this would provide access to these instructions.

andrewrk · 2018-08-07T23:16:29Z

I also added upstream, because arguably this should exist in LLVM. https://llvm.org/docs/LangRef.html#llvm-smul-with-overflow-intrinsics

shawnl · 2018-08-07T23:47:02Z

the sparc origins of llvm is showing. reported upstream: https://bugs.llvm.org/show_bug.cgi?id=38475

shawnl · 2018-08-08T14:18:15Z

If you look at the upstream hug it looks like we do not need upstream work because value range propagation uses the 32*32->64 instructions.

andrewrk · 2018-08-08T15:28:07Z

I see, thanks for the follow-up. Here is my counter proposal:

@mul(comptime T: type, x: T, y: T) @IntType(T.is_signed, T.bit_count * 2)

No possibility of failure, no pointers, it just has a double-wide return type.

andrewrk · 2019-02-15T06:31:45Z

I think this may be better as a userland function. The language intrinsic should probably stay mapped directly to the LLVM intrinsic.

in std.math:

fn wideMul(comptime T: type, x: T, y: T) @IntType(T.is_signed, T.bit_count * 2) {
    const ResultInt = @IntType(T.is_signed, T.bit_count * 2);
    return ResultInt(x) * ResultInt(y);
}

This is what Zig would generate anyway for this intrinsic, so making it a compiler primitive would not be an optimization.

Feel free to make additional arguments for this intrinsic.

andrewrk closed this as completed Aug 7, 2018

shawnl changed the title ~~@multiply that returns result twice as wide as input.~~ replace @mulWithOverflow(u64, x, radix, &x) with @multply(u64, x, radix, &low, &high) Aug 7, 2018

shawnl changed the title ~~replace @mulWithOverflow(u64, x, radix, &x) with @multply(u64, x, radix, &low, &high)~~ replace @mulWithOverflow(u64, x, radix, &x) with @multiply(u64, x, radix, &low, &high) Aug 7, 2018

shawnl changed the title ~~replace @mulWithOverflow(u64, x, radix, &x) with @multiply(u64, x, radix, &low, &high)~~ replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, x, radix, &low, &high) Aug 7, 2018

andrewrk reopened this Aug 7, 2018

andrewrk added this to the 0.4.0 milestone Aug 7, 2018

andrewrk added the proposal This issue suggests modifications. If it also has the "accepted" label then it is planned. label Aug 7, 2018

shawnl changed the title ~~replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, x, radix, &low, &high)~~ replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, x, radix, &high, &low) Aug 7, 2018

andrewrk added optimization upstream An issue with a third party project that Zig uses. labels Aug 7, 2018

andrewrk removed the upstream An issue with a third party project that Zig uses. label Aug 8, 2018

shawnl changed the title ~~replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, x, radix, &high, &low)~~ replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type Aug 8, 2018

shawnl mentioned this issue Aug 18, 2018

std/crypto: add chacha20 #1369

Merged

andrewrk closed this as completed Feb 15, 2019

shawnl mentioned this issue Mar 27, 2019

add math.wideMul #2111

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type #1350

replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type #1350

shawnl commented Aug 7, 2018 •

edited

Loading

andrewrk commented Aug 7, 2018

shawnl commented Aug 7, 2018

andrewrk commented Aug 7, 2018

shawnl commented Aug 7, 2018

andrewrk commented Aug 7, 2018 •

edited

Loading

shawnl commented Aug 7, 2018

shawnl commented Aug 8, 2018

andrewrk commented Aug 8, 2018 •

edited

Loading

andrewrk commented Feb 15, 2019 •

edited

Loading

replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type #1350

replace @mulWithOverflow(u64, x, radix, &x) with @mul(u64, a, b) that returns a double width type #1350

Comments

shawnl commented Aug 7, 2018 • edited Loading

andrewrk commented Aug 7, 2018

shawnl commented Aug 7, 2018

andrewrk commented Aug 7, 2018

shawnl commented Aug 7, 2018

andrewrk commented Aug 7, 2018 • edited Loading

shawnl commented Aug 7, 2018

shawnl commented Aug 8, 2018

andrewrk commented Aug 8, 2018 • edited Loading

andrewrk commented Feb 15, 2019 • edited Loading

shawnl commented Aug 7, 2018 •

edited

Loading

andrewrk commented Aug 7, 2018 •

edited

Loading

andrewrk commented Aug 8, 2018 •

edited

Loading

andrewrk commented Feb 15, 2019 •

edited

Loading