You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Dec 22, 2021. It is now read-only.
@AndrewScheidecker mentioned in his review of #1 the possibility of including vectorized bit counting instructions to match the existing scalar instructions. They would have these signatures:
i8x16.clz(x: v128) -> v128
i16x8.clz(x: v128) -> v128
i32x4.clz(x: v128) -> v128
i64x2.clz(x: v128) -> v128
i8x16.ctz(x: v128) -> v128
i16x8.ctz(x: v128) -> v128
i32x4.ctz(x: v128) -> v128
i64x2.ctz(x: v128) -> v128
At least AArch64 has vectorized CLZ and RBIT instructions that could be used to implement this. But they could be quite impractical to emulate on other platforms.
Are these instructions widely available in SIMD instruction sets?
Are there plausible applications for these instructions?