Skip to content
This repository was archived by the owner on Dec 22, 2021. It is now read-only.
This repository was archived by the owner on Dec 22, 2021. It is now read-only.

Consider adding Horizontal Add #20

@dtig

Description

@dtig

Packed horizontal arithmetic is reasonably performant on SSE3+ and Neon. These would be useful for complex multiplications, and in the absence of the opcodes below, these would need to be a combination of shifts and adds.

f32x4.addHoriz(x: v128, y:v128) -> v128
i32x4.addHoriz(x: v128, y:v128) -> v128
i16x8.addHoriz(x: v128, y:v128) -> v128

Thoughts on whether horizontal add instructions would be useful to include in the current SIMD spec?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions