Implement the `dot4add_u8packed` HLSL Function #99219

farzonl · 2024-07-16T20:14:01Z

DirectX

DXIL Opcode	DXIL OpName	Shader Model	Shader Stages
164	Dot4AddU8Packed	6.4	()

SPIR-V

OpUDot:

Description:

Unsigned integer dot product of Vector 1 and Vector 2.

Result Type must be an integer type with Signedness of 0 whose
Width must be greater than or equal to that of the components of
Vector 1 and Vector 2.

Vector 1 and Vector 2 must have the same type.

Vector 1 and Vector 2 must be either 32-bit integers (enabled by the
DotProductInput4x8BitPacked capability) or vectors of
integer type with Signedness of 0 (enabled by the
DotProductInput4x8Bit or DotProductInputAll
capability).

When Vector 1 and Vector 2 are scalar integer types, Packed Vector
Format must be specified to select how the integers are to be
interpreted as vectors.

All components of the input vectors are zero-extended to the bit width
of the resultâ€™s type. The zero-extended input vectors are then
multiplied component-wise and all components of the vector resulting
from the component-wise multiplication are added together. The resulting
value will equal the low-order N bits of the correct result R, where N
is the result width and R is computed with enough precision to avoid
overflow and underflow.

Capability:
DotProduct

Missing before version 1.6.

Word Count	Opcode	Results	Operands
5 + variable	4451	<id> Result Type	Result <id>	<id> Vector 1	<id> Vector 2	Optional Packed Vector Format Packed Vector Format

Test Case(s)

Example 1

//dxc dot4add_u8packed_test.hlsl -T lib_6_8 -enable-16bit-types -O0

export uint fn(uint p1, uint p2, uint p3) {
    return dot4add_u8packed(p1, p2, p3);
}

HLSL:

Syntax

uint dot4add_u8packed(uint a, uint b, uint c);

Type Description

Name	Template Type	Component Type	Size
ret	scalar	uint	1
a	scalar	uint	1
b	scalar	uint	1
c	scalar	uint	1

Minimum Shader Model

This function is supported in the following shader models.

Shader Model	Supported
Shader Model 6.4 and higher shader models	yes

Shader Stages

```- create a clang built-in in Builtins.td - link dot4add_u8packed in hlsl_intrinsics.h - add lowering to spirv backend through expansion of operation as OpUDot is missing up to SPIRV 1.6 in SPIRVInstructionSelector.cpp - add lowering to spirv backend using OpUDot if applicable SPIRV version or SPV_KHR_integer_dot_product is enabled - add dot4add_u8packed intrinsic to IntrinsicsDirectX.td and mapping to DXIL.td op Dot4AddU8Packed - add tests for HLSL intrinsic lowering to dx/spv intrinsic in dot4add_u8packed.hlsl - add tests for sema checks in dot4add_u8packed-errors.hlsl - add test of spir-v lowering in SPIRV/dot4add_u8packed.ll - add test to dxil lowering in DirectX/dot4add_u8packed.ll ``` Resolves #99219

llvmbot · 2024-11-07T18:41:23Z

@llvm/issue-subscribers-clang-codegen

Author: Farzon Lotfi (farzonl)

- [ ] Implement `dot4add_u8packed` clang builtin, - [ ] Link `dot4add_u8packed` clang builtin with `hlsl_intrinsics.h` - [ ] Add sema checks for `dot4add_u8packed` to `CheckHLSLBuiltinFunctionCall` in `SemaChecking.cpp` - [ ] Add codegen for `dot4add_u8packed` to `EmitHLSLBuiltinExpr` in `CGBuiltin.cpp` - [ ] Add codegen tests to `clang/test/CodeGenHLSL/builtins/dot4add_u8packed.hlsl` - [ ] Add sema tests to `clang/test/SemaHLSL/BuiltIns/dot4add_u8packed-errors.hlsl` - [ ] Create the `int_dx_dot4add_u8packed` intrinsic in `IntrinsicsDirectX.td` - [ ] Create the `DXILOpMapping` of `int_dx_dot4add_u8packed` to `164` in `DXIL.td` - [ ] Create the `dot4add_u8packed.ll` and `dot4add_u8packed_errors.ll` tests in `llvm/test/CodeGen/DirectX/` - [ ] Create the `int_spv_dot4add_u8packed` intrinsic in `IntrinsicsSPIRV.td` - [ ] In SPIRVInstructionSelector.cpp create the `dot4add_u8packed` lowering and map it to `int_spv_dot4add_u8packed` in `SPIRVInstructionSelector::selectIntrinsic`. - [ ] Create SPIR-V backend test case in `llvm/test/CodeGen/SPIRV/hlsl-intrinsics/dot4add_u8packed.ll`

DirectX

DXIL Opcode	DXIL OpName	Shader Model	Shader Stages
164	Dot4AddU8Packed	6.4	()

SPIR-V

OpUDot:

Description:

Unsigned integer dot product of Vector 1 and Vector 2.

Result Type must be an integer type with Signedness of 0 whose
Width must be greater than or equal to that of the components of
Vector 1 and Vector 2.

Vector 1 and Vector 2 must have the same type.

Vector 1 and Vector 2 must be either 32-bit integers (enabled by the
DotProductInput4x8BitPacked capability) or vectors of
integer type with Signedness of 0 (enabled by the
DotProductInput4x8Bit or DotProductInputAll
capability).

When Vector 1 and Vector 2 are scalar integer types, Packed Vector
Format must be specified to select how the integers are to be
interpreted as vectors.

All components of the input vectors are zero-extended to the bit width
of the resultâ€™s type. The zero-extended input vectors are then
multiplied component-wise and all components of the vector resulting
from the component-wise multiplication are added together. The resulting
value will equal the low-order N bits of the correct result R, where N
is the result width and R is computed with enough precision to avoid
overflow and underflow.

Capability:
DotProduct

Missing before version 1.6.

<table style="width:100%;">
<colgroup>
<col style="width: 14%" />
<col style="width: 14%" />
<col style="width: 14%" />
<col style="width: 14%" />
<col style="width: 14%" />
<col style="width: 14%" />
<col style="width: 14%" />
</colgroup>
<thead>
<tr>
<th>Word Count</th>
<th>Opcode</th>
<th>Results</th>
<th>Operands</th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td class="tableblock halign-left valign-top">5 + variable</td>
<td class="tableblock halign-left valign-top">4451</td>
<td
class="tableblock halign-left valign-top"><id> 
Result Type</td>
<td class="tableblock halign-left valign-top"><a
href="#ResultId">Result <id></a></td>
<td
class="tableblock halign-left valign-top"><id> 
Vector 1</td>
<td
class="tableblock halign-left valign-top"><id> 
Vector 2</td>
<td class="tableblock halign-left valign-top">Optional 
<a href="#Packed_Vector_Format">Packed Vector Format</a> 
Packed Vector Format</td>
</tr>
</tbody>
</table>

Test Case(s)

Example 1

//dxc dot4add_u8packed_test.hlsl -T lib_6_8 -enable-16bit-types -O0

export uint fn(uint p1, uint p2, uint p3) {
    return dot4add_u8packed(p1, p2, p3);
}

HLSL:

Syntax

uint dot4add_u8packed(uint a, uint b, uint c);

Type Description

Name	Template Type	Component Type	Size
ret	scalar	uint	1
a	scalar	uint	1
b	scalar	uint	1
c	scalar	uint	1

Minimum Shader Model

This function is supported in the following shader models.

Shader Model	Supported
Shader Model 6.4 and higher shader models	yes

Shader Stages

```- create a clang built-in in Builtins.td - link dot4add_u8packed in hlsl_intrinsics.h - add lowering to spirv backend through expansion of operation as OpUDot is missing up to SPIRV 1.6 in SPIRVInstructionSelector.cpp - add lowering to spirv backend using OpUDot if applicable SPIRV version or SPV_KHR_integer_dot_product is enabled - add dot4add_u8packed intrinsic to IntrinsicsDirectX.td and mapping to DXIL.td op Dot4AddU8Packed - add tests for HLSL intrinsic lowering to dx/spv intrinsic in dot4add_u8packed.hlsl - add tests for sema checks in dot4add_u8packed-errors.hlsl - add test of spir-v lowering in SPIRV/dot4add_u8packed.ll - add test to dxil lowering in DirectX/dot4add_u8packed.ll ``` Resolves llvm#99219

farzonl added backend:DirectX backend:SPIR-V bot:HLSL HLSL HLSL Language Support metabug Issue to collect references to a group of similar or related issues. labels Jul 16, 2024

github-project-automation bot added this to HLSL Support Jul 16, 2024

farzonl mentioned this issue Jul 16, 2024

Implement the entire HLSL API set. #99235

Open

farzonl mentioned this issue Aug 7, 2024

Intrinsics used by DML shaders are implemented llvm/wg-hlsl#30

Closed

52 tasks

damyanp moved this to Ready in HLSL Support Oct 9, 2024

inbelic self-assigned this Oct 21, 2024

damyanp moved this from Ready to Active in HLSL Support Oct 22, 2024

inbelic moved this from Active to Ready in HLSL Support Nov 4, 2024

inbelic mentioned this issue Nov 5, 2024

[HLSL][SPIRV][DXIL] Implement dot4add_u8packed intrinsic #115068

Merged

inbelic moved this from Ready to Needs Review in HLSL Support Nov 6, 2024

inbelic moved this from Needs Review to Active in HLSL Support Nov 6, 2024

inbelic moved this from Active to Needs Review in HLSL Support Nov 6, 2024

inbelic closed this as completed in #115068 Nov 7, 2024

github-project-automation bot moved this from Needs Review to Closed in HLSL Support Nov 7, 2024

EugeneZelenko added clang:headers Headers provided by Clang, e.g. for intrinsics clang:codegen IR generation bugs: mangling, exceptions, etc. labels Nov 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement the `dot4add_u8packed` HLSL Function #99219

Implement the `dot4add_u8packed` HLSL Function #99219

farzonl commented Jul 16, 2024

llvmbot commented Nov 7, 2024

DirectX

SPIR-V

OpUDot:

Description:

Test Case(s)

Example 1

HLSL:

Syntax

Type Description

Minimum Shader Model

Shader Stages

See also

Implement the dot4add_u8packed HLSL Function #99219

Implement the dot4add_u8packed HLSL Function #99219

Comments

farzonl commented Jul 16, 2024

DirectX

SPIR-V

OpUDot:

Description:

Test Case(s)

Example 1

HLSL:

Syntax

Type Description

Minimum Shader Model

Shader Stages

See also

llvmbot commented Nov 7, 2024

DirectX

SPIR-V

OpUDot:

Description:

Test Case(s)

Example 1

HLSL:

Syntax

Type Description

Minimum Shader Model

Shader Stages

See also

Implement the `dot4add_u8packed` HLSL Function #99219

Implement the `dot4add_u8packed` HLSL Function #99219