Add test for UAV sequential consistency #225

llvm-beanz · 2025-06-05T19:58:17Z

This test aims to verify that UAV accesses are performed sequentially
and that the reads and writes are consistent such that a read which
occurs after a write must observe the effect of the write.

This test aims to verify that UAV accesses are performed sequentially and that the reads and writes are consistent such that a read which occurs after a write must observe the effect of the write.

This just helps with debugging when things fail.

Aadded a lit feature for Intel UHD drivers that are exhibiting this failure. A subesquent change will update the test to XFAIL based on this feature.

spall · 2025-06-09T20:35:01Z

test/Bugs/UAV-Sequental-Consistency.yaml

+  Result[0] = X[0] + 1;
+  Result[1] = X[1] + Result[0];
+
+  Result[2] = X[0] + 2;


isn't this equivalent to lines 8 and 9? Is the bug not observable if 11 and 12 are not included?

What if you reorder them yourself? will intel move them back?

I was unable to reproduce the issue if I didn't have both sequences of reads and
writes. I suspect whatever is going on in their optimizer depends on the amount
of adjacent data being loaded.

right; that makes sense

I wouldn't expect lines 8 and 9 alone to reproduce the issue, if the bug comes from a coalesced load not invalidating previously read locations upon store.

The first read from Result occurs after the first write to Result[0], so if it's a coalescing bug as I hypothesized, the coalesced load would happen after that first write. It would require another read back from a location overwritten after the coalesced load to expose the bug.

That said, this probably could be reduced by one line, since the write to Result[1] and a read back of that should be sufficient for the repro.

[numthreads(1,1,1)] void main() { Result[0] = X[0] + 1; // The write to Result[1] here should invalidate any coalesced load of that // value during the read of Result[0]. Result[1] = X[1] + Result[0]; // Now, this should be sufficient to expose the invalid use of Result[1] // from the coalesced load. Result[2] = X[2] + Result[1]; }

I feel like the more adjacent the location is to the coalesced value loaded, the more likely we are to catch a bug of this kind.

To be clear, I'm not asking for this change to be made. I'm just trying to illustrate what could be an even more minimal repro, if I understand the issue correctly.

Oh, I just realized... If the Adjacent-Partial-Writes.yaml test fails (wow!), this issue might be slightly different than what I thought, but would manifest in the same way in this case. I'm surprised a driver would get away with a bug that causes this prior simple case to fail!

I iterated a bunch trying to reduce this further. This was the smallest that I got to reproduce it. This is partially made more challenging to iterate on because it only impacts Intel UHD drivers, and the only one of those I have access to is the GitHub action runner here, which is way over subscribed on its work.

Without a clearer understanding of the underling issue (which has been reported to Intel to investigate) I don't want to spend more time reducing the already quite small test case.

I've grouped these both under the same LIT check because they seem to be loosely related, and I can imagine they could both be caused by unsafe load/store optimizations resulting in mis-compiles. They may not be the same issue, we'll just have to wait until Intel can investigate and identify the problem.

spall · 2025-06-09T20:36:27Z

test/Bugs/UAV-Sequental-Consistency.yaml

+
+# RUN: split-file %s %t
+# RUN: %dxc_target -T cs_6_5 -Fo %t.o %t/source.hlsl
+# RUN: %offloader %t/pipeline.yaml %t.o


This is also reduced from a case that was failing on Intel UHD drivers.

…ial-consistency

tex3d · 2025-06-11T18:51:14Z

test/Bugs/Adjacent-Partial-Writes.yaml

+    ZeroInitSize: 48
+  - Name: ExpectedOut # The result we expect
+    Format: UInt32
+    Stride: 16
+    Data: [3, 0, 32, 64, 3, 0, 32, 64, 3, 0, 0, 0]


Could we initialize the output to a unique sequence instead of zero? That would help differentiate a lack of writing outputs from overwriting adjacent locations with zeros.

Suggested change

ZeroInitSize: 48

- Name: ExpectedOut # The result we expect

Format: UInt32

Stride: 16

Data: [3, 0, 32, 64, 3, 0, 32, 64, 3, 0, 0, 0]

Data: [101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112]

- Name: ExpectedOut # The result we expect

Format: UInt32

Stride: 16

Data: [3, 102, 32, 64, 3, 106, 32, 64, 3, 110, 111, 112]

Putting sentinel values in the output buffer is a good idea, although the change here also changes the expected output which isn't correct (since all values in the expected output are written to by the shader). I'll push an update.

Edit: except the last two... the other 0's are intended to be 0.

spall

lgtm

llvm-beanz changed the title ~~Cbieneman/uav sequential consistency~~ Add test for UAV sequential consistency Jun 5, 2025

This was referenced Jun 6, 2025

Track Intel UHD memory sequential coherence bug #226

Open

Make HLK test from llvm/offload-test-suite#225 microsoft/DirectXShaderCompiler#7521

Open

llvm-beanz force-pushed the cbieneman/uav-sequential-consistency branch from 75b4269 to aafd130 Compare June 7, 2025 15:18

llvm-beanz added 4 commits June 9, 2025 11:27

Add test for UAV sequential consistency

8f847c3

This test aims to verify that UAV accesses are performed sequentially and that the reads and writes are consistent such that a read which occurs after a write must observe the effect of the write.

Simplify test by removing second SRV

e2ba531

Add a step to dump the API-query output

e037116

This just helps with debugging when things fail.

Add lit feature for Intel UHD

941e510

Aadded a lit feature for Intel UHD drivers that are exhibiting this failure. A subesquent change will update the test to XFAIL based on this feature.

llvm-beanz force-pushed the cbieneman/uav-sequential-consistency branch from aafd130 to 941e510 Compare June 9, 2025 16:27

spall reviewed Jun 9, 2025

View reviewed changes

llvm-beanz and others added 7 commits June 9, 2025 18:50

Add partial writes test

288ceaf

This is also reduced from a case that was failing on Intel UHD drivers.

Expanding the case since it doesn't seem to be triggering

4c0e8d0

Fixing int64 test

5c937c5

Reduce test

1d80f1c

Merge remote-tracking branch 'origin/main' into cbieneman/uav-sequent…

1c078e9

…ial-consistency

Mark failing tests as XFAIL on UHD drivers

6787fd3

python format

006ceb0

tex3d reviewed Jun 11, 2025

View reviewed changes

llvm-beanz added 3 commits June 12, 2025 08:52

Use sentinel values in the initialized output buffer

9f1b71b

Enable Int64 for Metal

fa54de5

Updating XFAILS and adding comments

91c2dee

spall approved these changes Jun 12, 2025

View reviewed changes

Update XFAIL and comments to reflect why this passes with Clang

f556360

llvm-beanz marked this pull request as ready for review June 17, 2025 17:52

llvm-beanz merged commit 81e73ae into llvm:main Jun 19, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add test for UAV sequential consistency #225

Add test for UAV sequential consistency #225

Uh oh!

llvm-beanz commented Jun 5, 2025 •

edited

Loading

Uh oh!

spall Jun 9, 2025

Uh oh!

spall Jun 9, 2025 •

edited

Loading

Uh oh!

llvm-beanz Jun 11, 2025

Uh oh!

spall Jun 11, 2025

Uh oh!

tex3d Jun 11, 2025

Uh oh!

tex3d Jun 11, 2025

Uh oh!

tex3d Jun 11, 2025

Uh oh!

llvm-beanz Jun 12, 2025

Uh oh!

spall Jun 9, 2025

Uh oh!

tex3d Jun 11, 2025

Uh oh!

llvm-beanz Jun 12, 2025 •

edited

Loading

Uh oh!

spall left a comment

Uh oh!

Uh oh!

Uh oh!

Add test for UAV sequential consistency #225

Add test for UAV sequential consistency #225

Uh oh!

Conversation

llvm-beanz commented Jun 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

spall Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

llvm-beanz Jun 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

spall left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

llvm-beanz commented Jun 5, 2025 •

edited

Loading

spall Jun 9, 2025 •

edited

Loading

llvm-beanz Jun 12, 2025 •

edited

Loading