You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One V_NOP od unrelated VALU instruction in between is required for
correctness when matrix A or B of current WMMA instruction overlaps
with matrix D of previous WMMA instruction.
Remaining cases of WMMA operand overlaps are handled by the hardware
and do not require handling in hazard recognizer.
Hardware may stall in cases where:
- matrix C of current WMMA instruction overlaps with
matrix D of previous WMMA instruction
- VALU instruction reads matrix D of previous WMMA instruction
- matrix A,B or C of WMMA instruction reads result of previous
VALU instruction
0 commit comments