
Conversation

kzawora-intel
Collaborator

I've noticed two accuracy issues in unified attention:

  1. We weren't updating the persistent request states and the batch in the unified_execute_model method.
  2. We were overextending non-aligned prefix_prefill context lengths by one token.

The first one had a major impact. I suspect we were malforming batches as generation went on, since self.input_batch.num_tokens and req_state.output_token_ids were not updated correctly. In Granite GSM8K, fixing that yielded a +10 percentage point improvement.
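
To make the missing bookkeeping concrete, here's a minimal, self-contained sketch of the kind of per-step update involved. The class and field names below are simplified stand-ins for illustration, not the actual vLLM code:

```python
from dataclasses import dataclass, field

# Simplified stand-ins for the persistent structures; names and shapes are
# illustrative only, not the real vLLM classes.
@dataclass
class RequestState:
    output_token_ids: list[int] = field(default_factory=list)

@dataclass
class InputBatch:
    req_ids: list[str]
    num_tokens: dict[str, int]

def advance_after_sampling(batch: InputBatch,
                           requests: dict[str, RequestState],
                           sampled: dict[str, int]) -> None:
    """Advance persistent per-request state after each execute step.

    Skipping this is the kind of bug described above: the next iteration
    then rebuilds the batch from stale num_tokens / output_token_ids.
    """
    for req_id in batch.req_ids:
        requests[req_id].output_token_ids.append(sampled[req_id])
        batch.num_tokens[req_id] += 1
```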
The second one had a negligible impact: I didn't notice any accuracy improvement in the tests I ran, but we should be masking anything above the context length regardless.
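
For the masking point, here's a rough illustration, assuming a scores tensor of shape [num_reqs, num_heads, q_len, max_context_len]; it only sketches the masking idea and is not the actual prefix_prefill kernel:

```python
import torch

def mask_beyond_context(scores: torch.Tensor,
                        context_lens: torch.Tensor) -> torch.Tensor:
    # scores:       [num_reqs, num_heads, q_len, max_context_len]
    # context_lens: [num_reqs], true context length per request
    max_context_len = scores.shape[-1]
    positions = torch.arange(max_context_len, device=scores.device)
    # Key positions at or beyond a request's context length must not
    # contribute to attention, so set them to -inf before the softmax.
    invalid = positions[None, :] >= context_lens[:, None]  # [num_reqs, max_context_len]
    return scores.masked_fill(invalid[:, None, None, :], float("-inf"))
```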

I've added a GSM8K accuracy test to CI with this PR, which should now pass as well.

Signed-off-by: Konrad Zawora <[email protected]>

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.


✅ CI Passed

All checks passed successfully against the following vllm commit:
577d498212022f95dc3a59746b1da1c6ed23eaba

adobrzyn merged commit 09e4a68 into main on Oct 15, 2025
36 checks passed