[Bugfix] Fix mrope in Transformers Backend #26087
Conversation
Signed-off-by: raushan <[email protected]>
Code Review
This pull request fixes an mrope issue in the Transformers backend by correctly configuring image_grid_thw and video_grid_thw as batched fields. It also includes several refactorings and cleanups, such as using a public API for setting the attention implementation and removing unused code. The changes are generally good, but I've identified a critical issue where unguarded dictionary access could lead to a KeyError when processing text-only inputs in a multimodal model. I've provided a suggestion to make the code more robust.
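A minimal sketch of the guard the review is suggesting (the function and dict contents are hypothetical, not the PR's actual code): use dict.get so that text-only inputs, which carry no image or video keys, yield None instead of raising KeyError.

```python
# Hypothetical helper illustrating the review's point: guard optional
# multimodal keys so text-only inputs don't crash.
def get_grid(processed_outputs: dict, key: str):
    # processed_outputs.get(key) returns None when the modality is absent,
    # whereas processed_outputs[key] would raise KeyError.
    return processed_outputs.get(key)

# Text-only request: no "image_grid_thw" key in the processor output.
print(get_grid({"input_ids": [1, 2, 3]}, "image_grid_thw"))  # None

# Multimodal request: the grid is returned as-is.
print(get_grid({"image_grid_thw": [[1, 2, 2]]}, "image_grid_thw"))
```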
Signed-off-by: raushan <[email protected]>
This pull request has merge conflicts that must be resolved before it can be merged.
Signed-off-by: raushan <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
This pull request has merge conflicts that must be resolved before it can be merged.
LGTM! Let's see what CI thinks
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: raushan <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Signed-off-by: Karan Goel <[email protected]>
Signed-off-by: raushan <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]>
Signed-off-by: raushan <[email protected]> Signed-off-by: Harry Mellor <[email protected]> Co-authored-by: Harry Mellor <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Fixes the recently skipped test by creating image_grid_thw and video_grid_thw as batched fields. Otherwise they get an extra dimension and fail when preparing mrope positions in vLLM's model runners.

Also adds/rewrites some comments in the processor code to keep them up to date. I tried to follow the base class apply() method, but transformers unfortunately cannot split the logic for "placeholder" and "the rest" into two parts. That would force us to call transformers utilities twice, which is not very fast. So I am keeping it as is and simply adding comments.

cc @hmellor, as we have been talking about it internally