[Core] Enable LoRA support for classification model #24596
Conversation
Code Review
This pull request introduces `ClassifierWithLoRA` to enable LoRA support for classification models. However, the implementation in the new file `vllm/lora/layers/classifier.py` has several critical issues: a hardcoded value that should be configurable, incorrect initialization of an internal state variable that will lead to runtime errors, and an incorrect and incomplete `forward` method implementation. These issues must be addressed for the layer to function correctly.
vllm/lora/layers/classifier.py (Outdated)

    ) -> None:
        self.lora_config = lora_config

        self.max_class_label =3  # self.lora_config.max_class_label
The `max_class_label` is hardcoded to `3`. This appears to be a placeholder and should be derived from `lora_config`, as suggested by the commented-out code. Hardcoding this value prevents the layer from being used for classification tasks with a different number of labels.
Suggested change:
-        self.max_class_label =3  # self.lora_config.max_class_label
+        self.max_class_label = self.lora_config.max_class_label
vllm/lora/layers/classifier.py (Outdated)

        self.lora_config = lora_config

        self.max_class_label =3  # self.lora_config.max_class_label
        self._label_slot = [-1] * self.max_class_label
The `_label_slot` list is initialized with a size of `self.max_class_label`. However, it is indexed by a LoRA slot index in `reset_lora` and `set_lora`, which can go up to `max_loras - 1`. If `max_loras` is greater than `max_class_label`, this will cause an `IndexError`. The size of `_label_slot` should be `max_loras` to correctly track the state for each LoRA slot.
Suggested change:
-        self._label_slot = [-1] * self.max_class_label
+        self._label_slot = [-1] * max_loras
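To make the indexing concern concrete, here is a minimal standalone sketch of slot bookkeeping sized by `max_loras`. The class and method bodies are illustrative assumptions for this comment, not code from the PR:

```python
# Hypothetical sketch (not this PR's code): the per-slot bookkeeping must be
# sized by max_loras because reset_lora/set_lora receive a LoRA *slot* index.
class ClassifierSlotState:
    def __init__(self, max_loras: int) -> None:
        # One entry per LoRA slot; -1 means "no adapter loaded in this slot".
        self._label_slot = [-1] * max_loras

    def reset_lora(self, index: int) -> None:
        # `index` ranges over [0, max_loras); a list sized by max_class_label
        # would raise IndexError whenever max_loras > max_class_label.
        self._label_slot[index] = -1

    def set_lora(self, index: int, num_labels: int) -> None:
        # Record how many class labels this adapter's classifier head outputs.
        self._label_slot[index] = num_labels


state = ClassifierSlotState(max_loras=8)
state.set_lora(5, num_labels=3)  # slot index 5 is valid only with max_loras sizing
```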
vllm/lora/layers/classifier.py (Outdated)

    def forward(self, input_: torch.Tensor) -> torch.Tensor:
        """Forward of ClassifierWithLoRA

        Args:
            input_: Tensor whose last dimension is `input_size`.

        Returns:
            - output

        """
        y = torch.zeros(self.input_size, self.max_class_label, device=input_.device)

        self.punica_wrapper.add_shrink(
            y,
            self.lora_a_stacked,
            add_input=True)
        # Cast y using self._label_slot
        return y
The `forward` method has several critical issues making it non-functional and incomplete:

- **Unused input:** The `input_` tensor is not used in the computation, which is incorrect for a forward pass.
- **Incorrect `add_shrink` call:** The arguments to `self.punica_wrapper.add_shrink` are mismatched with its definition (`add_shrink(y, x, lora_a_stacked, scale, **kwargs)`). The `input_` tensor should be passed as `x`, `self.lora_a_stacked` should be wrapped in a tuple for the `lora_a_stacked` argument, and a `scale` factor is missing. The current call will cause a `TypeError`.
- **Incomplete logic:** The comment `# Cast y using self._label_slot` on line 79 indicates missing implementation.
- **Incorrect `y` shape:** The `y` tensor is initialized with shape `(self.input_size, self.max_class_label)`, which is missing a batch dimension.
- **Missing LoRA B weights:** A standard LoRA layer uses both the A and B weight matrices. This implementation only seems to use `lora_a`, and `lora_b` is ignored in `set_lora`. The `forward` method should incorporate the full LoRA logic.

This method needs a complete rewrite to correctly implement the forward pass; a reference sketch of the required math follows below.
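For reference, here is a minimal eager-mode sketch of the computation the `forward` pass needs to perform for a single adapter: shrink with the LoRA A matrix, expand with the B matrix, then trim the label dimension using the count tracked in `_label_slot`. This intentionally bypasses the punica kernels and batched slot mapping, and the tensor layouts are assumptions rather than the PR's final implementation:

```python
import torch


def classifier_lora_reference(
    input_: torch.Tensor,   # (num_tokens, input_size)
    lora_a: torch.Tensor,   # (rank, input_size) for one adapter
    lora_b: torch.Tensor,   # (max_class_label, rank), zero-padded rows
    num_labels: int,        # this adapter's label count, from _label_slot
    scale: float = 1.0,
) -> torch.Tensor:
    # Shrink: project the hidden states down to the LoRA rank.
    shrunk = input_ @ lora_a.t()            # (num_tokens, rank)
    # Expand: project up to the (padded) classifier label space.
    logits = scale * (shrunk @ lora_b.t())  # (num_tokens, max_class_label)
    # Keep only the labels this adapter actually defines.
    return logits[:, :num_labels]
```

A production implementation would instead route through `punica_wrapper` so that tokens belonging to different adapters in the same batch each use their own A/B weights.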
Had a question on this PR: I thought LoRA already works for classification models? Or is this to extend support to LoRA weights for the classifier head as well as the base model?
Yes, mainly for multi-LoRA support of the classifier head.
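For context, a hedged sketch of the kind of usage this would enable, assuming the offline `classify` entrypoint accepts a `lora_request` once this lands; the model and adapter names/paths below are placeholders:

```python
from vllm import LLM
from vllm.lora.request import LoRARequest

# Placeholder names/paths: two classifier-head adapters over one base model.
llm = LLM(model="path/to/base-classification-model",
          task="classify",
          enable_lora=True,
          max_loras=2)

sentiment = LoRARequest("sentiment-head", 1, "/adapters/sentiment")
toxicity = LoRARequest("toxicity-head", 2, "/adapters/toxicity")

# Each request can route to a different classifier-head adapter.
print(llm.classify(["Great movie!"], lora_request=sentiment))
print(llm.classify(["Great movie!"], lora_request=toxicity))
```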
This pull request has merge conflicts that must be resolved before it can be merged.
Purpose
FIX #23719
FIX #19623
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
- Documentation update, including `supported_models.md` and `examples`, for a new model.