[V1] Logits processor docs #22919
Conversation
Signed-off-by: Andrew Feldman <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a limited set of checks runs automatically. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀
Code Review
This pull request introduces documentation for the custom logits processor extensibility feature. The new markdown file explains how to create and use custom logits processors, including a code example. My review focuses on fixing several issues in this code example to make it functional and clear for developers.
docs/features/custom_logitsprocs.md
Outdated
The contrived example below implements a

??? code "Example custom logits processor definition"

    ```python
    from typing import Optional

    import torch

    from vllm.config import VllmConfig
    from vllm.sampling_params import SamplingParams
    from vllm.v1.sample.logits_processor import (BatchUpdate,
                                                 LogitsProcessor,
                                                 MoveDirectionality)


    class DummyLogitsProcessor(LogitsProcessor):
        """Fake logit processor to support unit testing and examples"""

        def __init__(self, vllm_config: "VllmConfig", device: torch.device,
                     is_pin_memory: bool):
            self.req_info: dict[int, SamplingParams] = {}

        def is_argmax_invariant(self) -> bool:
            """Never impacts greedy sampling"""
            return False

        def update_state(self, batch_update: Optional[BatchUpdate]):
            if not batch_update:
                return

            # Process added requests.
            for index, params, _, _ in batch_update.added:
                assert params is not None
                if params.extra_args and (target_token :=
                                          params.extra_args.get("target_token")):
                    self.req_info[index] = target_token

            if self.req_info:
                # Process removed requests.
                for index in batch_update.removed:
                    self.req_info.pop(index, None)

                # Process moved requests, unidirectional move (a->b) and swap
                # (a<->b)
                for adx, bdx, direct in batch_update.moved:
                    a_val = self.req_info.pop(adx, None)
                    b_val = self.req_info.pop(bdx, None)
                    if a_val is not None:
                        self.req_info[bdx] = a_val
                    if direct == MoveDirectionality.SWAP and b_val is not None:
                        self.req_info[adx] = b_val

        def apply(self, logits: torch.Tensor) -> torch.Tensor:
            if not self.req_info:
                return logits

            # Save target values before modification
            rows_list = list(self.req_info.keys())
            cols = torch.tensor([self.req_info[i] for i in rows_list],
                                dtype=torch.long,
                                device=logits.device)
            rows = torch.tensor(rows_list, dtype=torch.long, device=logits.device)
            values_to_keep = logits[rows, cols].clone()
    ```
The example `DummyLogitsProcessor` has a few issues that prevent it from working correctly and could confuse users:

- The introductory sentence on line 16 is incomplete.
- The type hint for `self.req_info` in `__init__` is `dict[int, SamplingParams]`, but it is used to store integer `target_token` values. This should be `dict[int, int]`.
- The `apply` method is incomplete. When `self.req_info` is populated, it doesn't return the `logits` tensor, which will lead to a runtime error. The logic is also unfinished.
I've provided a corrected version of the example that addresses these points, making it a complete and functional illustration of a custom logits processor.
The contrived example below implements a logits processor that forces the model to select a specific `target_token` for requests that provide it.

??? code "Example custom logits processor definition"

    ```python
    from typing import Optional

    import torch

    from vllm.config import VllmConfig
    from vllm.sampling_params import SamplingParams
    from vllm.v1.sample.logits_processor import (BatchUpdate,
                                                 LogitsProcessor,
                                                 MoveDirectionality)


    class DummyLogitsProcessor(LogitsProcessor):
        """Fake logit processor to support unit testing and examples"""

        def __init__(self, vllm_config: "VllmConfig", device: torch.device,
                     is_pin_memory: bool):
            self.req_info: dict[int, int] = {}

        def is_argmax_invariant(self) -> bool:
            """Never impacts greedy sampling"""
            return False

        def update_state(self, batch_update: Optional[BatchUpdate]):
            if not batch_update:
                return

            # Process added requests.
            for index, params, _, _ in batch_update.added:
                assert params is not None
                if params.extra_args and (target_token :=
                                          params.extra_args.get("target_token")):
                    self.req_info[index] = target_token

            if self.req_info:
                # Process removed requests.
                for index in batch_update.removed:
                    self.req_info.pop(index, None)

                # Process moved requests, unidirectional move (a->b) and swap
                # (a<->b)
                for adx, bdx, direct in batch_update.moved:
                    a_val = self.req_info.pop(adx, None)
                    b_val = self.req_info.pop(bdx, None)
                    if a_val is not None:
                        self.req_info[bdx] = a_val
                    if direct == MoveDirectionality.SWAP and b_val is not None:
                        self.req_info[adx] = b_val

        def apply(self, logits: torch.Tensor) -> torch.Tensor:
            if not self.req_info:
                return logits

            rows_list = list(self.req_info.keys())
            cols = torch.tensor([self.req_info[i] for i in rows_list],
                                dtype=torch.long,
                                device=logits.device)
            rows = torch.tensor(rows_list, dtype=torch.long, device=logits.device)

            # Get the original logits for the target tokens.
            values_to_keep = logits[rows, cols].clone()

            # For requests with a target token, set all other logits to -inf.
            # This is a contrived example to force the model to select the
            # target token.
            for row_idx in rows_list:
                logits[row_idx, :] = -float("inf")
            logits[rows, cols] = values_to_keep

            return logits
    ```
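To see how the moved-request bookkeeping in `update_state` behaves, here is a standalone sketch (no vLLM or torch required) that mimics the move/swap handling above with plain dicts. `MoveDirectionality` is stubbed as a simple enum here purely for illustration; the real enum lives in `vllm.v1.sample.logits_processor`.

```python
from enum import Enum


class MoveDirectionality(Enum):
    # Stub of vLLM's enum, for illustration only.
    UNIDIRECTIONAL = 1
    SWAP = 2


def apply_moves(req_info: dict, moved: list) -> dict:
    """Mimics the moved-request handling in update_state above.

    req_info maps batch index -> target token; moved holds
    (source_index, destination_index, directionality) tuples.
    """
    for adx, bdx, direct in moved:
        a_val = req_info.pop(adx, None)
        b_val = req_info.pop(bdx, None)
        if a_val is not None:
            req_info[bdx] = a_val
        if direct == MoveDirectionality.SWAP and b_val is not None:
            req_info[adx] = b_val
    return req_info


# Unidirectional move: the request at index 0 relocates to index 2,
# carrying its target token with it.
print(apply_moves({0: 7}, [(0, 2, MoveDirectionality.UNIDIRECTIONAL)]))

# Swap: indices 0 and 1 exchange their target tokens.
print(apply_moves({0: 7, 1: 9}, [(0, 1, MoveDirectionality.SWAP)]))
```

Note that a unidirectional move overwrites any state at the destination index, while a swap preserves both entries; this mirrors how the persistent batch compacts and reorders request slots between steps.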
Signed-off-by: Andrew Feldman <[email protected]>
Thank you @JosephMarinier for your review; I believe I addressed everything you mentioned.
Thank you for the cool feature! 🙏
Signed-off-by: Andrew Feldman <[email protected]>
Thanks @afeldman-nm, looks great apart from the one comment, we could merge this since that will likely need to change soon anyhow.
Signed-off-by: Andrew Feldman <[email protected]>
Thanks @afeldman-nm
Signed-off-by: Andrew Feldman <[email protected]> Signed-off-by: afeldman-nm <[email protected]> Co-authored-by: Joseph Marinier <[email protected]>
Signed-off-by: Andrew Feldman <[email protected]> Signed-off-by: afeldman-nm <[email protected]> Co-authored-by: Joseph Marinier <[email protected]> Signed-off-by: charlifu <[email protected]>
Signed-off-by: Andrew Feldman <[email protected]> Signed-off-by: afeldman-nm <[email protected]> Co-authored-by: Joseph Marinier <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>
Purpose
Test Plan
N/A
Test Result
N/A
(Optional) Documentation Update
See Purpose
Essential Elements of an Effective PR Description Checklist
- `supported_models.md` and `examples` for a new model.