[Misc] Logits processor plugins #4769
Conversation
I added some documentation about this feature :)
This looks cool - a distribution mechanism for logit processors. When #4775 gets merged, this PR would need to be updated to support the more generic interface.
I am very much in favor of this approach. A few months ago I tried to get a similar concept into huggingface-tgi:
I like this idea. And I agree with @mmoskal that it would be important to support the more involved API being worked on in #4775. I wonder, though, how one would implement support for OpenAI API tool use if guided decoding were to be provided by such a plugin. The code in the OpenAI server depends on the guided decoding backend and will need to know how to transform the OpenAI-API-conformant parameters into valid guided decoding parameters (c.f. #4656). Supporting the OpenAI API as thoroughly as possible is very valuable and should not be sacrificed for software-architectural reasons. So we can either define guided decoding as a core vLLM feature that is out of scope for logit-processor plugins, or we can think about making the frontend part necessary to "correctly" use the plugins pluggable as well. The latter would be a challenging endeavor.
Thank you for the feedback, everyone. Regarding @br3no's response: it's a good point. I believe that as a first step it makes sense to keep the guided decoding code as core vLLM logic, all the more so as it's already implemented this way. I will think about how it could be implemented as plugins while still allowing tool calling, but I believe this pull request is valuable either way :)
@DarkLight1337 @simon-mo |
Just to mention here that to properly support stateful logits processors, we are proposing to change the API to take logits processor factories rather than logits processor instances, see #5329.
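For illustration, the factory-versus-instance distinction could look roughly like the sketch below. This is an assumption about the proposal's shape, not the actual #5329 API; names are hypothetical, and plain Python floats stand in for torch tensors to keep it self-contained.

```python
# Hedged sketch: a stateful logits processor created per request by a
# factory, so each request gets fresh state. Signatures are hypothetical.
from typing import Callable, List

# Assumed callable shape: (generated token ids, next-token logits) -> logits.
LogitsProcessor = Callable[[List[int], List[float]], List[float]]


def make_ban_repeats_processor() -> LogitsProcessor:
    """Factory returning a fresh stateful processor for each request."""
    seen: set = set()  # per-request state; a shared instance would leak this

    def processor(token_ids: List[int], logits: List[float]) -> List[float]:
        # Remember every token generated so far and suppress repeats.
        seen.update(token_ids)
        return [float("-inf") if i in seen else logit
                for i, logit in enumerate(logits)]

    return processor
```

Passing the factory rather than an instance means two concurrent requests cannot accidentally share the `seen` state.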
This pull request has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this pull request should remain open. Thank you!
This pull request has merge conflicts that must be resolved before it can be merged.
Closing as stale. If you plan to continue this work, feel free to re-open. |
This pull request adds support for logits processor plugins.
This makes implementing custom logits processors very easy and eliminates the need to change vLLM itself to add one.
For example, with this merge request all of the guided decoding features could be implemented as a Python package installed in the same virtualenv as vLLM, without touching vLLM's source code.
Example code for a logits processor plugin that, given a token id, multiplies its logit by 100:
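(The original example code did not survive in this copy; the following is a minimal sketch of what such a plugin might look like. The plugin-dictionary shape and the `parameters_model` mechanism are assumptions based on the description, and plain Python floats stand in for a torch tensor to keep the sketch self-contained.)

```python
# Hedged sketch of a logits processor plugin. vLLM logits processors are
# callables taking the generated token ids and the next-token logits; the
# surrounding plugin structure here is assumed, not the PR's actual API.
from dataclasses import dataclass
from typing import List


@dataclass
class BoostTokenParams:
    # Stand-in for the plugin's `parameters_model`, used to validate and
    # parse the request body (the PR may well use a pydantic model here).
    token_id: int
    factor: float = 100.0


class BoostTokenLogitsProcessor:
    """Multiplies the logit of a given token id by `factor` (100 by default)."""

    def __init__(self, params: BoostTokenParams):
        self.params = params

    def __call__(self, token_ids: List[int], logits: List[float]) -> List[float]:
        logits = list(logits)  # avoid mutating the caller's logits
        logits[self.params.token_id] *= self.params.factor
        return logits


# Hypothetical plugin dictionary: a name for requests to reference, the
# parameters model, and the processor class itself.
LOGITS_PROCESSOR_PLUGIN = {
    "name": "boost_token",
    "parameters_model": BoostTokenParams,
    "logits_processor": BoostTokenLogitsProcessor,
}
```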
And the `setup.py` file for the package should look something like this:

With this merge request vLLM will load all the plugins at startup, and each inference request can specify usage of custom logits processors using the `logits_processors` field in the request body. The `parameters_model` in the plugin dictionary is used to validate and parse the request body.

I will soon add to this pull request a page in the documentation explaining how to implement custom logits processors.
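(The `setup.py` example referenced above is also missing from this copy. One plausible shape, registering the plugin via a setuptools entry point, is sketched below; the entry-point group name `vllm.logits_processors` and the module/package names are assumptions, not a confirmed vLLM API.)

```python
# Hypothetical setup.py for the plugin package. The entry-point group is an
# assumed discovery mechanism that vLLM would scan at startup.
from setuptools import setup

setup(
    name="vllm-boost-token-plugin",
    version="0.1.0",
    py_modules=["boost_token_plugin"],
    entry_points={
        "vllm.logits_processors": [
            "boost_token = boost_token_plugin:LOGITS_PROCESSOR_PLUGIN",
        ],
    },
)
```

After `pip install`-ing such a package into the same virtualenv as vLLM, the plugin would be discoverable without any change to vLLM's source.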