[Core] Remove tokenizer group in vLLM #24078

zhuohan123 · 2025-09-02T05:15:48Z

Purpose

Tokenizer group is an abstraction introduced in the early days of vLLM to support the case where different LoRA adapters use different tokenizers. Looking back, LoRA is a niche feature among vLLM users, and different tokenizers for different LoRAs is a "niche of the niche" feature. However, this niche-of-the-niche feature has spread throughout the vLLM code base and has become technical debt.

In the long term, I believe it's a good idea to eliminate the use of the tokenizer in vLLM core and make most of the core work only on token IDs. This reduces our coupling with the Hugging Face tokenizer and will make developing vLLM core easier.
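To illustrate the direction described above, here is a minimal sketch of a core that only ever sees token IDs, with tokenization kept at the frontend boundary. Every name in it (`Tokenizer`, `ToyTokenizer`, `CoreRequest`, `ToyCore`, `Frontend`) is hypothetical and chosen for illustration; none of them are vLLM classes, and the "model" is a placeholder that reverses the prompt.

```python
from dataclasses import dataclass
from typing import Protocol


class Tokenizer(Protocol):
    """Hypothetical minimal tokenizer interface (not a vLLM class)."""

    def encode(self, text: str) -> list[int]: ...

    def decode(self, token_ids: list[int]) -> str: ...


class ToyTokenizer:
    """Stand-in tokenizer: maps each character to its Unicode code point."""

    def encode(self, text: str) -> list[int]:
        return [ord(ch) for ch in text]

    def decode(self, token_ids: list[int]) -> str:
        return "".join(chr(t) for t in token_ids)


@dataclass
class CoreRequest:
    """What the hypothetical core receives: token IDs only, no text."""

    request_id: str
    prompt_token_ids: list[int]


class ToyCore:
    """Hypothetical core that never touches text or a tokenizer."""

    def generate(self, request: CoreRequest, max_new_tokens: int) -> list[int]:
        # Placeholder "model": return the prompt tokens reversed, truncated.
        return request.prompt_token_ids[::-1][:max_new_tokens]


class Frontend:
    """Owns the tokenizer; converts text to IDs on the way in and back on the way out."""

    def __init__(self, tokenizer: Tokenizer, core: ToyCore) -> None:
        self.tokenizer = tokenizer
        self.core = core

    def complete(self, request_id: str, prompt: str, max_new_tokens: int) -> str:
        token_ids = self.tokenizer.encode(prompt)
        request = CoreRequest(request_id=request_id, prompt_token_ids=token_ids)
        output_ids = self.core.generate(request, max_new_tokens)
        return self.tokenizer.decode(output_ids)


if __name__ == "__main__":
    frontend = Frontend(ToyTokenizer(), ToyCore())
    print(frontend.complete("req-0", "abc", max_new_tokens=2))
```

In a layout like this, swapping the tokenizer implementation only affects `Frontend`; the core's interface stays the same because it is expressed purely in token IDs.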

Also see #23474 #23540

Test Plan

Make sure all the existing tests pass.

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

mergify · 2025-09-17T04:09:56Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @zhuohan123.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify · 2025-09-17T04:32:31Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @zhuohan123.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

[WIP] Remove tokenizer group reference in the main codebase and pass …

349683f

…basic example Signed-off-by: Zhuohan Li <[email protected]>

mergify bot added the documentation, frontend, structured-output, and v1 labels Sep 2, 2025

github-project-automation bot added this to Structured Output Sep 2, 2025

zhuohan123 added 5 commits September 1, 2025 23:16

fix all non-lora tests

d644792

Signed-off-by: Zhuohan Li <[email protected]>

mypy fix

49a304c

Signed-off-by: Zhuohan Li <[email protected]>

fix mypy

3cd8df1

Signed-off-by: Zhuohan Li <[email protected]>

fix mypy

fda07d7

Signed-off-by: Zhuohan Li <[email protected]>

remove get_lora_tokenizer

658d84d

Signed-off-by: Zhuohan Li <[email protected]>

mergify bot added the performance label Sep 2, 2025

mypy

f350583

Signed-off-by: Zhuohan Li <[email protected]>

zhuohan123 marked this pull request as ready for review September 2, 2025 22:41

zhuohan123 requested review from DarkLight1337, WoosukKwon, aarnphm, alexm-redhat, comaniac, mgoin, njhill, robertgshaw2-redhat, russellb, simon-mo, youkaichao and ywang96 as code owners September 2, 2025 22:42

zhuohan123 added 2 commits September 2, 2025 17:24

remove extra parametmer

ecfb457

Signed-off-by: Zhuohan Li <[email protected]>

fix mistral tokenizer

39a736e

Signed-off-by: Zhuohan Li <[email protected]>

zhuohan123 requested a review from patrickvonplaten as a code owner September 3, 2025 02:48

simon-mo added the ready label Sep 3, 2025

mergify bot added the needs-rebase label Sep 15, 2025

zhuohan123 added 2 commits September 16, 2025 16:01

Print warning for LoRA requests

57f82d8

Signed-off-by: Zhuohan Li <[email protected]>

Merge branch 'main' into zhuohan/remove-token-group

4ca5ac9

Signed-off-by: Zhuohan Li <[email protected]>

zhuohan123 enabled auto-merge (squash) September 16, 2025 23:27

mergify bot removed the needs-rebase label Sep 16, 2025

fix test error

6905902

Signed-off-by: Zhuohan Li <[email protected]>

mergify bot added the needs-rebase label Sep 17, 2025

Merge branch 'main' into zhuohan/remove-token-group

3e0e590

Signed-off-by: Zhuohan Li <[email protected]>

mergify bot removed the needs-rebase label Sep 17, 2025

mergify bot added the needs-rebase label Sep 17, 2025

Merge branch 'main' into zhuohan/remove-token-group

2ff7f95

Signed-off-by: Zhuohan Li <[email protected]>

mergify bot removed the needs-rebase label Sep 17, 2025

zhuohan123 merged commit 6c47f6b into main Sep 17, 2025
56 checks passed

zhuohan123 deleted the zhuohan/remove-token-group branch September 17, 2025 08:43

github-project-automation bot moved this to Done in Structured Output Sep 17, 2025

github-project-automation bot moved this to Done in Tool Calling Sep 17, 2025

adobrzyn mentioned this pull request Sep 17, 2025

CI fix vllm-project/vllm-gaudi#186

Merged

xuechendi pushed a commit to vllm-project/vllm-gaudi that referenced this pull request Sep 17, 2025

CI fix (#186)

a3dce5c

vllm-project/vllm#24795 and vllm-project/vllm#24615 and vllm-project/vllm#24078 --------- Signed-off-by: Agata Dobrzyniewicz <[email protected]>

zhuohan123 mentioned this pull request Sep 17, 2025

[Core] Remove lora additional vocabulary #23540

Open

mgoin mentioned this pull request Sep 17, 2025

[CI Bugfix] Fix failing test_model_load_with_params tests due to tokenizer refactor #25086

Merged

5 tasks

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Core] Remove tokenizer group in vLLM (vllm-project#24078)

6bd1664

Signed-off-by: Zhuohan Li <[email protected]>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[Core] Remove tokenizer group in vLLM (vllm-project#24078)

332a076

Signed-off-by: Zhuohan Li <[email protected]> Signed-off-by: charlifu <[email protected]>

DarkLight1337 mentioned this pull request Sep 25, 2025

[Performance] model_config.compute_hash is computed every time and introduce overhead in each new multi-modal req #25671

Closed

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Core] Remove tokenizer group in vLLM (vllm-project#24078)

a065959

Signed-off-by: Zhuohan Li <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Core] Remove tokenizer group in vLLM (vllm-project#24078)

91c5278

Signed-off-by: Zhuohan Li <[email protected]>
