[Test]: Hermes tool parser stream output error in Qwen3 case #25203

ahartel · 2025-09-18T20:25:20Z

Purpose

Fix: #19056

Test Plan

Added some tests in the PR.

github-actions · 2025-09-18T20:25:30Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors.

You ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

gemini-code-assist

Code Review

This pull request addresses a streaming output error in the Hermes tool parser, particularly for the Qwen3 model, by correcting the logic for handling partially streamed JSON arguments. The addition of a comprehensive test suite for the Hermes parser, including specific cases for Qwen tokenization, streaming, and non-streaming scenarios, is a great improvement and significantly enhances the robustness of the parser. While the main fix is sound, I've identified a pre-existing critical bug in the type checking logic that should be addressed.

vllm/entrypoints/openai/tool_parsers/hermes_tool_parser.py

ahartel · 2025-09-19T06:06:19Z

@cedonley I saw that you changed lines 369-373 of hermes_tool_parser.py in #10979. The actual fix I am proposing here touches those lines as well by removing 2 string slice operations. Can you remember why you introduced them? Or do you have test cases at hand that I could add to the code base to make sure that they still run?

ahartel · 2025-09-19T06:09:10Z

Just found this question as well on this topic. The question and your answer seem to suggest that my fix might be applicable.

tugot17 · 2025-09-19T09:03:52Z

The tests seem to pass and all looks fine;

Do you have any idea why it used to be

if (delta_text not in cur_arguments_json[:-2]):

before?

ahartel · 2025-09-19T09:44:55Z

The tests seem to pass and all looks fine;

Do you have any idea why it used to be
if (delta_text not in cur_arguments_json[:-2]):
before?

No, unfortunately not. Let's see if @cedonley has any insights. See also the links to a previous discussion in my previous comment.

chaunceyjiang · 2025-09-22T06:24:04Z

Hi @ahartel Could you retest based on the main branch to see if the issue still exists?

ahartel · 2025-09-22T06:44:29Z

Hi @ahartel Could you retest based on the main branch to see if the issue still exists?

My newly added tests do indeed pass on main and used to fail previously. Seems to have been fixed. Thanks for pointing that out.

@chaunceyjiang Would you mind merging the changes to file tests/entrypoints/openai/tool_parsers/test_hermes_tool_parser.py? This would add some more test coverage and maybe also document the bevavior of the hermes tool parser.

chaunceyjiang · 2025-09-22T06:47:33Z

@chaunceyjiang Would you mind merging the changes to file tests/entrypoints/openai/tool_parsers/test_hermes_tool_parser.py? This would add some more test coverage and maybe also document the bevavior of the hermes tool parser.

Of course, this is a new test case. We can move forward quickly.

Signed-off-by: Andreas Hartel <[email protected]>

ahartel · 2025-09-22T06:54:13Z

Thanks. I updated my PR to only contain my test additions (plus some very minor reformattings)

tests/entrypoints/openai/tool_parsers/test_hermes_tool_parser.py

chaunceyjiang

Thanks~

ahartel · 2025-09-23T11:27:38Z

Thank you for your support @chaunceyjiang

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]>

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: charlifu <[email protected]>

Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: yewentao256 <[email protected]>

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: gaojc <[email protected]>

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]>

ahartel requested review from DarkLight1337, NickLucche, aarnphm, chaunceyjiang, robertgshaw2-redhat and simon-mo as code owners September 18, 2025 20:25

mergify bot added frontend qwen Related to Qwen models tool-calling labels Sep 18, 2025

github-project-automation bot added this to Tool Calling Sep 18, 2025

gemini-code-assist bot reviewed Sep 18, 2025

View reviewed changes

vllm/entrypoints/openai/tool_parsers/hermes_tool_parser.py Outdated Show resolved Hide resolved

ahartel force-pushed the fix-hermes-parser branch 2 times, most recently from d8ed627 to e7e9587 Compare September 19, 2025 06:02

ahartel force-pushed the fix-hermes-parser branch from e7e9587 to 6a1d542 Compare September 22, 2025 06:50

chaunceyjiang self-assigned this Sep 22, 2025

[test]: Add unit tests for Hermes tool parser

60cf77f

Signed-off-by: Andreas Hartel <[email protected]>

ahartel force-pushed the fix-hermes-parser branch from 6a1d542 to 60cf77f Compare September 22, 2025 06:53

chaunceyjiang changed the title ~~[fix]: Hermes tool parser stream output error in Qwen3 case (#19056)~~ [Test]: Hermes tool parser stream output error in Qwen3 case Sep 22, 2025

chaunceyjiang added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 22, 2025

chaunceyjiang reviewed Sep 22, 2025

View reviewed changes

tests/entrypoints/openai/tool_parsers/test_hermes_tool_parser.py Show resolved Hide resolved

chaunceyjiang enabled auto-merge (squash) September 23, 2025 09:54

chaunceyjiang approved these changes Sep 23, 2025

View reviewed changes

chaunceyjiang merged commit 4322c55 into vllm-project:main Sep 23, 2025
30 checks passed

github-project-automation bot moved this to Done in Tool Calling Sep 23, 2025

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (vllm-pr…

7e0161d

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (vllm-pr…

8960ec2

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: charlifu <[email protected]>

yewentao256 pushed a commit that referenced this pull request Oct 3, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (#25203)

fb64e67

Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: yewentao256 <[email protected]>

gjc0824 pushed a commit to gjc0824/vllm that referenced this pull request Oct 10, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (vllm-pr…

82c427f

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: gaojc <[email protected]>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (vllm-pr…

c317416

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Test]: Hermes tool parser stream output error in Qwen3 case (vllm-pr…

15c694b

…oject#25203) Signed-off-by: Andreas Hartel <[email protected]>

Uh oh!

[Test]: Hermes tool parser stream output error in Qwen3 case #25203

[Test]: Hermes tool parser stream output error in Qwen3 case #25203

Uh oh!

Conversation

ahartel commented Sep 18, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Uh oh!

github-actions bot commented Sep 18, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ahartel commented Sep 19, 2025

Uh oh!

ahartel commented Sep 19, 2025

Uh oh!

tugot17 commented Sep 19, 2025

Uh oh!

ahartel commented Sep 19, 2025

Uh oh!

chaunceyjiang commented Sep 22, 2025

Uh oh!

ahartel commented Sep 22, 2025

Uh oh!

chaunceyjiang commented Sep 22, 2025

Uh oh!

ahartel commented Sep 22, 2025

Uh oh!

Uh oh!

chaunceyjiang left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ahartel commented Sep 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ahartel commented Sep 18, 2025 •

edited by github-actions bot

Loading