
Conversation

@kaiyux kaiyux commented Jun 17, 2025

You can use an `extra-llm-api-config.yml` file to enable the feature:

stream_interval: 4
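As a rough illustration of what `stream_interval: N` does (this is a hypothetical sketch, not the actual TensorRT-LLM implementation): the streaming path flushes accumulated tokens to the client every N generation iterations instead of on every token, with a final flush for any remainder.

```python
# Illustrative sketch of stream_interval batching (not TensorRT-LLM code).
# Tokens produced one per iteration are flushed every `stream_interval`
# iterations; leftover tokens are flushed when generation finishes.
def stream_with_interval(tokens, stream_interval=4):
    buffer = []
    for i, tok in enumerate(tokens, start=1):
        buffer.append(tok)
        if i % stream_interval == 0:
            yield list(buffer)   # one streamed response every N iterations
            buffer.clear()
    if buffer:                   # final flush of the remainder
        yield list(buffer)

chunks = list(stream_with_interval(range(10), stream_interval=4))
# chunks == [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```

Batching responses this way trades per-token latency for fewer response messages, which can reduce streaming overhead at high concurrency.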

Signed-off-by: Kaiyu Xie <[email protected]>
@kaiyux kaiyux requested review from a team as code owners June 17, 2025 12:09
@kaiyux kaiyux requested a review from Naveassaf June 17, 2025 12:09
Signed-off-by: Kaiyu Xie <[email protected]>
@hypdeb hypdeb (Collaborator) left a comment

Seems like a very small number of changes to enable that feature, which is great! Do you have some numbers on how this affects performance?

@kaiyux kaiyux requested review from syuoni and dongxuy04 June 17, 2025 13:49
@kaiyux (Member Author) commented Jun 17, 2025

/bot run

@pcastonguay pcastonguay (Collaborator) left a comment

Can we add tests to verify this is working as expected?

@tensorrt-cicd (Collaborator)

PR_Github #9213 [ run ] triggered by Bot

@kaiyux (Member Author) commented Jun 17, 2025

> Can we add tests to verify this is working as expected?

@pcastonguay I added tests to tests/integration/defs/accuracy/test_llm_api_pytorch.py (90c12f4) to make sure that no tokens are missed. However, if you meant verifying that it indeed returns responses every N iterations, I'm not sure what the best way to verify that is for now, and can take a closer look tomorrow.

@pcastonguay (Collaborator) commented:

Yes I meant verifying that you only get a response every N tokens.
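A check of that property could consume the stream and assert that every response except the last carries exactly N tokens. The sketch below uses a stand-in generator rather than the real LLM API, and the helper names are hypothetical; an actual integration test would drive the streaming endpoint and inspect the returned responses instead.

```python
# Hypothetical check that streamed responses arrive every N tokens.
# `stream_with_interval` is a stub standing in for the real streaming API.
def stream_with_interval(tokens, stream_interval):
    buffer = []
    for i, tok in enumerate(tokens, start=1):
        buffer.append(tok)
        if i % stream_interval == 0:
            yield list(buffer)
            buffer.clear()
    if buffer:
        yield list(buffer)

def check_stream_interval(chunks, n):
    # Every response except possibly the last must carry exactly n tokens;
    # the final flush may be shorter but never empty.
    assert all(len(c) == n for c in chunks[:-1])
    assert 0 < len(chunks[-1]) <= n

chunks = list(stream_with_interval(range(13), stream_interval=4))
check_stream_interval(chunks, 4)  # chunk sizes: 4, 4, 4, 1
```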

@kaiyux (Member Author) commented Jun 18, 2025

/bot run

@tensorrt-cicd (Collaborator)

PR_Github #9259 [ run ] triggered by Bot

Signed-off-by: Kaiyu Xie <[email protected]>
@tensorrt-cicd (Collaborator)

PR_Github #9259 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6793 completed with status: 'FAILURE'

@kaiyux (Member Author) commented Jun 18, 2025

/bot run --disable-fail-fast

@tensorrt-cicd (Collaborator)

PR_Github #9297 [ run ] triggered by Bot

@QiJune QiJune (Collaborator) left a comment

LGTM

@tensorrt-cicd (Collaborator)

PR_Github #9421 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6914 completed with status: 'FAILURE'

Signed-off-by: Kaiyu Xie <[email protected]>
@kaiyux kaiyux force-pushed the user/kaiyu/stream_interval branch from f80ccb6 to ad7ec5e Compare June 19, 2025 04:09
@kaiyux (Member Author) commented Jun 19, 2025

/bot run --disable-fail-fast

@kaiyux kaiyux enabled auto-merge (squash) June 19, 2025 04:10
@tensorrt-cicd (Collaborator)

PR_Github #9444 [ run ] triggered by Bot

@kaiyux (Member Author) commented Jun 19, 2025

/bot run

@tensorrt-cicd (Collaborator)

PR_Github #9450 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator)

PR_Github #9444 [ run ] completed with state ABORTED

@kaiyux (Member Author) commented Jun 19, 2025

/bot run

@tensorrt-cicd (Collaborator)

PR_Github #9465 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator)

PR_Github #9450 [ run ] completed with state ABORTED
/LLM/main/L0_MergeRequest_PR pipeline #6939 completed with status: 'FAILURE'

@tensorrt-cicd (Collaborator)

PR_Github #9465 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6949 completed with status: 'FAILURE'

@kaiyux (Member Author) commented Jun 19, 2025

/bot run

@tensorrt-cicd (Collaborator)

PR_Github #9482 [ run ] triggered by Bot

@tensorrt-cicd (Collaborator)

PR_Github #9482 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #6960 completed with status: 'SUCCESS'

@kaiyux kaiyux merged commit 7246fd7 into NVIDIA:main Jun 19, 2025
3 checks passed
@kaiyux kaiyux deleted the user/kaiyu/stream_interval branch July 3, 2025 00:56
dominicshanshan pushed commits to dominicshanshan/TensorRT-LLM that referenced this pull request, Jul 9 to Jul 11, 2025
7 participants