-
Notifications
You must be signed in to change notification settings - Fork 1.8k
feat: Support stream_interval #5284
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems like a very small number of changes to enable that feature, which is great! Do you have some numbers on how this affects performance?
…/stream_interval Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
/bot run |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we add tests to verify this is working as expected?
PR_Github #9213 [ run ] triggered by Bot |
@pcastonguay I added tests to |
Yes I meant verifying that you only get a response every N tokens. |
/bot run |
PR_Github #9259 [ run ] triggered by Bot |
Signed-off-by: Kaiyu Xie <[email protected]>
PR_Github #9259 [ run ] completed with state |
Signed-off-by: Kaiyu Xie <[email protected]>
…/stream_interval Signed-off-by: Kaiyu Xie <[email protected]>
/bot run --disable-fail-fast |
PR_Github #9297 [ run ] triggered by Bot |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
…/stream_interval Signed-off-by: Kaiyu Xie <[email protected]>
PR_Github #9421 [ run ] completed with state |
Signed-off-by: Kaiyu Xie <[email protected]>
f80ccb6
to
ad7ec5e
Compare
/bot run --disable-fail-fast |
PR_Github #9444 [ run ] triggered by Bot |
/bot run |
PR_Github #9450 [ run ] triggered by Bot |
PR_Github #9444 [ run ] completed with state |
/bot run |
PR_Github #9465 [ run ] triggered by Bot |
PR_Github #9450 [ run ] completed with state |
PR_Github #9465 [ run ] completed with state |
/bot run |
PR_Github #9482 [ run ] triggered by Bot |
PR_Github #9482 [ run ] completed with state |
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
Signed-off-by: Kaiyu Xie <[email protected]>
You can use a
extra-llm-api-config.yml
to enable the feature: