Skip to content

Conversation

WoosukKwon
Copy link
Collaborator

@WoosukKwon WoosukKwon commented Mar 26, 2023

TODO:

  • Test against HF implementation
  • Add TP support (@zhuohan123)

@WoosukKwon
Copy link
Collaborator Author

@zhuohan123 Please feel free to approve and merge this PR once you think it's ready.

@zhuohan123 zhuohan123 self-requested a review March 29, 2023 06:37
@zhuohan123 zhuohan123 merged commit 80a2f81 into main Mar 30, 2023
@WoosukKwon WoosukKwon deleted the llama branch April 12, 2023 03:12
v1nc3nt27 pushed a commit to v1nc3nt27/vllm that referenced this pull request Sep 12, 2023
dont error if user doesnt have kernels installed
bigPYJ1151 pushed a commit to bigPYJ1151/vllm that referenced this pull request Dec 29, 2023
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
zeroorhero pushed a commit to zeroorhero/vllm that referenced this pull request Sep 23, 2024
juncgu pushed a commit to juncgu/vllm that referenced this pull request May 8, 2025
Suggestion: Generalize/streamline async loading (remote prefill) side
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 5, 2025
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 6, 2025
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants