zhuohan123
Member

@zhuohan123 zhuohan123 commented Mar 27, 2023

Add a FastAPI-based frontend to cacheflow while keeping the old script working.

Remaining TODOs:

  • Add a README for the FastAPI frontend.
  • Rename the old script.
  • Add a gradio demo web frontend.

@zhuohan123 zhuohan123 requested a review from WoosukKwon March 27, 2023 06:19
Collaborator

@WoosukKwon WoosukKwon left a comment

LGTM! Thanks for your effort.

@zhuohan123 zhuohan123 merged commit 721fa3d into main Mar 29, 2023
@zhuohan123 zhuohan123 deleted the real-frontend branch March 29, 2023 06:49
xiangyuT pushed a commit to xiangyuT/vllm that referenced this pull request Oct 25, 2023
* Add underlying functions

* tests done
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
slyalin pushed a commit to slyalin/vllm that referenced this pull request Mar 22, 2024
ykim362 pushed a commit to ykim362/vllm that referenced this pull request Jun 17, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
zeroorhero pushed a commit to zeroorhero/vllm that referenced this pull request Sep 23, 2024
wuhuikx pushed a commit to wuhuikx/vllm that referenced this pull request Mar 27, 2025
### What this PR does / why we need it?
This PR adds Chinese-language documentation for vllm-ascend, aimed at
Chinese-speaking developers.

### Does this PR introduce _any_ user-facing change?
The changes are as follows:
- add README.zh.md
- add environment.zh.md
- add CONTRIBUTING.zh.md

### How was this patch tested?
By CI

---------

Signed-off-by: wangli <[email protected]>
juncgu pushed a commit to juncgu/vllm that referenced this pull request May 8, 2025
Move new GPUModelRunner methods out of `execute_model` method
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 5, 2025
* hf format

Signed-off-by: Chen Zhang <[email protected]>

* better qkv concat

Signed-off-by: Chen Zhang <[email protected]>

---------

Signed-off-by: Chen Zhang <[email protected]>
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 6, 2025
* hf format

Signed-off-by: Chen Zhang <[email protected]>

* better qkv concat

Signed-off-by: Chen Zhang <[email protected]>

---------

Signed-off-by: Chen Zhang <[email protected]>