
Conversation

WoosukKwon
Collaborator

The compilation time of flash-attn can be drastically reduced if ninja is installed. Related issue: Dao-AILab/flash-attention#150
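For context, a minimal sketch of why installing ninja helps: PyTorch's `BuildExtension` compiles C++/CUDA extensions through ninja whenever the `ninja` package is importable, which parallelizes the many flash-attn compilation units instead of building them one at a time. The extension and file names below are hypothetical stand-ins; only the `torch.utils.cpp_extension` calls are the real API.

```python
# Minimal sketch, not the actual vllm/flash-attn setup.py.
from setuptools import setup
from torch.utils.cpp_extension import BuildExtension, CUDAExtension

setup(
    name="toy_ext",  # hypothetical extension standing in for the flash-attn kernels
    ext_modules=[
        CUDAExtension(
            name="toy_ext",
            sources=["toy_ext.cpp", "toy_kernel.cu"],  # hypothetical sources
        ),
    ],
    # BuildExtension uses ninja by default when it is available and falls
    # back to the slow serial distutils compiler only when it is not, so
    # `pip install ninja` before building is all that is needed.
    cmdclass={"build_ext": BuildExtension},
)
```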

@WoosukKwon WoosukKwon merged commit 2c5cd0d into main Apr 2, 2023
@WoosukKwon WoosukKwon deleted the ninja branch April 2, 2023 02:00
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
slyalin pushed a commit to slyalin/vllm that referenced this pull request Apr 3, 2024
[CPU] Support for larger block_size
tdg5 pushed a commit to tdg5/vllm that referenced this pull request Apr 25, 2024
z103cb referenced this pull request in z103cb/opendatahub_vllm May 7, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
wuhuikx pushed a commit to wuhuikx/vllm that referenced this pull request Mar 27, 2025
Make the package version controlled by setuptools_scm to keep it consistent with vllm

Signed-off-by: wangxiyuan <[email protected]>
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 5, 2025
zyongye pushed a commit to zyongye/vllm that referenced this pull request Aug 6, 2025
heheda12345 pushed a commit to heheda12345/vllm that referenced this pull request Sep 29, 2025