Conversation

WoosukKwon (Collaborator)

Fixes #66

This PR fixes a bug in our attention kernel. The bug was introduced in #53, which changed the precision of computations in the attention kernel. The kernel unit tests now pass normally.
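For context, the failure mode described here is a numerical-precision regression. The sketch below is not the vLLM kernel or the actual fix; it is only an illustration, in plain PyTorch with made-up shapes and tolerances, of the kind of unit test that catches such a regression by comparing a reduced-precision attention computation against a float32 reference.

```python
# Illustrative only: not the vLLM attention kernel or the fix from this PR.
# It shows the general testing pattern of checking a reduced-precision
# attention computation against a full float32 reference within a tolerance.
import torch

def reference_attention(q, k, v):
    # Scaled dot-product attention computed entirely in float32.
    scale = q.shape[-1] ** -0.5
    probs = torch.softmax(q @ k.transpose(-2, -1) * scale, dim=-1)
    return probs @ v

def reduced_precision_attention(q, k, v):
    # Same computation, but intermediates are rounded through float16 to
    # mimic a lower-precision kernel; a precision bug would show up as a
    # large mismatch against the reference below.
    scale = q.shape[-1] ** -0.5
    scores = (q @ k.transpose(-2, -1) * scale).half().float()
    probs = torch.softmax(scores, dim=-1).half().float()
    return (probs @ v).half().float()

q, k, v = (torch.randn(1, 8, 64) for _ in range(3))
torch.testing.assert_close(
    reduced_precision_attention(q, k, v),
    reference_attention(q, k, v),
    atol=1e-2, rtol=1e-2,  # loose tolerance: fp16 rounding alone stays well inside this
)
```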

@WoosukKwon WoosukKwon merged commit 130d5fd into main May 4, 2023
@WoosukKwon WoosukKwon deleted the attn-kernel-bugfix branch May 4, 2023 09:56
hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
SUMMARY:
- Added `licenses` subfolder for directories
- Moved `LICENSE-apache` into `licenses` directory
- Updated `setup.py` with NM Community License

TEST PLAN:
None
dllehr-amd pushed a commit to dllehr-amd/vllm that referenced this pull request Jul 22, 2024
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Aug 15, 2024
maxdebayser pushed a commit to maxdebayser/vllm that referenced this pull request Feb 13, 2025
…ncies (vllm-project#68)

I already tried to fix this using IBM/vllm#66
but upstream didn't like that change (the behaviour to filter out
comments containing torch was intentional). After [some
discussion](vllm-project#12255), we agreed
on a different solution implemented in this PR. Note that I reverted the
changes from vllm-project#66 by force-pushing main.

Note that this has already been merged upstream in
vllm-project#12260, but I'm cherry-picking
the fix here since it is blocking the CI builds.
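As a rough illustration of the behaviour mentioned above (dropping requirement entries that mention torch), the snippet below is a hypothetical sketch: the function name and logic are invented here and are not vLLM's actual setup.py code or the fix from the referenced PRs.

```python
# Hypothetical sketch only: not vLLM's setup.py logic or the fix in the PRs
# referenced above. It just illustrates filtering out requirement entries
# that mention torch, on the assumption that torch is installed separately.
def filter_requirements(lines):
    kept = []
    for line in lines:
        entry = line.split("#", 1)[0].strip()  # drop inline comments
        if not entry:
            continue  # blank line or comment-only line
        if "torch" in entry:
            continue  # torch handled separately, so skip it here
        kept.append(entry)
    return kept

print(filter_requirements(["torch==2.1.2", "numpy", "# note about torch", "requests"]))
# -> ['numpy', 'requests']
```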
Development

Successfully merging this pull request may close these issues.

A critical bug in attention kernel after refactoring