Update release pipeline post PyTorch 2.8.0 update #23960
Conversation
Signed-off-by: Huy Do <[email protected]>
Code Review
This pull request correctly updates the release pipeline to replace the deprecated CUDA 11.8 build with a CUDA 12.9 build, following the PyTorch 2.8.0 update. It also addresses an arm64 build failure by adding libnuma-dev. However, a critical dependency is missing from the final runtime image stage in the Dockerfile, which will likely lead to runtime errors. The libnuma-dev package needs to be added to the vllm-base stage as well.
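As a sketch, the fix the review is asking for might look like the following Dockerfile fragment. The stage name vllm-base comes from the review above; the base image and surrounding context are assumptions, not the actual vLLM Dockerfile:

```dockerfile
# Hypothetical final runtime stage. libnuma-dev must be installed here too,
# not only in the builder stage, or the runtime image will be missing libnuma.
FROM nvidia/cuda:12.9.1-base-ubuntu22.04 AS vllm-base

RUN apt-get update -y \
    && apt-get install -y --no-install-recommends libnuma-dev \
    && rm -rf /var/lib/apt/lists/*
```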
Does this PR mean we can expect vLLM nightlies built against 2.8.0 very soon? :) It might also be good to have a v0.10.1.2 release built against 2.8.0, since v0.10.1.1 was released when 2.8.0 was already out, and PyTorch 2.8.0 has an important fix (#18851 (comment)). So it would be very beneficial to have a proper vLLM release built against 2.8.0.
Yup, once this lands, the vLLM nightly (and the next vLLM release) wheels will be built on PyTorch 2.8.0.
Nit: if you are OK with x86_64, the "nightlies" are already there: https://gallery.ecr.aws/q9t5s3a7/vllm-release-repo. Update: a direct wheel can be obtained from a URL like the one in #20358 (comment).
Are there any plans for a service release built against 2.8.0, e.g. v0.10.1.2 or v0.10.2? That would be exactly the v0.10.1.1 code, but built against 2.8.0.
It seems there are Docker images, right? I'm looking for S3/HTTP-published .whl files.
Yes, I can see vllm-0.10.1rc2.dev371+g67c14906a-cp38-abi3-manylinux1_x86_64.whl from a build job that uploaded it to S3. Give it a try? Update: #20358 (comment)
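For readers hunting for the right wheel: the filename above encodes compatibility per PEP 427. A small stdlib-only sketch (the parsing helper is mine, not part of vLLM; the filename is taken verbatim from the comment above) shows how to pull the tags apart:

```python
def parse_wheel_filename(name: str) -> dict:
    """Split a wheel filename into its PEP 427 components.

    Layout: {dist}-{version}(-{build})?-{python}-{abi}-{platform}.whl
    """
    stem = name[: -len(".whl")]
    parts = stem.split("-")
    # The last three dash-separated fields are always python/abi/platform tags.
    python_tag, abi_tag, platform_tag = parts[-3:]
    return {
        "dist": parts[0],
        "version": parts[1],
        "python": python_tag,
        "abi": abi_tag,
        "platform": platform_tag,
    }

info = parse_wheel_filename(
    "vllm-0.10.1rc2.dev371+g67c14906a-cp38-abi3-manylinux1_x86_64.whl"
)
# cp38-abi3 means the wheel works on any CPython >= 3.8 via the stable ABI,
# but the platform tag pins it to x86_64 Linux -- it will not install on aarch64.
```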
There should probably be some fresh builds against 2.8.0 from this PR. I still propose having a service release, as it could also provide feedback to PyTorch on any perf regressions.
LGTM
Signed-off-by: Huy Do <[email protected]>
There is an issue on the PyTorch side to align the CUDA+PyTorch build matrix across x86 and aarch64.
I noticed that we have a discrepancy in the aarch64 Docker image build:
x86 is on CUDA 12.8.1
aarch64 would be on 12.9.1 with this PR
cc @nvpohanh
@simon-mo Should we align the CUDA version used between the x86 images and the aarch64 images?
I realized that for the PyTorch project, the aarch64 binary wheel for the v2.8.0 release was only available for CUDA 12.9. That is probably why we had to use cu129 for the ARM container.
Yes, so my question is: should we also upgrade the CUDA version in the x86 Docker images so that the x86 and aarch64 images have the same CUDA version? Otherwise it is kind of odd that the same vLLM release ships images with different CUDA versions on different archs.
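The mismatch under discussion can be stated concretely. A minimal sketch (the mapping and helper names are mine; the version numbers are taken from the thread above):

```python
# CUDA versions per architecture for the release images being discussed:
# x86_64 stays on 12.8.1, while this PR moves aarch64 to 12.9.1.
IMAGE_CUDA = {
    "x86_64": "12.8.1",
    "aarch64": "12.9.1",
}

def distinct_cuda_versions(image_cuda: dict) -> set:
    """Return the set of distinct CUDA versions across architectures.

    A single release is "aligned" only when every arch image is built
    against the same CUDA version, i.e. the set has exactly one element.
    """
    return set(image_cuda.values())

versions = distinct_cuda_versions(IMAGE_CUDA)
aligned = len(versions) == 1  # False here: the archs diverge
```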
This one looks better than #24020, as it also takes care of the wheel-uploading part.
Please resolve some comments I left. Thanks!
Seeing the b72ebd5 aarch64 image from this PR as well (on https://gallery.ecr.aws/q9t5s3a7/vllm-release-repo). Nice!
Purpose
This is the second part after #20358. This PR does 3 things:
- Adds libnuma-dev to fix the arm64 build https://buildkite.com/vllm/release/builds/7768#0198f57a-b3ef-4861-8528-97ce129f5c03/114-5868

Test Plan
CI https://buildkite.com/vllm/release/builds/7784
cc @simon-mo @khluu @seemethere