[Question] MBU in automated CI? #237

@cadedaniel

Hi folks, thanks for the great work.

With #135 merged, vLLM could benefit from a torch.compile backend, given the compiler-native integration with PagedAttention kernels.

Is there an easy way to see the latest/nightly MBU for torch.compile on, say, H100 / Llama3 70B?
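For context, MBU (model bandwidth utilization) is commonly defined as achieved memory bandwidth divided by peak hardware bandwidth. During decoding, achieved bandwidth is roughly the bytes read per step (parameters plus KV cache) times the number of decode steps per second. A minimal sketch of that arithmetic is below; the function name and every number in the example are illustrative assumptions, not measurements from this repo or from vLLM.

```python
def model_bandwidth_utilization(
    param_bytes: float,
    kv_cache_bytes: float,
    decode_steps_per_second: float,
    peak_bandwidth_bytes_per_second: float,
) -> float:
    """Return MBU as a fraction in [0, 1] under the usual decode-time definition.

    Hypothetical helper for illustration only.
    """
    if peak_bandwidth_bytes_per_second <= 0:
        raise ValueError("peak bandwidth must be positive")
    # Each decode step reads all weights plus the KV cache once.
    achieved = (param_bytes + kv_cache_bytes) * decode_steps_per_second
    return achieved / peak_bandwidth_bytes_per_second


# Illustrative inputs (assumptions, not benchmark results):
#   70e9 parameters in bf16 (2 bytes each), KV cache ignored,
#   10 decode steps per second, 4 GPUs at an assumed 3.35e12 B/s each.
example_mbu = model_bandwidth_utilization(
    param_bytes=70e9 * 2,
    kv_cache_bytes=0.0,
    decode_steps_per_second=10.0,
    peak_bandwidth_bytes_per_second=4 * 3.35e12,
)
print(f"MBU: {example_mbu:.1%}")
```

A CI job would only need to record decode throughput per configuration; the remaining inputs are fixed by the model and the hardware.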

I'm also interested in cold-start compile time.
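One rough way to estimate cold-start cost is to time the first call of a compiled callable against later calls, since with torch.compile the first call is where compilation happens. The sketch below uses only the standard library; `time_cold_and_warm` and `FakeCompiled` are hypothetical names, and the stand-in callable simulates compilation with a sleep so the example runs without a GPU. For real CUDA work, a synchronization such as `torch.cuda.synchronize()` would be needed before reading the clock.

```python
import time


def time_cold_and_warm(fn, *args, warm_runs=3, **kwargs):
    """Return (cold_seconds, best_warm_seconds) for a callable.

    The first call is treated as the cold start; later calls are warm.
    """
    start = time.perf_counter()
    fn(*args, **kwargs)
    cold = time.perf_counter() - start

    warm = []
    for _ in range(warm_runs):
        start = time.perf_counter()
        fn(*args, **kwargs)
        warm.append(time.perf_counter() - start)
    return cold, min(warm)


class FakeCompiled:
    """Stand-in for a compiled model: slow on first call, fast afterwards."""

    def __init__(self, compile_seconds=0.05):
        self.compile_seconds = compile_seconds
        self.compiled = False

    def __call__(self, x):
        if not self.compiled:
            time.sleep(self.compile_seconds)  # simulated compile cost
            self.compiled = True
        return x * 2


cold, warm = time_cold_and_warm(FakeCompiled(), 3)
print(f"cold start: {cold:.3f}s, warm call: {warm:.6f}s")
```

The difference between the two numbers approximates the one-time compile overhead, which is the quantity a nightly job could track.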

cc @msaroufim

Activity

supriyar (Contributor) commented on May 10, 2024

@anijain2305 do we have any benchmark numbers for the cold start compile time?

msaroufim (Member) commented on May 11, 2024

          [Question] MBU in automated CI? · Issue #237 · pytorch/ao