
fix: don't perform memory estimation for star_attention #3485


Merged: 3 commits merged into NVIDIA:main on Apr 12, 2025

Conversation

HuiGao-NV
Collaborator

Memory estimation is not supported for star attention yet, so we need to skip memory estimation for star_attention.
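A minimal sketch of the kind of guard described above, not the actual change in this PR. The names `ExecutorConfig`, `attn_backend`, `max_kv_cache_tokens`, `STAR_ATTENTION`, and `resolve_kv_cache_tokens` are hypothetical and chosen only for illustration; the real identifiers in the codebase may differ.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical backend label; the real identifier in the codebase may differ.
STAR_ATTENTION = "star_attention"


@dataclass
class ExecutorConfig:
    """Hypothetical stand-in for the executor configuration."""
    attn_backend: str
    # Token budget that is already configured (by the user or by a default).
    max_kv_cache_tokens: Optional[int] = None


def resolve_kv_cache_tokens(
    config: ExecutorConfig,
    estimate: Callable[[ExecutorConfig], int],
) -> Optional[int]:
    """Return the KV cache token budget, skipping estimation for star attention."""
    if config.attn_backend == STAR_ATTENTION:
        # Estimation is assumed to be unsupported for this backend,
        # so the estimator is never called and the configured value is kept.
        return config.max_kv_cache_tokens
    return estimate(config)
```

With this shape, a star attention config returns its configured budget unchanged, while any other backend still goes through the estimator passed in as `estimate`.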

@HuiGao-NV HuiGao-NV self-assigned this Apr 11, 2025
@HuiGao-NV HuiGao-NV requested a review from QiJune April 11, 2025 13:10
@HuiGao-NV
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Collaborator

PR_Github #1936 [ run ] triggered by Bot

@yuxianq
Collaborator

yuxianq commented Apr 11, 2025

@HuiGao-NV Please unwaive the tests in #3464.

@HuiGao-NV
Collaborator Author

/bot kill

@tensorrt-cicd
Collaborator

PR_Github #1942 [ kill ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #1936 [ run ] completed with state ABORTED

@tensorrt-cicd
Collaborator

PR_Github #1942 [ kill ] completed with state SUCCESS
Successfully killed previous jobs for commit 04c56ce

@HuiGao-NV
Collaborator Author

/bot run --disable-fail-fast

@tensorrt-cicd
Collaborator

PR_Github #1945 [ run ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #1945 [ run ] completed with state SUCCESS
/LLM/main/L0_MergeRequest_PR pipeline #1432 completed with status: 'SUCCESS'

@HuiGao-NV HuiGao-NV force-pushed the skip_mem_est_for_star_attention branch from 04c56ce to 2f81f37 April 11, 2025 23:43
@HuiGao-NV HuiGao-NV enabled auto-merge (squash) April 11, 2025 23:43
@juney-nvidia juney-nvidia changed the title from "fix: don't perform memory estimation for start_attention" to "fix: don't perform memory estimation for star_attention" Apr 12, 2025
@yuxianq
Collaborator

yuxianq commented Apr 12, 2025

/bot reuse-pipeline

@tensorrt-cicd
Collaborator

PR_Github #1989 [ reuse-pipeline ] triggered by Bot

@tensorrt-cicd
Collaborator

PR_Github #1989 [ reuse-pipeline ] completed with state SUCCESS
Reusing PR_Github #1945 for commit eb34f7c

@HuiGao-NV HuiGao-NV merged commit c51e90d into NVIDIA:main Apr 12, 2025
3 checks passed
@HuiGao-NV HuiGao-NV deleted the skip_mem_est_for_star_attention branch April 18, 2025 08:34