Skip to content

Conversation

Artem-B
Copy link
Member

@Artem-B Artem-B commented Sep 20, 2023

Some critical code paths we have depend on efficient byte extraction from data loaded as integers.
By default LLVM tries to extract by storing/loading from stack, which is very inefficient on GPU.

Some critical code paths we have depend on efficient byte extraction from data
loaded as 32-bit integers.
@Artem-B Artem-B requested review from d0k and ThomasRaoux September 20, 2023 19:14
@Artem-B Artem-B self-assigned this Sep 20, 2023
@Artem-B Artem-B added enhancement Improving things as opposed to bug fixing, e.g. new or missing feature backend:NVPTX labels Sep 20, 2023
Copy link
Contributor

@ThomasRaoux ThomasRaoux left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks!

@Artem-B Artem-B merged commit d06b3e3 into llvm:main Sep 21, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend:NVPTX enhancement Improving things as opposed to bug fixing, e.g. new or missing feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants