Skip to content

Conversation

MasterJH5574
Copy link
Contributor

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16. We added a few vector data conversion util functions.

@MasterJH5574 MasterJH5574 force-pushed the tvm-dev/2025-03-11-bf16-fp8-vector-cast branch 2 times, most recently from 6647447 to 81d6a15 Compare March 12, 2025 02:13
This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
@MasterJH5574 MasterJH5574 force-pushed the tvm-dev/2025-03-11-bf16-fp8-vector-cast branch from 81d6a15 to db749ed Compare March 12, 2025 03:30
Copy link
Contributor

@cyx-6 cyx-6 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, verified on B100:)

@Hzfengsy Hzfengsy merged commit 6b89f95 into apache:main Mar 12, 2025
15 checks passed
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 8, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 8, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 9, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 9, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 9, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 9, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
JoieAli pushed a commit to JoieAli/mcTVM that referenced this pull request Jul 9, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025
apache#17741)

This PR fixes the CUDA code generation for fp8 (also fp4) and bfloat16.
We added a few vector data conversion util functions.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants