Pull requests: NVIDIA/TransformerEngine
[PyTorch] Register weight and bias params in linear op
bug
Something isn't working
#2027
opened Aug 4, 2025 by
timmoon10
6 of 13 tasks
Offloading support for multiple attention layouts
#2024
opened Aug 3, 2025 by
sanandaraj5597
[PyTorch] Fix bug when deducing dtype in linear functional API
bug
Something isn't working
#2017
opened Aug 1, 2025 by
timmoon10
6 of 13 tasks
[Draft] Add FP8 attention with current scaling
#2012
opened Jul 30, 2025 by
cyanguwa
8 of 13 tasks
[PyTorch] Disable fused dbias-quantize kernel for unsupported recipes
bug
Something isn't working
#2007
opened Jul 30, 2025 by
timmoon10
6 of 13 tasks
Remove if-else and torch.tensor to meet cudagraph requirement
#1997
opened Jul 25, 2025 by
katec846
13 tasks
Use userbuffers for MXFP8 wgrad all-gather overlap
#1982
opened Jul 22, 2025 by
djns99
1 of 13 tasks
[PyTorch][Mcore] Fix illegal memory access issue while using Mcore async checkpoint with fp8 tensorwise recipe
bug
Something isn't working
#1956
opened Jul 16, 2025 by
zhongbozhu
13 tasks
[PyTorch] Support delay_wgrad_compute cudagraph
#1948
opened Jul 14, 2025 by
buptzyb
2 of 13 tasks
[PyTorch] Fuse permute+pad and unpermute+unpad ops for FP8 optimization
#1921
opened Jul 3, 2025 by
xiaoxi-wangfj
3 of 12 tasks
Fix import error when flash attention 3 is installed
#1913
opened Jun 30, 2025 by
HollowMan6
7 of 13 tasks
[PyTorch debug] Improve precision debug tools performance
#1909
opened Jun 30, 2025 by
pggPL
9 of 13 tasks
[PyTorch] Support FA3 MLA CP feature
#1907
opened Jun 28, 2025 by
zhujian19891203
7 of 13 tasks
[PyTorch Debug] Support log fp8 tensor stats for blockwise recipe
#1905
opened Jun 27, 2025 by
lengerfulluse
12 tasks
[Common] NVFP4 kernels
enhancement
New feature or request
#1904
opened Jun 27, 2025 by
Oleg-Goncharov
Draft
5 of 13 tasks
[PyTorch Debug] More advanced stats for Quantized Tensors
#1897
opened Jun 26, 2025 by
pggPL
2 of 13 tasks