forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 68
Pull requests: ROCm/pytorch
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[release/2.7][ROCm][TunableOp] Fix offline tuning for ScaledGEMM. (#149677)
#2085
opened May 1, 2025 by
naromero77amd
Loading…
[AUTOGENERATED] [release/2.7] [ROCm] Update maxpool launch config
#2084
opened May 1, 2025 by
okakarpa
Loading…
[AUTOGENERATED] [release/2.7] [ROCm] Maxpool backward NHWC Perf Improvement targeting Resnet scenarios
#2083
opened May 1, 2025 by
okakarpa
Loading…
[AUTOGENERATED] [release/2.6] [ROCm] Update maxpool launch config
#2082
opened May 1, 2025 by
okakarpa
Loading…
[release/2.6] Change gfx110x BLAS preferred backend
#2053
opened Apr 25, 2025 by
amd-imilenko
Loading…
Enable load-compute-store interleaving for unrolled elementwise kernel.
#1886
opened Feb 6, 2025 by
carlobertolli
•
Draft
[Do NOT MERGE] [release/2.5] Enable tf32 testing on test_nn
#1859
opened Jan 27, 2025 by
jagadish-amd
Loading…
[ROCm] Eliminate the need for divisions in layernorm for default vector size.
#1850
opened Jan 22, 2025 by
doru1004
Loading…
[ROCm][WIP] Improve performance of casted elementwise add operations
#1805
opened Dec 20, 2024 by
doru1004
Loading…
[release/2.5] Fixed string comparison in test_cpp_wrapper_hipify
#1760
opened Nov 29, 2024 by
iupaikov-amd
Loading…
[release/2.5] Enabled force_shape_pad for test_pad_mm and test_slice_mm_bandwidth_computation
#1755
opened Nov 28, 2024 by
iupaikov-amd
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.