Skip to content

Conversation

xw285cornell
Copy link
Contributor

@xw285cornell xw285cornell commented Jun 22, 2024

Summary: Avoid latency of launching hipMemcpyAsync. Could see 3-4us reduction in benchmarking. Also see improvements in end to end testing.

Moved from #2693 to fix some formatting issue. Thanks @wenkaidu for contributing.

Reviewed By: sryap, jianyuh

Differential Revision: D58223358

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D58223358

Copy link

netlify bot commented Jun 22, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 3b52730
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/667638a4cd502500086fab06
😎 Deploy Preview https://deploy-preview-2770--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D58223358

xw285cornell added a commit to xw285cornell/FBGEMM that referenced this pull request Jun 22, 2024
Summary:
Pull Request resolved: pytorch#2770

Avoid latency of launching hipMemcpyAsync. Could see 3-4us reduction in benchmarking. Also see improvements in end to end testing.

Reviewed By: sryap, jianyuh

Differential Revision: D58223358
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D58223358

xw285cornell added a commit to xw285cornell/FBGEMM that referenced this pull request Jun 22, 2024
Summary:
Pull Request resolved: pytorch#2770

Avoid latency of launching hipMemcpyAsync. Could see 3-4us reduction in benchmarking. Also see improvements in end to end testing.

Reviewed By: sryap, jianyuh

Differential Revision: D58223358
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D58223358

xw285cornell added a commit to xw285cornell/FBGEMM that referenced this pull request Jun 22, 2024
Summary:
Pull Request resolved: pytorch#2770

Avoid latency of launching hipMemcpyAsync. Could see 3-4us reduction in benchmarking. Also see improvements in end to end testing.

Reviewed By: sryap, jianyuh

Differential Revision: D58223358
Summary:
Pull Request resolved: pytorch#2770

Avoid latency of launching hipMemcpyAsync. Could see 3-4us reduction in benchmarking. Also see improvements in end to end testing.

Reviewed By: sryap, jianyuh

Differential Revision: D58223358
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D58223358

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 7f77444.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants