Add more NVFuser microbenchmarks #801


Closed

Conversation

davidberard98 (Contributor) commented on Mar 16, 2022

Stack from ghstack:

Waiting on pytorch/pytorch#73627 to land, because some of these don't pass without it.

Differential Revision: D35732497

davidberard98 added a commit that referenced this pull request on Mar 16, 2022
ghstack-source-id: 67cd76b
Pull Request resolved: #801

davidberard98 added a commit that referenced this pull request on Mar 16, 2022
ghstack-source-id: 00f798f
Pull Request resolved: #801

davidberard98 marked this pull request as draft on March 16, 2022 at 00:23

davidberard98 added a commit that referenced this pull request on Mar 16, 2022
ghstack-source-id: 2e5074d
Pull Request resolved: #801

davidberard98 added a commit that referenced this pull request on Apr 5, 2022
ghstack-source-id: a3028c6
Pull Request resolved: #801
eellison (Contributor) left a comment:

Is there anything blocking this?

davidberard98 (Contributor, Author) replied:
@eellison they are still erroring; see pytorch/pytorch#75282.

davidberard98 added a commit that referenced this pull request on Apr 15, 2022
ghstack-source-id: 13d2a15
Pull Request resolved: #801
davidberard98 marked this pull request as ready for review on April 18, 2022 at 20:11
davidberard98 changed the title from "[WIP] Add more NVFuser microbenchmarks" to "Add more NVFuser microbenchmarks" on Apr 18, 2022
davidberard98 (Contributor, Author) commented:
@davidberard98 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 20, 2022

[NVFuser] always fallback if fusion fails

1) Remember when fusions fail; on subsequent runs, always take the fallback.
2) During the first fallback, cache the Code object.

On autogen-69 from the NVFuser microbenchmarks (pytorch/benchmark#801), this improved performance as follows:
* Original (always attempt fusion): 25ms
* Always take fallback after first failure: 0.79ms
* Always take fallback + cache Code object: 0.62ms
* Eager: 0.58ms
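For readers skimming the timeline, below is a minimal Python sketch of the two-step strategy this commit message describes. It is illustrative only: the actual change lives in PyTorch's JIT (in C++), and `FusionRunner`, `compile_fusion`, and `compile_fallback` are hypothetical names, not real PyTorch APIs.

```python
# Illustrative sketch of the fallback strategy described above.
# Not the real implementation: compile_fusion / compile_fallback are
# hypothetical stand-ins for the JIT's fusion and interpreter paths.

class FusionRunner:
    def __init__(self, compile_fusion, compile_fallback):
        self.compile_fusion = compile_fusion      # may raise if fusion fails
        self.compile_fallback = compile_fallback  # always-working fallback path
        self.failed_groups = set()   # (1) fusion groups known to fail
        self.fallback_cache = {}     # (2) group id -> cached fallback "Code"

    def run(self, group_id, inputs):
        # (1) If this group failed before, skip the fusion attempt entirely.
        if group_id not in self.failed_groups:
            try:
                return self.compile_fusion(group_id)(inputs)
            except RuntimeError:
                self.failed_groups.add(group_id)  # remember the failure
        # (2) Compile the fallback once and cache it, so later runs reuse
        # the cached object instead of recompiling.
        if group_id not in self.fallback_cache:
            self.fallback_cache[group_id] = self.compile_fallback(group_id)
        return self.fallback_cache[group_id](inputs)


def fusion_always_fails(group_id):
    raise RuntimeError("fusion failed")

runner = FusionRunner(fusion_always_fails, lambda gid: (lambda xs: sum(xs)))
print(runner.run("autogen-69", [1, 2, 3]))  # fusion fails once, falls back: 6
print(runner.run("autogen-69", [4, 5]))     # fusion skipped, cached fallback: 9
```

This mirrors why the benchmark numbers above improve in two steps: skipping the doomed fusion attempt removes most of the 25ms, and caching the fallback Code object removes the remaining recompilation cost.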
davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 20, 2022
[NVFuser] always fallback if fusion fails

davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 20, 2022
[NVFuser] always fallback if fusion fails
ghstack-source-id: 60c31f7
Pull Request resolved: #75983
davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 21, 2022
[NVFuser] always fallback if fusion fails

davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 21, 2022
[NVFuser] always fallback if fusion fails

davidberard98 added a commit to pytorch/pytorch that referenced this pull request on Apr 21, 2022
[NVFuser] always fallback if fusion fails
ghstack-source-id: 59be971
Pull Request resolved: #75983
facebook-github-bot deleted the gh/davidberard98/2/head branch on April 23, 2022 at 14:15
pytorchmergebot pushed a commit to pytorch/pytorch that referenced this pull request on Apr 25, 2022
[NVFuser] always fallback if fusion fails
Pull Request resolved: #75983
Approved by: https://github.com/jjsjann123
davidberard98 added a commit to davidberard98/pytorch that referenced this pull request on Apr 28, 2022

Retry of pytorch#75983. The change is to handle cases where attr::cache_id is not set; this can happen if compilation fails.

Original message:

1) Remember when fusions fail; on subsequent runs, always take the fallback.
2) During the first fallback, cache the Code object.

On autogen-69 from the NVFuser microbenchmarks (pytorch/benchmark#801), this improved performance as follows:
* Original (always attempt fusion): 25ms
* Always take fallback after first failure: 0.79ms
* Always take fallback + cache Code object: 0.62ms
* Eager: 0.58ms
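As a hypothetical sketch of the guard this retry adds (names are stand-ins; the real code operates on torch::jit nodes in C++): the fallback path must tolerate a node that never got a cache_id, instead of assuming the attribute exists.

```python
# Hypothetical sketch of the retry's fix: handle a missing cache_id.
# If compilation failed before attr::cache_id was assigned, there is no
# cache key, so compile the fallback directly rather than crash.
# `node` and its attributes are stand-ins for the real torch::jit types.
from types import SimpleNamespace

fallback_cache = {}

def run_fallback(node, inputs, compile_fallback):
    cache_id = getattr(node, "cache_id", None)  # may be unset
    if cache_id is None:
        # Compilation failed early; no cache entry to consult or create.
        return compile_fallback(node)(inputs)
    if cache_id not in fallback_cache:
        fallback_cache[cache_id] = compile_fallback(node)
    return fallback_cache[cache_id](inputs)

node_without_id = SimpleNamespace()  # models a node whose compilation failed
print(run_fallback(node_without_id, [1, 2], lambda n: (lambda xs: sum(xs))))  # 3
```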
pytorchmergebot pushed a commit to pytorch/pytorch that referenced this pull request on Apr 28, 2022
Retry of #75983.
Pull Request resolved: #76505
Approved by: https://github.com/eellison
facebook-github-bot pushed a commit to pytorch/pytorch that referenced this pull request on Apr 30, 2022
Summary: Retry of #75983.
Pull Request resolved: #76505
Approved by: https://github.com/eellison
Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/e52dc9888bd7e30e467bd7ae729791885ec43f58
Reviewed By: osalpekar
Differential Revision: D36042346
Pulled By: davidberard98
fbshipit-source-id: 7f34a0ae65f9583b8390383400fd91f69c635fc8
jjsjann123 pushed a commit to jjsjann123/nvfuser that referenced this pull request on Oct 29, 2022
[NVFuser] always fallback if fusion fails
Pull Request resolved: pytorch/pytorch#75983
Approved by: https://github.com/jjsjann123

jjsjann123 pushed a commit to jjsjann123/nvfuser that referenced this pull request on Oct 29, 2022
Retry of #75983.
Pull Request resolved: pytorch/pytorch#76505
Approved by: https://github.com/eellison
jjsjann123 pushed a commit to jjsjann123/nvfuser that referenced this pull request on Nov 10, 2022
[NVFuser] always fallback if fusion fails
Pull Request resolved: pytorch/pytorch#75983
Approved by: https://github.com/jjsjann123

jjsjann123 pushed a commit to jjsjann123/nvfuser that referenced this pull request on Nov 10, 2022
Retry of #75983.
Pull Request resolved: pytorch/pytorch#76505
Approved by: https://github.com/eellison