Skip to content

Conversation

zasdfgbnm
Copy link
Collaborator

This would be needed for arange support, because for arange, then output tensor size is
ceil((end - start)/step), where end, start, step could be floating points.

@zasdfgbnm zasdfgbnm marked this pull request as draft August 3, 2022 21:53
@zasdfgbnm zasdfgbnm marked this pull request as ready for review August 3, 2022 22:27
Copy link
Collaborator

@naoyam naoyam left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Left some comments

@zasdfgbnm zasdfgbnm merged commit aafe2d0 into devel Aug 9, 2022
@zasdfgbnm zasdfgbnm deleted the eval branch August 9, 2022 07:31
jjsjann123 added a commit that referenced this pull request Aug 29, 2022
Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/

Code changes includes:

- codegen improvements:
  1. double support in expression evaluator
- bug fixes:
  1. dropout fix - rework RNG to support broadcasted dropout (Fixes pytorch#82784)
  2. expand fix - Patch expand+reduction, expand+view, rework view analysis and guard
- scheduler:
  1. manual transpose schedule example
  2. WIP transpose scheduler

Commits that's in this PR from the devel branch:

```
b7435af Transpose scheduler, step 1 (#1854)
8a45dbf Add an example on how to manually schedule transpose (#1889)
83dbf56 Patch dropout fix (#1898)
69d3519 Expand+Reduction, Expand+View support, rework View analysis and guards (#1883)
15091c4 Rework RNG to correctly support broadcasted dropout (#1888)
aafe2d0 Make ExpressionEvaluator support Double (#1885)
```

RUN_TORCHBENCH: nvfuser

Differential Revision: [D38657074](https://our.internmc.facebook.com/intern/diff/D38657074)
Pull Request resolved: pytorch#83239
Approved by: https://github.com/davidberard98
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants