forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 7
Continue broadcast support #77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
51 commits
Select commit
Hold shift + click to select a range
95287c2
Simplify a few test cases
tlemo 33a1b76
ExpressionEvaluator
tlemo 2adcd3c
Stricter EvaluationContext binding rules
tlemo a15ca79
Fix clang-format errors
tlemo 15fcfe0
Switch to Int::ScalarType
tlemo ca2ba1d
Avoid a fight with clang-tidy
tlemo 9a1c4c5
Add an optional arc from TensorView to its root domain
tlemo aa1e89e
Check the numbers of kernel input and output parameters
naoyam 787540d
Checks kernel arguments
naoyam 2763c13
Prefer pointers over references
naoyam 5d911c4
Bug fix
naoyam bc32d63
Fix accidental construction of IValue
naoyam b09fedd
Use noReduction
naoyam f7f261f
Add const to const pointer
naoyam de40ff6
Make an integer tensor an error as it is not yet supported
naoyam 6ef8a8d
clang-tidy
naoyam 1648bef
Incorporate review feedback
tlemo 74b2ed5
added lerp support in parser
jjsjann123 977d1fe
add missing addcmul parser and tests
jjsjann123 d32e77e
clang_format
jjsjann123 42213f2
Return TensorView* from binary/compound/ternary ops
naoyam c38b5cd
clang-format
naoyam 71c95c8
Use TensorView* param in reductionOp and sum
naoyam 2604af6
Prefer as instead of static_cast
naoyam 2f909f2
Transform replay refactor (#53)
csarofeen 82226e5
python test fixes (#52)
jjsjann123 dcb796e
[nvFuser] add torch.jit.fuser context manager (#38993) (#54)
jjsjann123 65ff3eb
Add another reduction example, change fusion printMath.
csarofeen 272aa1b
Small test fix.
csarofeen 5726ab4
Change Reduction4 test to use TIDx.x
csarofeen 73d7401
Minor cleanup.
csarofeen e1e5667
Clean up some noexcepts.
csarofeen 490d101
More cleanup.
csarofeen e3d8441
Refactor computeAt, get first broadcast example working.
csarofeen 831b222
Validate first non-trivial broadcast kernel.
csarofeen 1a88ce8
Fix replay when broadcast is merged with non-broadcast dim.
csarofeen 7c15591
Add constness in replay and index compute.
csarofeen 581223b
Add another broadcast test. Rework index computation for producers, b…
csarofeen 21e2989
Val isCconst fix.
csarofeen 71e4219
Add dot product gemm example.
csarofeen 550ca71
Clang.
csarofeen 4693817
Minor bug fixes.
csarofeen 943a15a
Format and add comments to GEMM test.
csarofeen 9dd01bd
WIP: Fix for enabling broadcast after reduction plus a Softmax test. …
kevinstephano b48a826
Backout bad merge conflict resolutions.
csarofeen 61ca498
More post rebase cleanup.
csarofeen fa80486
Refix a few tests. Some from a bad rebase.
csarofeen ad017a7
Address comments.
csarofeen 95f9f80
Missed some review comments.
csarofeen 2b77eb7
Merge branch '20_6_11_devel' of https://www.github.com/csarofeen/pyto…
csarofeen 629ec01
tmp
csarofeen File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
constexpr?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably not. We should want to verify the tests work with dynamically sized tensors.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think constexpr buys us anything here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These are compile time constants, right? If that's the case constexpr is the most explict language construct to express it (it would also prevent accidental value updates)