[RFC] Added unpaddedSize_ to Tensor #2970
Conversation
I had a pretty evil thought: do we ever check that the … I'm not sure that's a good idea; I'm just thinking about ways to avoid cluttering the main Tensor class.
Here's a different thought that feels safer than binding wrong-sized tensors: we already create an ExecutionContext that gets passed all the way down to the execute method where we need the size; maybe we should keep a parallel map of physical sizes in that object.
If we go with the design in this PR (passing the unpadded size in the Tensor), I'd like to make it a bit safer than simply adding a getter/setter; some suggestions are inline below.
uint64_t Tensor::getUnpaddedSize() {
  if (unpaddedSize_) {
    return unpaddedSize_;
If I'm right that virtually-padded tensors are only valid if unowned, assert that the tensor is unowned here.
Hmm, maybe we add it to PlaceholderBindings. I'm not sure I like that better than metadata directly on the Tensor, but that would work too. I guess the question there is: do we ever create a padded Tensor so far upstream that it'd be a pain to plumb the padding factor down to context creation?
OK, after sleeping on it I'm on board with the add-it-to-Tensor approach :-)
include/glow/Base/Tensor.h
size_t getUnpaddedSizeInBytes();

/// Set the size of unpaddedSize_ to \p size
void setUnpaddedSize(size_t size) { unpaddedSize_ = size; }
What do you think about killing the setter in favor of a new flavor of unowned constructor? I always like to avoid exposing data members that can be arbitrarily twiddled.
I'm comfortable approving this, especially to unblock work on top of it.
Would it make sense to add this to Type instead of Tensor?
CC @opti-mix who is looking into generalizing the padding support in the type system.
I've implemented some support for providing custom strides/alignments for dimensions of tensors on my private branch (related to #2686). E.g. you can say that the stride for a given dimension is a multiple of 64. If you only have 3 elements as the actual size of the dimension, the remaining 61 elements would be just "padding" elements filled with some garbage data. I've been planning to upstream it, though not necessarily in the coming weeks. On this branch, I've added a new … If you think that the ability to specify strides on a per-dimension basis would help with your issue, we could upstream it earlier than I initially planned. But I'm under the impression that you are trying to do something different here.
To provide context here, the problem we're trying to solve is that PCIe traffic sending inputs to devices is a bottleneck when we're using SparseLengthsSum operators. The problem is that we're forced to provide a maximally-sized "indices" input, when in fact we often use fewer than the maximum number of indices. (E.g., we might compile for a sequence length of 1000, but frequently use only 100.) So we end up transferring a bunch of zeros (or garbage) over PCIe, despite knowing (at runtime) the correct amount of data to transfer. I think this feature is probably orthogonal to strided tensors; it's more like cheapo support for dynamic shapes ;-).
That could be reasonable... although I'm not sure whether it's really a type-system property. I've been thinking of it more as a storage property of the Tensor, like the type is still Int64[2000] but it's "virtually padded" as needed rather than being physically padded to the right size.
I'd suggest not merging this PR yet. I've got some ideas I'd like to discuss next week.
Initially I preferred the Tensor approach, but I'm leaning towards adding it to Type now. What does it mean to, e.g., MatMul two tensors with the same Type but different paddings?
You could actually use this for matmul to express padding in the batch dimension. There are maybe two concepts here: (1) can this dimension be padded, which could go in …
lgtm, thanks!
@gcatron has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary:
This PR adds a new field to Tensor: unpaddedSize_.
Since we have a static batch size, we copy the entire tensor to the device regardless of the number of inputs. This change would allow a DeviceManager to copy only the inputs, if it supports that.
This means adding additional metadata to Tensor, which seemed cleaner than passing side-channel data.
Documentation: NA
Test Plan: ninja test