Cleaner semantics for Reserve #10261

jerryzh168 · 2018-08-06T17:25:16Z

Summary:

Reserve
Currently, Reserve will allocate new memory and old data in the tensor is also preserved,
and Resize is relying on this behavior in some call-site, e.g. https://github.com/pytorch/pytorch/blob/master/caffe2/operators/reservoir_sampling.cc#L103, where we should be using Extend.
We want to bring semantics of Reserve to be more aligned with std::vector, i.e. we want it to be
an optimization about memory allocation and remove the semantics about preserving the data. We'll remove the guarantee that data will be preserved after Reserve, and Extend will be the only API that preserves old data when we do in-place extension of memory. This also helps with the later refactoring on split Storage from Tensor.
Also, we'll only pass in the outer dimension to Reserve which means the later dimensions should be set before we call Reserve.
Extend/Shrink
Previously, Extend actually means ExtendBy and Shrink means ShrinkTo, I would like to add a ExtendTo for convenience, and change Shrink to ShrinkTo.
Old functions calling Extend is still there, although it actually means Extend by, but I think it still makes sense to have it.
Usage Patterns

The expected usage patterns right now is:

t->Resize({0, 32, 32, 32});
t->template mutable_data<T>(); // set meta_
t->Reserve(100);
auto* t_data = t->template mutable_data<T>();
// feed data to tensor using t_data
for (int i = 0; i < 100; ++i) {
  t->Extend(1, 50, &context_);
  // you can continue to use t_data if you have reserved enough space
  // otherwise, you should call t->template mutable_data<T> again to
  // get the new data pointer since Extend will allocate new memory even
  // though the original data is preserved.
}

Reviewed By: ezyang

Differential Revision: D9128147

Summary: 1. Reserve Currently, Reserve will allocate new memory and old data in the tensor is also preserved, and Resize is relying on this behavior in some call-site, e.g. https://github.com/pytorch/pytorch/blob/master/caffe2/operators/reservoir_sampling.cc#L103, where we should be using Extend. We want to bring semantics of Reserve to be more aligned with std::vector, i.e. we want it to be an optimization about memory allocation and remove the semantics about preserving the data. We'll remove the guarantee that data will be preserved after Reserve, and Extend will be the only API that preserves old data when we do in-place extension of memory. This also helps with the later refactoring on split Storage from Tensor. Also, we'll only pass in the outer dimension to Reserve which means the later dimensions should be set before we call Reserve. 2. Extend/Shrink Previously, Extend actually means ExtendBy and Shrink means ShrinkTo, I would like to add a ExtendTo for convenience, and change Shrink to ShrinkTo. Old functions calling Extend is still there, although it actually means Extend by, but I think it still makes sense to have it. 3. Usage Patterns The expected usage patterns right now is: ``` t->Resize({0, 32, 32, 32}); t->template mutable_data<T>(); // set meta_ t->Reserve(100); auto* t_data = t->template mutable_data<T>(); // feed data to tensor using t_data for (int i = 0; i < 100; ++i) { t->Extend(1, 50, &context_); // you can continue to use t_data if you have reserved enough space // otherwise, you should call t->template mutable_data<T> again to // get the new data pointer since Extend will allocate new memory even // though the original data is preserved. } ``` Reviewed By: ezyang Differential Revision: D9128147 fbshipit-source-id: 6163fe35fe729e71e1c10ec255011c92b70c60a7

jerryzh168 · 2018-08-06T19:18:01Z

rebuild succeeded.

jerryzh168 · 2018-08-06T19:18:15Z

https://ci.pytorch.org/jenkins/job/caffe2-builds/job/py2-cuda9.0-cudnn7-windows-trigger-build/11220/console

This reverts commit e0d4357.

Summary: Pull Request resolved: pytorch#10261 1. Reserve Currently, Reserve will allocate new memory and old data in the tensor is also preserved, and Resize is relying on this behavior in some call-site, e.g. https://github.com/pytorch/pytorch/blob/master/caffe2/operators/reservoir_sampling.cc#L103, where we should be using Extend. We want to bring semantics of Reserve to be more aligned with std::vector, i.e. we want it to be an optimization about memory allocation and remove the semantics about preserving the data. We'll remove the guarantee that data will be preserved after Reserve, and Extend will be the only API that preserves old data when we do in-place extension of memory. This also helps with the later refactoring on split Storage from Tensor. Also, we'll only pass in the outer dimension to Reserve which means the later dimensions should be set before we call Reserve. 2. Extend/Shrink Previously, Extend actually means ExtendBy and Shrink means ShrinkTo, I would like to add a ExtendTo for convenience, and change Shrink to ShrinkTo. Old functions calling Extend is still there, although it actually means Extend by, but I think it still makes sense to have it. 3. Usage Patterns The expected usage patterns right now is: ``` t->Resize({0, 32, 32, 32}); t->template mutable_data<T>(); // set meta_ t->Reserve(100); auto* t_data = t->template mutable_data<T>(); // feed data to tensor using t_data for (int i = 0; i < 100; ++i) { t->Extend(1, 50, &context_); // you can continue to use t_data if you have reserved enough space // otherwise, you should call t->template mutable_data<T> again to // get the new data pointer since Extend will allocate new memory even // though the original data is preserved. } ``` Reviewed By: ezyang Differential Revision: D9128147 fbshipit-source-id: e765f6566d73deafe2abeef0b2cc0ebcbfebd096

jerryzh168 changed the title ~~Cleaner semantics for Reserve and unification of Extend/Shrink~~ Cleaner semantics for Reserve Aug 6, 2018

facebook-github-bot closed this in e0d4357 Aug 6, 2018

ezyang added a commit to ezyang/pytorch that referenced this pull request Aug 6, 2018

Revert "Cleaner semantics for Reserve (pytorch#10261)"

1cd11db

This reverts commit e0d4357.

ezyang added the merged label Jun 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Cleaner semantics for Reserve #10261

Cleaner semantics for Reserve #10261

Uh oh!

jerryzh168 commented Aug 6, 2018

Uh oh!

jerryzh168 commented Aug 6, 2018

Uh oh!

jerryzh168 commented Aug 6, 2018

Uh oh!

Uh oh!

Cleaner semantics for Reserve #10261

Cleaner semantics for Reserve #10261

Uh oh!

Conversation

jerryzh168 commented Aug 6, 2018

Uh oh!

jerryzh168 commented Aug 6, 2018

Uh oh!

jerryzh168 commented Aug 6, 2018

Uh oh!

Uh oh!