
Add fetchVariable method to Session to get value of resource variable #261


Merged
merged 8 commits into tensorflow:master on Apr 6, 2021

Conversation

@rnett (Contributor) commented Mar 30, 2021

Small PR to add a method to Session to allow for the fetching of the values of resource variables. Fetches the resource tensor then uses an eager session to read the value.
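
For a sense of the intended usage, here is a minimal caller-side sketch. The exact fetchVariable signature, the op names ("init", "my_variable"), and the buildGraphWithVariable helper are assumptions for illustration, not the PR's actual code:

```java
import org.tensorflow.Graph;
import org.tensorflow.Session;
import org.tensorflow.types.TFloat32;

public class FetchVariableUsageSketch {
  public static void main(String[] args) {
    try (Graph g = buildGraphWithVariable(); // hypothetical helper defining a resource variable
         Session s = new Session(g)) {
      s.runner().addTarget("init").run();    // hypothetical initializer target
      // Fetch the variable's current value; the caller supplies the value
      // dtype, since the resource tensor itself does not carry it.
      // The typed return is also an assumption here.
      try (TFloat32 value = (TFloat32) s.fetchVariable("my_variable", TFloat32.class)) {
        System.out.println(value.getFloat());
      }
    }
  }

  private static Graph buildGraphWithVariable() {
    // Omitted: build a graph with a VarHandleOp "my_variable" and an "init" target.
    throw new UnsupportedOperationException("illustrative stub");
  }
}
```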

@rnett (Contributor, Author) commented Mar 30, 2021

The CI errors are from Javadoc; the tests are passing.

@Craigacp (Collaborator)

I don't think splitting fetch and fetchVariable up is a good idea. Users have no way to know whether the thing they are fetching should be fetched with fetchVariable or fetch without trying both and catching the runtime exception, and we should never require users to catch runtime exceptions.

In addition to the API issue, why does this require creating an eager session and constant op to read a value? What are the alternatives?

@rnett (Contributor, Author) commented Mar 30, 2021

Users can tell if their operand is a variable by checking the data type; maybe we should add an Operand.isResourceVariable()? But yeah, since we don't have a tensor type for resources, there's no good way to make it type-safe. Adding one is a possibility, and probably good to do at some point, but beyond me at the moment (it's not mappable to NDArrays, so it doesn't fit well with the existing tensor type setup).

Fetching a variable is almost always an error anyway, since it's not usually an output, especially if you don't realize that what you're fetching is a variable (e.g. #260). This is intended for the special use cases where you want to get a variable, in which case you'll know ahead of time that it's a variable.

To read a value from a resource tensor, you need the readVariableOp op, either in the graph or in an eager session. I could add the read ops to the graph and fetch those instead, reusing reads that are already present, but that adds extra ops to the graph, and it's hard to detect whether a suitable read already exists unless we limit reuse to reads added by this method: the attributes need to match too (e.g. the read can't be placed on another device), and we don't have a way to check those. It's worth considering, but creating eager sessions doesn't seem heavy, so I went with that.
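
For reference, a rough sketch of the graph-based alternative mentioned above: add a readVariableOp to the graph and fetch that instead of the resource. The variable op name and the float dtype are assumed; this is not the PR's code:

```java
import org.tensorflow.Graph;
import org.tensorflow.Output;
import org.tensorflow.Session;
import org.tensorflow.op.Ops;
import org.tensorflow.op.core.ReadVariableOp;
import org.tensorflow.types.TFloat32;

class VariableReadSketch {
  // Adds a read of the variable's resource handle to the graph and fetches it.
  static TFloat32 readFloatVariable(Graph g, Session session, String varOpName) {
    Output<?> handle = g.operation(varOpName).output(0); // resource output of the VarHandleOp
    Ops tf = Ops.create(g);
    ReadVariableOp<TFloat32> read = tf.readVariableOp(handle, TFloat32.class);
    return (TFloat32) session.runner().fetch(read).run().get(0);
  }
}
```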

@Craigacp (Collaborator)

I think fetching variables is entirely reasonable, as you can fetch everything else in the graph by name. Having a set of things that you can't fetch, where the only way to find out is to try it and catch a runtime exception, is bad.

I'd expect TF Python to add a readVariableOp during variable creation or when the graph is saved, so shouldn't those ops exist in any graph that has resource tensors?

@rnett (Contributor, Author) commented Mar 30, 2021

> I think fetching variables is entirely reasonable, as you can fetch everything else in the graph by name. Having a set of things that you can't fetch, where the only way to find out is to try it and catch a runtime exception, is bad.

Agreed, but I don't think it's possible, at least for now. The reason I did this in a separate method is that we can't fetch variables normally: there's no resource TType, so tensor creation fails. Even if we added one, it wouldn't fit with the existing TType setup, since the only method we would want to provide is read(Class<? extends TType>). We can't even auto-read variables in fetch, since they don't specify their value's dtype (at least not anywhere I can find), so it needs the class parameter.
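
To make the constraint concrete, here is a purely hypothetical sketch of what such a type could look like; neither TResource nor this read method exists in the current API:

```java
import org.tensorflow.types.family.TType;

// Hypothetical only: a tensor type for DT_RESOURCE, as discussed above.
public interface TResource extends TType {
  // Reads the variable's current value. The caller must name the value
  // dtype, since a resource tensor does not record it.
  <T extends TType> T read(Class<T> valueType);
}
```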

> I'd expect TF Python to add a readVariableOp during variable creation or when the graph is saved, so shouldn't those ops exist in any graph that has resource tensors?

I think it does, but it's possible for people to use the raw op rather than tf.Variable. Checking for a read op is probably the better route though.

@Craigacp (Collaborator)


Why not have it check for the read op and then silently redirect the fetch to that read op? That would give us back a non-resource tensor, right? Otherwise, unpicking the resource and exporting it into a regular tensor seems fine too. I'm wary of spinning up and throwing away eager sessions just to access a variable; there's a bunch of stuff that happens in TFE_NewContext.

@rnett (Contributor, Author) commented Mar 30, 2021

It looks like it is possible to get the variable's dtype from its attribute, so I can do everything with graph-based ops, and automatically. However, once we eventually support resource tensors, fetching a resource tensor will be a legitimate operation, so there still needs to be some way to differentiate between fetch with conversion and fetch without. Maybe make fetch auto-convert for now, and once we have resource tensor support, add a fetchVariable that returns the resource tensor itself?
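
A sketch of what that automatic conversion in fetch could look like; valueDtypeOf stands in for however the implementation would read the VarHandleOp's "dtype" attribute, and none of this is the PR's actual code:

```java
import org.tensorflow.Graph;
import org.tensorflow.Output;
import org.tensorflow.op.Ops;
import org.tensorflow.proto.framework.DataType;
import org.tensorflow.types.family.TType;

class AutoReadSketch {
  // If the output to fetch is a resource, fetch a read of it instead.
  static Output<?> wrapIfVariable(Graph g, Output<?> output) {
    if (output.dataType() == DataType.DT_RESOURCE) {
      Class<? extends TType> valueType = valueDtypeOf(output); // hypothetical attribute lookup
      return Ops.create(g).readVariableOp(output, valueType).asOutput();
    }
    return output;
  }

  static Class<? extends TType> valueDtypeOf(Output<?> resource) {
    // Hypothetical: map the owning VarHandleOp's "dtype" attribute to a TType class.
    throw new UnsupportedOperationException("illustrative stub");
  }
}
```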

@rnett (Contributor, Author) commented Mar 30, 2021

OK, fetch now automatically wraps variables in a read op in the graph.
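
From the caller's side, a plain fetch by name should now be enough; a minimal sketch, assuming a variable op named "my_variable":

```java
import org.tensorflow.Session;
import org.tensorflow.types.TFloat32;

class FetchByNameSketch {
  static void printVariable(Session session) {
    // The session wraps the resource output in a read op behind the scenes,
    // so a resource variable fetches like any other output.
    try (TFloat32 value = (TFloat32) session.runner()
        .fetch("my_variable") // assumed variable op name
        .run()
        .get(0)) {
      System.out.println(value.getFloat());
    }
  }
}
```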

@karllessard (Collaborator) commented Mar 31, 2021

@rnett, did you try simply using the TF_TensorData endpoint of the C API to get a pointer to the variable's native buffer directly? That's what the 1.x client used, though it made a copy before returning it. I'm not sure whether that supported variables, but it could be interesting to try.

BTW, this is mostly out of curiosity; I'm also fine with the readVariableOp approach you and @Craigacp discussed earlier.

@rnett (Contributor, Author) commented Mar 31, 2021

I didn't try that, but I assume the 1.x client was using the old variables, not resource variables. I'm not sure how to get a tensor from that buffer, either.

@karllessard (Collaborator)

> I didn't try that, but I assume the 1.x client was using the old variables, not resource variables. I'm not sure how to get a tensor from that buffer, either.

Probably via that endpoint; it should be easy enough to convert the TF_Buffer to a ByteDataBuffer.
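
For illustration, the wrapping step might look roughly like this, assuming the ndarray DataBuffers factory can wrap a NIO buffer, and with acquireNativeView as a hypothetical helper exposing TF_TensorData/TF_TensorByteSize as a ByteBuffer:

```java
import java.nio.ByteBuffer;
import org.tensorflow.ndarray.buffer.ByteDataBuffer;
import org.tensorflow.ndarray.buffer.DataBuffers;

class NativeBufferSketch {
  static ByteDataBuffer copyOfNativeData() {
    ByteBuffer nativeView = acquireNativeView(); // hypothetical view over TF_TensorData
    ByteBuffer copy = ByteBuffer.allocate(nativeView.remaining());
    copy.put(nativeView).rewind();               // copy first, as the 1.x client did
    return DataBuffers.of(copy);
  }

  static ByteBuffer acquireNativeView() {
    // Hypothetical: a ByteBuffer over the memory at TF_TensorData(tensor),
    // of length TF_TensorByteSize(tensor).
    throw new UnsupportedOperationException("illustrative stub");
  }
}
```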

But anyway, if Python does it with readVariableOp, then let's do that as well.

@rnett (Contributor, Author) commented Mar 31, 2021

> But anyway, if Python does it with readVariableOp, then let's do that as well.

👍 The read ops are already added to the graph during variable creation in Python, and I plan on doing something similar for our version, so it should be fine.

@Craigacp (Collaborator) left a comment

A couple of small things about logging/messages, and I think a test that loads a Python-created graph is important to have.

@rnett requested a review from Craigacp, April 4, 2021 02:03
@Craigacp (Collaborator) left a comment

LGTM

@Craigacp Craigacp merged commit e229028 into tensorflow:master Apr 6, 2021