Refactor modules and integrate JavaCPP to map the C API #1

saudet · 2019-09-26T15:57:35Z

This basically integrates JavaCPP in the way discussed previously at karllessard#2, in addition to building with MKL enabled and upgrading to TF 2.0.0-rc2. (Maybe src/gen/java is too much? It should change only little by little from now on though.)

I also took the liberty to refactor tentatively the modules a bit, especially the naming. The module that launches Bazel is "tensorflow-core-api", whose parent is "tensorflow-core", so we don't end up with something odd like "parent-core", which also doesn't match with the subdirectory name. Other modules in "tensorflow-core" also use it as prefix, "tensorflow-core-platform" and "tensorflow-core-examples", which contains the LabelImage example that I moved there and a HelloWorld one for the C API. I've also moved the single source file from "annotation-processor" to "tensorflow-core-api" since we can execute all that in the same module. What do you think about that? It's not supposed to be used for anything else, right?

This naming scheme also works for others: "tensorflow-utils" -> "tensorflow-utils-nio", "tensorflow-frameworks" -> "tensorflow-frameworks-keras", but we can still deviate a bit and do something like "tensorflow-keras". (BTW, I think we should use a different name than "keras", like "kerasj", for example.) If people follow even approximately these rules, I think we should get a pretty nice and consistent repository such as the one for Spring Boot: https://github.com/spring-projects/spring-boot

The project builds and runs fine on Linux and Mac here, but I still get a weird compiler error on Windows. I'll eventually get it working, but other than that I think it's ready for a review! (BTW, I had to copy .bazelrc in there. Is there a better way to achieve this?)

googlebot · 2019-09-26T15:57:40Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.

What to do if you already signed the CLA

Individual signers

It's possible we don't have your GitHub username or you're using a different email address on your commit. Check your existing CLA data and verify that your email is set on your git commits.

Corporate signers

Your company has a Point of Contact who decides which employees are authorized to participate. Ask your POC to be added to the group of authorized contributors. If you don't know who your Point of Contact is, direct the Google project maintainer to go/cla#troubleshoot (Public version).
The email used to register you as an authorized contributor must be the email used for the Git commit. Check your existing CLA data and verify that your email is set on your git commits.
The email used to register you as an authorized contributor must also be attached to your GitHub account.

ℹ️ Googlers: Go here for more info.

saudet · 2019-09-26T22:36:54Z

@googlebot I signed it!

googlebot · 2019-09-26T22:36:57Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

karllessard · 2019-09-28T21:55:25Z

I also took the liberty to refactor tentatively the modules a bit, especially the naming. The module that launches Bazel is "tensorflow-core-api", whose parent is "tensorflow-core", so we don't end up with something odd like "parent-core", which also doesn't match with the subdirectory name. Other modules in "tensorflow-core" also use it as prefix, "tensorflow-core-platform" and "tensorflow-core-examples", which contains the LabelImage example that I moved there and a HelloWorld one for the C API.

I like those new names as well. But I would remove completely the tensorflow-core-examples folder, I was planning to move those samples to our second repo (github.com/tensorflow/java-models).

I've also moved the single source file from "annotation-processor" to "tensorflow-core-api" since we can execute all that in the same module. What do you think about that? It's not supposed to be used for anything else, right?

Like we discussed on the call, I'm pretty sure the annotation processor needs to be in its own artifact because we should depend on it when building the tensorflow-core-api jar. See how it is done here and here. Let me know you find another way to make it work, but I suggest that we run that processor outside Bazel even in that first phase.

(BTW, I had to copy .bazelrc in there. Is there a better way to achieve this?)

Looks like TensorFlow Serving is doing the same so I guess it is OK?

karllessard · 2019-09-28T22:05:51Z

tensorflow-core/tensorflow-core-api/build.sh

+fi
+
+# Build TensorFlow itself
+bazel build --python_path="$PYTHON_BIN_PATH" --config=mkl --output_filter=DONT_MATCH_ANYTHING --verbose_failures @org_tensorflow//tensorflow:tensorflow @org_tensorflow//tensorflow/java:tensorflow


Can we build only targets that we actually need for the Java part? We don't have to build the whole client, afaik we just need the operation wrappers, which are provided by this target //tensorflow/java:java_op_gen_sources, in replacement to //tensorflow/java:tensorflow.

No, we can't, unfortunately, that is, not without patching TensorFlow, those targets are not public. Though, I've had to add a patch for MKL, since the fix isn't in 2.0.0. We can add more patches if we want to go that way.

Yeah, I think we can have @sjamesr commit this very quickly in the main repository, I'll check with him but not super important neither, we don't have to wait for this.

tensorflow-core/tensorflow-core-api/build.sh

karllessard · 2019-09-28T22:08:48Z

tensorflow-core/tensorflow-core-api/pom.xml

+      <artifactId>javapoet</artifactId>
+      <version>1.11.1</version>
+      <optional>true</optional> <!-- for compilation only -->
+    </dependency>


javapoet and guava are just required for the annotation-processor, which should be move to its own artifact (unless you find a way to make it work where you put it).

AFAIK, it's working just fine, yes. If we uncomment the lines that start with <!-- Bazel currently runs this, it gets executed just fine. However, it outputs an error because it doesn't want to overwrite existing classes:

[ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.8.0:compile (default-compile) on project tensorflow-core-api: Fatal error compiling: java.lang.AssertionError: javax.annotation.processing.FilerException: Attempt to recreate a file for type org.tensorflow.op.NnOps -> [Help 1]

Yes, that's because you copied those files (NnOps & friends) in your bash script previously, if you remove that last line in the script I pointed out, it should be fine... so you are saying that the annotation-processor runs successfully directly from the core-api jar?

Yes, that's correct. If I delete src/gen/java, it ends up generating them in target/generated-sources/annotations/. Is there a way to change that?

Ah, here we go: https://maven.apache.org/plugins/maven-compiler-plugin/compile-mojo.html#generatedSourcesDirectory So that should work alright...

Ok, done! But I haven't been able to get it to output to src/gen/java. I tried to hack it a dozen ways, but maven-compiler-plugin is hard coded to not compile anything from the generatedSourcesDirectory directory, which makes sense I guess, but it would be nice if we could use the includes and excludes filters instead. Anyway, it's now outputting to src/gen/annotations. Maybe we can do the same with the others like src/gen/javacpp and src/gen/ops...

tensorflow-core/tensorflow-core-api/src/main/java/org/tensorflow/examples/BUILD

tensorflow-core/tensorflow-core-examples/src/main/java/org/tensorflow/examples/LabelImage.java

karllessard · 2019-09-28T22:20:05Z

tensorflow-core/tensorflow-core-api/src/main/java/org/tensorflow/c_api/AbstractTF_Buffer.java

+    public void delete() {
+        deallocate();
+    }
+}


Is this class (and all other classes found in src/main/java/org/tensorflow/c_api) actually used by your PR? If not, I would prefer we wait for the second phase before adding them.

Well, they are the base classes of TF_Buffer, etc. They are functional, and we can modify them later as well. What are you worried about?

I just don't find any additional value for having classes in the code base that are actually not used, it creates unnecessary confusion and they can be easily reviewed and added later as part of the whole migration of the JNI code.

But they are used. Why do you say that they are not?

Oops, sorry I didn’t read carefully that last reply of yours, all good then.

So generated classes can extend from human-written ones? Just for my knowledge, where this magic is happening?

Ok got it: https://github.com/saudet/tensorflow-java/blob/18ea0b7939442d4539a61c8c0a164bc15f9ead51/tensorflow-core/tensorflow-core-api/src/main/java/org/tensorflow/c_api/presets/tensorflow.java#L118

karllessard · 2019-09-28T22:25:21Z

tensorflow-core/tensorflow-core-api/pom.xml

+            <configuration>
+              <skip>${javacpp.parser.skip}</skip>
+              <outputDirectory>${project.basedir}/src/gen/java</outputDirectory>
+              <classOrPackageName>org.tensorflow.c_api.presets.*</classOrPackageName>


The TF Java code is based upon the Google Java Style Guide, which states that package names shouldn't have underscores in it...

Can we rename the c_api package to something else, like nativeapi?

(BTW would be great that generated class names also stick to camelcase, with no underscores, but I guess that could be a challenge to change this, right? e.g TFEContext instead of TFE_Context, or DeallocatorPointerLongPointer instead of Deallocator_Pointer_long_Pointer)

Well, we can start customizing names like that yes, but the idea with JavaCPP is to map the underlying API to something that's as close as possible to the native API. If we start renaming things randomly like that, it increases the chances of conflicts and it makes it harder to understand the mapping. Besides, the C API is very hard to use, it's not meant to be used by end users, and making it even harder to use isn't going to help. :)

Since the classes won't be public, I'm ok to just rename the package as well.

My thought was that having a package name like c_api makes it less likely for the casual to want to use it. I also thought about capi, but I just don't find it as obvious as c_api, which is used by TensorFlow in the docs making it clearer that it's related: https://www.tensorflow.org/install/lang_c

Maybe let's get the opinion of at least one other person? In a lot of places where JavaCPP just uses _, I know Panama uses $ because, you know, $ isn't a valid character in C/C++ identifiers, but it is in Java!! (I wonder how they are going to get this passed the CSR group...) @Craigacp Any opinions about this?

I think for internal package names which we won't expose to the user then _ is fine. Panama uses $ for internally generated things because it guarantees no conflicts, but in this case the header file is actually called c_api.h, so mirroring the name is fine. Panama's jextract gives it the same name (see here https://hg.openjdk.java.net/panama/dev/raw-file/c359a9e944de/doc/panama_foreign.html#using-tensorflow-c-api-mac-os).

karllessard · 2019-09-28T22:31:59Z

(Maybe src/gen/java is too much? It should change only little by little from now on though.)

Yeah, I'm still not sure what we should do with those generated files neither... not having them committed though is pretty annoying when you want to write unit tests that uses them, for example... let's keep it there and agree that we might change our mind later.

tensorflow-core/tensorflow-core-api/pom.xml

pom.xml

karllessard · 2019-09-28T23:48:14Z

An idea for the annotation processor artifact: when we'll migrate the Java operators generator from C++ to Java, we will probably move the code in the same artifact as this processor. So we can give it a more generic name, like tensorflow-op-processor or anything better you may find

saudet · 2019-09-29T10:01:43Z

An idea for the annotation processor artifact: when we'll migrate the Java operators generator from C++ to Java, we will probably move the code in the same artifact as this processor. So we can give it a more generic name, like tensorflow-op-processor or anything better you may find

Good point. We can make the name even more generic too, like tensorflow-core-generator to stuff there anything we don't need at runtime.

SidneyLann · 2019-10-01T07:59:30Z

The java TF 2.0 available now? have more than 90% functionalities of TF 2.0?

karllessard · 2019-10-02T01:29:10Z

Good job @saudet , we are almost good to go, just want to follow up with you about my previous suggestion to remove completely tensorflow-core-examples from this repo.

I think it could be nice to have all the sample code in the java-models repo instead, to replicate what the core team is doing (sample code is found in /tensorflow/models instead of /tensorflow/tensorflow).

Otherwise, everything looks fine to me

karllessard · 2019-10-02T01:48:57Z

The java TF 2.0 available now? have more than 90% functionalities of TF 2.0?

@SidneyLann, I think the closest TF Java is to TF 2.0 is that it supports eager execution. Work is currently in progress to add a Keras-like API on top of it and and to handle functional graphs.

SidneyLann · 2019-10-02T02:56:11Z

The java TF 2.0 available now? have more than 90% functionalities of TF 2.0?

@SidneyLann, I think the closest TF Java is to TF 2.0 is that it supports eager execution. Work is currently in progress to add a Keras-like API on top of it and and to handle functional graphs.

https://github.com/tensorflow/swift/blob/master/docs/WhySwiftForTensorFlow.md, this link said java can't be used for Graph Program Extraction because java can't do static analysis, need to use swift, can you talk about this problem?

Craigacp · 2019-10-02T03:04:13Z

@SidneyLann Those are different issues. Functional graphs as implemented in TF2 in python are different from extracting a graph program directly from the Swift program. As I understand it the Swift one is more precise, whereas the Python one requires tracing the annotated functions to determine the currently executing codepath. It's possible to do the tracing in Java, that's kinda how eager mode works for backprop.

Java's type system isn't quite powerful enough to express the kind of generics you'd like to use for static compile time typing of tensors (including their shapes), but to get a type system that powerful requires lots of other features which make it harder to use and can lead to a lot of complexity. I'm not sure if Swift's is powerful enough for that, but the Swift for Tensorflow project encompasses changes to the Swift language and core libraries, which allows you to do more than is possible in a normal library.

Also, this is pretty off topic for this pull request, if you want to talk about TF on the JVM you can post to the Tensorflow JVM SIG Google Group, or go to the Gitter (https://gitter.im/tensorflow/sig-jvm).

saudet · 2019-10-02T13:11:36Z

Good job @saudet , we are almost good to go, just want to follow up with you about my previous suggestion to remove completely tensorflow-core-examples from this repo.

I think it could be nice to have all the sample code in the java-models repo instead, to replicate what the core team is doing (sample code is found in /tensorflow/models instead of /tensorflow/tensorflow).

Otherwise, everything looks fine to me

Ah, I didn't realize you wanted to remove everything even my little "HelloWorld" :) I was thinking we could leave snippets and maybe even some documentation there for "core developers". Do you see a better place for those? They wouldn't go in another repo... Maybe the wiki? It doesn't appear to be enabled on TensorFlow repos though.

karllessard · 2019-10-02T22:15:31Z

@saudet, I am discussing this with @tzolov, we’ll definitely have a page for developers, need to figure out where, but let’s keep that topic pending for now and remove this example artifact from the current PR, I don’t think your “Hello World” will be missed anyway :D

But if you feel sentimental about it, maybe you could move it as an integration test in the src/test folder of the core-api until we figure this out?

karllessard · 2019-10-02T23:23:32Z

... at the same time, it’s up to you, we can keep it there and move it later as well, let me know.

saudet · 2019-10-03T01:35:32Z

Yes, unit tests as documentation, that works too. Ok, it's done!

Sync Clarke fork

Refactor modules and integrate JavaCPP to map the C API

b3abb9b

karllessard requested changes Sep 28, 2019

View reviewed changes

tensorflow-core/tensorflow-core-api/pom.xml Outdated Show resolved Hide resolved

karllessard requested changes Sep 28, 2019

View reviewed changes

pom.xml Outdated Show resolved Hide resolved

saudet added 3 commits September 29, 2019 09:38

Fix build on Windows

b762ccb

Remove completely examples from tensorflow-core-api module

a9e2c7c

Revert to a released version of JavaCPP

477a891

Change version to 0.1.0-SNAPSHOT and remove LabelImage example

18ea0b7

Use OperatorProcessor as part of new tensorflow-core-generator module

13cfed2

Update to released version of TensorFlow 2.0.0

ab324e9

Move HelloWorld example to a unit test of tensorflow-core-api

3e1746d

Update README.md with new module names and version number

08477d4

saudet changed the title ~~[WIP] Refactor modules and integrate JavaCPP to map the C API~~ Refactor modules and integrate JavaCPP to map the C API Oct 3, 2019

karllessard approved these changes Oct 4, 2019

View reviewed changes

karllessard merged commit 9eb0357 into tensorflow:master Oct 4, 2019

karllessard added a commit that referenced this pull request Apr 28, 2020

Skip MKL-specific op and update protos (#1)

467fb7a

karllessard pushed a commit that referenced this pull request Oct 8, 2020

Merge pull request #1 from tensorflow/master

c86d09b

Sync Clarke fork

JimClarke5 mentioned this pull request May 19, 2021

Framework: Move Ops parameter to call method where possible #202

Open

Refactor modules and integrate JavaCPP to map the C API #1

Refactor modules and integrate JavaCPP to map the C API #1

Uh oh!

Conversation

saudet commented Sep 26, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

googlebot commented Sep 26, 2019

What to do if you already signed the CLA

Individual signers

Corporate signers

Uh oh!

saudet commented Sep 26, 2019

Uh oh!

googlebot commented Sep 26, 2019

Uh oh!

karllessard commented Sep 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

karllessard Sep 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

karllessard commented Sep 28, 2019

Uh oh!

Uh oh!

Uh oh!

karllessard commented Sep 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

saudet commented Sep 29, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SidneyLann commented Oct 1, 2019

Uh oh!

saudet commented Sep 26, 2019 •

edited

Loading

karllessard commented Sep 28, 2019 •

edited

Loading

karllessard Sep 28, 2019 •

edited

Loading

karllessard commented Sep 28, 2019 •

edited

Loading

saudet commented Sep 29, 2019 •

edited

Loading

Craigacp commented Oct 2, 2019 •

edited

Loading