Implement just-in-time context resolution. #342

dlongley · 2019-12-07T18:35:38Z

This PR removes the preprocessing step for resolving contexts and replaces its functionality with JIT context resolution. This should fix problems with mutating JSON literal content. This feature also allows document loaders to return an additional tag property that jsonld.js can use to more efficiently cache and reuse already processed contexts.

- Add optional `tag` feature processing to returned RemoteDocuments. A `tag` will be understood to mean that the same context document needn't be processed twice. A special `tag` value of `static` is interpreted to mean that a context does not even need to be retrieved more than once.
- Enables greater reuse of already processed contexts and quicker discovery (via `static` tag) of already processed contexts for a given context URL (instead of requiring the context content itself to be searched for and found in a cache, only its URL needs to be found).
- Addresses #339.
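
As an illustration of the `tag` property described above, here is a minimal sketch of a document loader that marks known, unchanging contexts as `static`. The names `STATIC_CONTEXTS` and `createTaggingLoader` and the `example.com` URL are hypothetical and not part of this PR; only the `tag` property and its `static` value come from the description.

```javascript
// Hypothetical table of context URLs whose content is assumed never to change.
const STATIC_CONTEXTS = new Map([
  ['https://example.com/contexts/v1', {
    '@context': {name: 'http://schema.org/name'}
  }]
]);

// Wraps an existing document loader (assumed to be an async function taking
// a URL) so that known contexts are served locally and tagged `static`.
function createTaggingLoader(fallbackLoader) {
  return async function documentLoader(url) {
    if(STATIC_CONTEXTS.has(url)) {
      return {
        contextUrl: null,
        document: STATIC_CONTEXTS.get(url),
        documentUrl: url,
        // per the PR description: no need to retrieve this context again
        tag: 'static'
      };
    }
    // anything else is delegated and returned untagged
    return fallbackLoader(url);
  };
}
```

The wrapped loader would be passed wherever the application already supplies its `documentLoader` option; untagged documents behave as before.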

gkellogg

Generally, LGTM, aside from some bits I don't quite understand without walking through it with an example.

Presumably, base is handled properly per recent tests.

gkellogg · 2019-12-07T18:59:11Z

lib/context.js

+  const resolved = await options.contextResolver.resolve({
+    context: localCtx,
+    documentLoader: options.documentLoader,
+    base: options.base


Is this the actual document location, or potential value of @base?

I don't think it's ever @base (could be wrong). It is either a document location (if the API was invoked with a URL) or the base passed via the API options. The tests pass.

gkellogg · 2019-12-07T19:02:48Z

lib/ContextResolver.js

+        if(!resolved) {
+          // not resolved yet, resolve
+          resolved = await this._resolveRemoteContext(
+            {url: ctx, documentLoader, base, cycles});


Does this have the effect of serializing all remote context fetches?

_resolveRemoteContext will fetch all remote contexts (recursively) and return an array of resolved contexts. Note that this does not include any scoped contexts, i.e., it doesn't deeply inspect the resolved contexts for those, but it will resolve any relative URLs encountered in scoped contexts to ensure the base URL used is proper. Any scoped contexts will be resolved later, JIT.

If by this you meant "the await" -- then, yes, an array of context URLs will be loaded serially now. A future PR can add more complexity here to parallelize that.
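
To make the serial-versus-parallel distinction concrete, here is a rough sketch of the two loading strategies. The helpers `loadSerially` and `loadInParallel` are hypothetical and are not the code in `lib/ContextResolver.js`; `documentLoader` is assumed to be an async function that takes a URL.

```javascript
// Serial: each fetch starts only after the previous one has finished,
// which is what an `await` inside a loop amounts to.
async function loadSerially(urls, documentLoader) {
  const docs = [];
  for(const url of urls) {
    docs.push(await documentLoader(url));
  }
  return docs;
}

// Parallel: every fetch starts immediately. Promise.all returns results in
// the same order as the input array, so context order is preserved.
async function loadInParallel(urls, documentLoader) {
  return Promise.all(urls.map(url => documentLoader(url)));
}
```

Both helpers return the documents in input order. The parallel version rejects as soon as any single fetch fails, so error handling and cycle detection would need more care than this sketch shows.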

gkellogg · 2019-12-07T19:06:42Z

lib/ContextResolver.js

+    let remoteDoc;
+
+    try {
+      remoteDoc = await documentLoader(url);


Note that we're going to need to pass in API arguments to the documentLoader for options such as extractAllScripts, but this can be handled in a different PR.

Yeah. I didn't want to do anything with that in this PR but it will need to be addressed in a subsequent PR.

dlongley · 2019-12-07T21:03:20Z

@gkellogg,

> Presumably, base is handled properly per recent tests.

The tests pass, including the ones that check that URL resolution follows RFC 3986.

davidlehn · 2019-12-09T17:55:09Z

lib/jsonld.js

@@ -90,6 +91,11 @@ const wrapper = function(jsonld) {
 /** Registered RDF dataset parsers hashed by content-type. */
 const _rdfParsers = {};

+// resolved context cache
+// TODO: consider basing max on context size rather than number
+const RESOLVED_CONTEXT_CACHE_MAX_SIZE = 100;


Is this adjustable? Should there be some API so it can be adjustable or a custom cache could be used?

It was not adjustable in the previous version, so there's no change here. We can make it so in a future PR.

davidlehn · 2019-12-09T17:56:01Z

lib/ContextResolver.js

+    // resolve, cache, and return context
+    const resolved = await this.resolve(
+      {context, documentLoader, base, cycles});
+    this._cacheResolvedContext({key: url, resolved, tag: remoteDoc.tag});


Need some docs on the tag feature. Is 'static' special?

`static` is a special value, yes. We can document it in a separate PR.
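
Until that documentation exists, the following sketch shows one way a tag-aware, size-bounded cache could behave, based only on the PR description. It is not the implementation in this PR: the class `TaggedContextCache` and its method names are hypothetical, and the choice not to cache untagged documents is an assumption made to keep the example small.

```javascript
class TaggedContextCache {
  constructor(max = 100) {
    this.max = max;
    this.entries = new Map(); // url -> {tag, resolved}
  }

  // Lookup before fetching: a `static` entry can be reused from its URL
  // alone, so the document loader need not be called at all.
  getStatic(url) {
    const entry = this.entries.get(url);
    return entry && entry.tag === 'static' ? entry.resolved : undefined;
  }

  // Lookup after fetching: reuse the processed result only if the loader
  // reported the same tag as the cached entry.
  getByTag(url, tag) {
    const entry = this.entries.get(url);
    return entry && tag !== undefined && entry.tag === tag ?
      entry.resolved : undefined;
  }

  set(url, tag, resolved) {
    if(tag === undefined) {
      return; // assumption: untagged documents are not cached here
    }
    // A Map iterates in insertion order, so the first key is the entry
    // that was inserted earliest; evict it when the cache is full.
    if(!this.entries.has(url) && this.entries.size >= this.max) {
      this.entries.delete(this.entries.keys().next().value);
    }
    this.entries.set(url, {tag, resolved});
  }
}
```

With a cache like this, a resolver would call `getStatic(url)` first, fetch only on a miss, then try `getByTag(url, remoteDoc.tag)` before processing the context again.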

dlongley requested review from gkellogg and davidlehn December 7, 2019 18:35

gkellogg approved these changes Dec 7, 2019

davidlehn reviewed Dec 9, 2019

davidlehn approved these changes Dec 9, 2019

davidlehn merged commit 185ff74 into master Dec 9, 2019

davidlehn deleted the jit-context-processing branch December 9, 2019 21:03

This was referenced Dec 9, 2019

Add RemoteDocument tag documentation #343

Open

Resolve remote documents in parallel #344

Open

dlongley mentioned this pull request May 26, 2020

Do not process @context in JSON literals #339

Closed
