Import callback should separate relative path conversion from returning file content #239

fare · 2016-10-04T16:20:43Z

When you import a same file twice, the evaluation results are not cached.

Worse: the callback API for import does not distinguish (1) a file lookup step, (2) a file read step and (3) a code evaluation step. Therefore, a second import of the same file will have the callback return new file contents, which makes either cache or reevaluation cumbersome.

For compatibility with existing API, a SHA checksum for the contents could serve as the key to cache evaluation results.

In a newer API, the import function would be broken in three steps, and each could be cached separately, so that (1) file lookup returns an absolute file path or URL from the (probably relative) string of the requested import depending on the current file, (2) file read returns the file contents given the path or URL, and (3) code evaluation evaluates the code.

fare · 2016-10-04T16:24:44Z

Note that with library files that include other library files, etc., you can easily get exponential blow off in re-importing the very same library functions N times:

B imports A
C imports B, A
D imports C, B, A
...
Z imports Y, X, ... B, A

Even with import only as headers, Z may load A factorial 26 times, instead of just once.

mikedanese · 2016-10-04T16:50:57Z

When you import a same file twice, the evaluation results are not cached.

The vm caches file content.

https://github.com/google/jsonnet/blob/master/core/vm.cpp#L465-L467

Are you talking about the parsed AST?

sparkprime · 2016-10-04T18:50:41Z

On splitting into 3 parts: So, in another project I used a special string literal syntax to resolve relative paths, see http://www.gritengine.com/grit_book/lua.html#lua_paths

I was thinking in Jsonnet you sometimes want to refer to a file in the same directory as the jsonnet file, which would be properly expanded so that the manifested JSON still makes sense. There is a std.thisFile to help with this but it's uglier than the `` in Grit or the import/importstr which does it implicitly.

With that in mind, it would be nice to have some `` or other string literal syntax that means expand the relative path to an absolute path from the current working directory of the jsonnet executable. In that world, import would not resolve relative paths anymore, and that would simplify its implementation. However that is a breaking change for the import syntax so phasing it in would be tricky.

On cacheing and exponential blow up -- yes this is a concern. It would be good to have a way to optimize the case of the diamond where two files both include the same file and not having to re-execute it. It's not clear to me how much benefit this would bring in real use cases though because typically files contain a big object literal with a few locals above it for file-level constants and imports. So the constant factor would be small but maybe the exponential blowup means it is still a problem.

fare · 2016-10-04T22:39:59Z

The pattern I'm using in my jsonnet libraries, which reminds me of how they do things in NixOS, is that instead of importing other files and evaluating to an object, like they used to do, my files instead evaluate to a function that takes an environment as argument and return the object as "linked" to the environment. The toplevel library file closes the loop by defining the toplevel environment recursively passed to all the other files.

One advantage is performance by only loading each file once (at most, if used, lazily). Another advantage is that it becomes easy to enable or disable debugging in all files "just" by toggling it in the toplevel environment.

sparkprime · 2016-10-06T14:31:57Z

In that case the file is a big function literal :)

There is currently no optimization to hoist local variables out of the function when their expression does not depend on the function parameters. This means it will be recomputed each time you call the function. You can do that optimization manually and if it's a big help you can file a bug to automate it (or even try implementing that yourself).

sparkprime · 2016-10-17T17:22:52Z

Renamed issue to capture the remaining part. I do support this change as it would cause more cacheing of imports. However it would not be backwards compatible so I'll have to think about whether the old API has to be kept around for a deprecation cycle.

sparkprime mentioned this issue Oct 4, 2016

Improve performance of repeated same file import #240

Merged

sparkprime changed the title ~~Cache imports~~ Import callback should separate relative path conversion from returning file content Oct 17, 2016

sparkprime added the enhancement label Oct 17, 2016

sparkprime mentioned this issue Feb 17, 2018

Import path should be opaque google/go-jsonnet#190

Closed

This was referenced Nov 20, 2023

stack-overflow exists in the function parse in parser.cpp #1116

Open

stack-overflow exists in the function maybeParseGreedy in parser.cpp #1117

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Import callback should separate relative path conversion from returning file content #239

Import callback should separate relative path conversion from returning file content #239

fare commented Oct 4, 2016

fare commented Oct 4, 2016

mikedanese commented Oct 4, 2016 •

edited

sparkprime commented Oct 4, 2016

fare commented Oct 4, 2016

sparkprime commented Oct 6, 2016

sparkprime commented Oct 17, 2016

Import callback should separate relative path conversion from returning file content #239

Import callback should separate relative path conversion from returning file content #239

Comments

fare commented Oct 4, 2016

fare commented Oct 4, 2016

mikedanese commented Oct 4, 2016 • edited

sparkprime commented Oct 4, 2016

fare commented Oct 4, 2016

sparkprime commented Oct 6, 2016

sparkprime commented Oct 17, 2016

mikedanese commented Oct 4, 2016 •

edited