Architecture: Modify the AST directly #44

iccir · 2015-02-03T01:52:22Z

The oj compiler currently modifies lines of code directly (via the Modifier class), rather than operating on the AST. This is due to historical reasons: before source maps became widely supported, I relied on preserving line numbers between oj source and the generated js source to aid debugging.

Now that source maps are common, and escodegen supports them, the compiler should modify the oj AST directly: converting the oj AST into an ECMAScript 5 AST. We can then pass that AST into escodegen to generate the resulting js source.

We can also pass that AST into ESLint (see #40) or 6to5 (http://6to5.org) without having to go through a codegen/reparse pass. More importantly, we can expose the AST to clients and allow them to write custom modifiers before passing into escodegen.

The oj->TypeScript code generator will still be string-based, as the TypeScript compiler doesn't allow AST input, and there is no real specification for a TypeScript AST.

The long term plan:

modifier.js and generator.js are only used by the typechecker.
A new replacer.js (or transformer.js ?) is responsible for replacing/transforming the oj AST to a js AST.

Strategy:

Don't touch modifier.js and generator.js
Clone generator.js to replacer.js. Add debug option to use the replacer rather than the generator
Command line tool to spin up ojc with the generator, then with the replacer, and then compare the output
Ensure the outputs of our source base (and AL's source base) are functionally identical.
Remove ES5 output from generator

The text was updated successfully, but these errors were encountered:

iccir · 2015-02-03T02:18:05Z

This may take awhile, development will occur in the ast branch

iccir · 2015-10-21T10:02:05Z

One major issue here is that the TypeScript AST is not the same as the Esprima AST. Hence, we still would have to have generator.js around and maintain two very different code paths.

I think this is out of scope for 2.0.

IngwiePhoenix · 2015-10-21T11:07:10Z

What does the TS AST have to do with the Esprima JS AST? just to make sure I get this right.

iccir · 2015-10-22T00:32:05Z

There was a lot of frustration that occurred in the hours leading up to that comment :) None of this is infeasible from an implementation standpoint, it's an architectural question of "what's the simplest thing to do that accomplishes our goals" and "how much work should ojc do, how much can we leverage existing ES tools".

I'm hesitant on the long term plan of having two very different paths (one AST based for ES5/6 generation, one string based for type-checking). Right now, generator.js is used by both paths. Although, over time, the number of if (language === LanguageTypechecker) checks inside of generator has grown. Ideally, everything would be AST-based and use the ESTree specification. TypeScript would use an superset of the ESTree specification. transformer.js would be used for both paths.

There's nothing wrong with the current method of generation. It just results in string output, and that string output needs to be reparsed by Babel/Uglify/ESLint/JSHint/etc.

My original comment of: "More importantly, we can expose the AST to clients and allow them to write custom modifiers before passing into escodegen." is still true, but it's also possible to accomplish the same by sending the string output of ojc into Babel and writing a Babel transformer.

IngwiePhoenix · 2015-10-22T09:55:31Z

I see what you are getting at. In fact, many people and modules seem to use a similar technique. For instance, WebPack 2 knows that Babel emits ES6 and does some more reading to do dead-code stripping - but it doesn't really pick up the AST. So, the concept is similar.

So you used TypeScript for the Type Checker?

iccir · 2015-10-22T23:39:25Z

Yep, generator.js will spit out TypeScript for the Type Checker. Note that this TypeScript doesn't actually work (don't run the output of tsc), it's just used to get warnings/type hints.

iccir · 2015-10-22T23:41:42Z

Another issue with the AST approach: JSHint integration relies on its input being a string, as JSHint uses it's own JS parser.

iccir · 2015-10-24T08:09:15Z

Modified strategy:

transformer.js is used for both the ES5 and TypeScript generation.
The output TypeScript AST will have a few non-ESTree-standard node types, for dealing with type annotations and casts.
The TypeScript AST will be handed into escodegen, which will output a JS string and a sourcemap.
We'll use this sourcemap to map errors from tsc back to the original line number.

IngwiePhoenix · 2015-10-24T08:36:41Z

Sounds solid to me. But have you taken ES6 into account, as in, how big will the efford be to change the ES5 handling to be ES6 capable? Otherwise there doesn't seem to be any problem.

iccir · 2015-10-24T08:37:58Z

The beauty of this (combined with procrastination) is that TypeScript/escodegen/ESTree/estraverse all handle ES6 now :)

IngwiePhoenix · 2015-10-24T09:27:33Z

Okay, that super-long name-chain convinced me ;)
But hey, that’s quite the good news! Can’t wait to get started using OJ with ES6.

iccir · 2015-10-27T01:10:46Z

Sadly, escodegen with source maps enabled is incredibly slow (~2 seconds to generate our source bases, compared to ~200ms with 1.1's generator/modifier). That's going to be a deal-breaker for us (at least for now).

The problem may be with source-map rather than escodegen (or specifically, escodegen's approach of wrapping the Array IR with source-map's SourceNode and then having source-map generate the string). I should recheck this once escodegen implements their Dumper (estools/escodegen#214).

iccir · 2015-10-27T01:28:15Z

(Note: this shouldn't prevent you from using ES6 with OJ. Using the AST vs. using the modifier/compiler has always been about avoiding additional parse/generate steps. Right now you can still use ES6 in the 2.0-wip branch, and you can also pass the result to Babel to transpile back to ES5).

IngwiePhoenix · 2015-10-27T01:33:20Z

I see. Well, I'd rather wait for a stable 2.0 release so I can update my oj-loader or the extended version, OhSoJuicy with ES6 support.

But I will get to test oj-2.0-alpha soon to see how it'll go. :)

iccir · 2015-10-27T01:37:30Z

Without this, I can't think of any reason why I would need to bump semver major. So the next release will probably be 1.2 (with Esprima 2.7 et al.)

IngwiePhoenix · 2015-10-27T01:46:30Z

That works for me. :) Ill I wish is to just make sure that when I update my modules with the new support, that I don't run into too many bugs :)

iccir · 2018-08-07T01:58:40Z

I've been pondering this issue again recently. There have been speed improvements to the source-map project. We could also use babel-generator rather than escodegen.

However, one glaring issue is that oj has to output Typescript in addition to JS. The current string-based solution of Modifier.js allows us to easily do this.

iccir self-assigned this Feb 3, 2015

iccir mentioned this issue Feb 3, 2015

Transpile 'let' to ES3 'var' #39

Closed

iccir mentioned this issue Feb 17, 2015

Add support for compiling files containing JSX #45

Open

iccir modified the milestone: 2.0 Aug 11, 2015

iccir removed this from the 2.0 milestone Oct 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Architecture: Modify the AST directly #44

Architecture: Modify the AST directly #44

iccir commented Feb 3, 2015

iccir commented Feb 3, 2015

iccir commented Oct 21, 2015

IngwiePhoenix commented Oct 21, 2015

iccir commented Oct 22, 2015

IngwiePhoenix commented Oct 22, 2015

iccir commented Oct 22, 2015

iccir commented Oct 22, 2015

iccir commented Oct 24, 2015

IngwiePhoenix commented Oct 24, 2015

iccir commented Oct 24, 2015

IngwiePhoenix commented Oct 24, 2015

iccir commented Oct 27, 2015

iccir commented Oct 27, 2015

IngwiePhoenix commented Oct 27, 2015

iccir commented Oct 27, 2015

IngwiePhoenix commented Oct 27, 2015

iccir commented Aug 7, 2018

Architecture: Modify the AST directly #44

Architecture: Modify the AST directly #44

Comments

iccir commented Feb 3, 2015

iccir commented Feb 3, 2015

iccir commented Oct 21, 2015

IngwiePhoenix commented Oct 21, 2015

iccir commented Oct 22, 2015

IngwiePhoenix commented Oct 22, 2015

iccir commented Oct 22, 2015

iccir commented Oct 22, 2015

iccir commented Oct 24, 2015

IngwiePhoenix commented Oct 24, 2015

iccir commented Oct 24, 2015

IngwiePhoenix commented Oct 24, 2015

iccir commented Oct 27, 2015

iccir commented Oct 27, 2015

IngwiePhoenix commented Oct 27, 2015

iccir commented Oct 27, 2015

IngwiePhoenix commented Oct 27, 2015

iccir commented Aug 7, 2018