move PNG generation out of mathjax-node #205

pkra · 2016-04-11T07:58:01Z

As per F2F, the PNG generation will be moved out of mathjax-node; there will be alternative modules to use.

Cf. the discussion in #174; see also #191.

pkra · 2016-04-14T09:10:06Z

From F2F: we will be adding width and height to the returned data (not just as an option).

pkra · 2016-04-14T09:30:22Z

While working on this, I've been wondering about structure of mj-single and the SVG modifications.

SVG modifications in mj-single
- add svg namespace https://github.com/mathjax/MathJax-node/blob/master/lib/mj-single.js#L570
- prettify and add xlink https://github.com/mathjax/MathJax-node/blob/master/lib/mj-single.js#L588-L593
- make a file version https://github.com/mathjax/MathJax-node/blob/master/lib/mj-single.js#L594-L601
- create an img tag https://github.com/mathjax/MathJax-node/blob/master/lib/mj-single.js#L608-L618

The last two seem like they should be dropped alongside the PNG (since they only come into play there). Any thoughts?

I'm also wondering if we should expose both the "HTML5" svg data alongside the namespaced version. Would anyone benefit from having the "HTML5" svg data?

pkra · 2016-04-15T08:16:47Z

I'm also wondering if we should expose both the "HTML5" svg data alongside the namespaced version. Would anyone benefit from having the "HTML5" svg data?

Or perhaps we should have a more general "standalone (prefixed?)" option affecting both MathML and SVG (and perhaps even CommonHTML?) output. Or maybe I'm just overthinking this 😉

dpvc · 2016-04-17T17:25:04Z

The <img> tag is actually for an SVG file, not a PNG file, so the file version of the SVG is used for that as well as the PNG file. But perhaps you don't want to produce the SVG <img> tag any longer and want to push that to the surrounding tool.

One of the difficulties of pushing this off to the application is the copying of the style attributes from the <svg> to the <img> tag. If we are returning just a string for the svg (as we do now), then it is harder to get the style using string manipulation rather than DOM element's methods, and that is less reliable. But if we return the DOM element rather than a string, then that means the application is now responsible for prettying it up as we currently do.

One solution might be to provide service routines to do some of this so that the application can call them as needed.

dpvc · 2016-04-17T17:28:34Z

I'm also wondering if we should expose both the "HTML5" svg data alongside the namespaced version. Would anyone benefit from having the "HTML5" svg data?

I'm not sure I understand what you are suggesting. Can you be more clear about the difference you have in mind?

Or perhaps we should have a more general "standalone (prefixed?)" option affecting both MathML and SVG (and perhaps even CommonHTML?) output.

Again, I'm not clear on what you are suggesting. Can you give more details about what is included (and not included) in each form?

pkra · 2016-04-18T09:18:01Z

The tag is actually for an SVG file, not a PNG file, so the file version of the SVG is used for that as well as the PNG file.

Right.

But perhaps you don't want to produce the SVG <img> tag any longer and want to push that to the surrounding tool.

I think creating an image tag is something that's better done outside mathjax-node. As a developer, I would frequently want to modify the img tag (add e.g., classes, data-attributes, wrapping elements).

One of the difficulties of pushing this off to the application is the copying of the style attributes from the to the

Right. Which is why I add the dimensions and styles alongside the SVG in the result.

One solution might be to provide service routines to do some of this so that the application can call them as needed.

Right. As we discussed on the last F2F, maybe exposing the SVG as an (htmlparser2?) object is useful more generally.

I'm not sure I understand what you are suggesting. Can you be more clear about the difference you have in mind?

The SVG data before/after adding namespaces and xlink prefixes (and prettying).

Again, I'm not clear on what you are suggesting. Can you give more details about what is included (and not included) in each form?

I don't really want any of this as part of mathjax-node but for completeness: just like the current code makes the SVG usable as a standalone document, it could do the same for MathML and HTML.

dpvc · 2016-04-18T12:55:44Z

I think creating an image tag is something that's better done outside mathjax-node. As a developer, I would frequently want to modify the img tag (add e.g., classes, data-attributes, wrapping elements).

If course, if we returned a DOM element (like we are considering for the SVG element itself), it would be easy to do those things.

Which is why I add the dimensions and styles alongside the SVG in the result.

I had forgotten that you included the CSS text. Of course, if we return the SVG DOM element, then it is easy to get those values from the node itself, and so they may not need to be added to the result in that case.

The SVG data before/after adding namespaces and xlink prefixes (and prettying).

I'm more and inclined to think that the DOM node is the thing that should be returned. Then the outer application can serialize it as they want, and we can provide functions to call that do the prettifying and xlink adjustments if we want.

just like the current code makes the SVG usable as a standalone document, it could do the same for MathML and HTML.

Do you mean adding the <?xml version="1.0" standalone="no"?> and <!DOCTYPE ...> to the top of the file? And for the HTML output, adding an <html>, <head>, and <body> tags (with the CSS in a <style> element in the head)?

Again, it seems like service routines to do this given a result object might be the way, rather than returning two forms based on more flags.

pkra · 2016-04-18T13:51:32Z

I'm more and inclined to think that the DOM node is the thing that should be returned.

Me too but I don't know what would count as a good DOM object in nodejs land. Of course for now we're stuck with jsdom (which I think means parse5).

We should research a few libraries, I guess (also for MathJax v3). Good starting points might be parse5 (jsdom) but also htmlparser2 (e.g., used by cheerio).

The only downside I can see is that an application using mathjax-node would need a minimal amount of understanding over the relevant structure (e.g., so as to serialize it).

Do you mean adding [...]

Pretty much.

[...] it seems like service routines to do this given a result object might be the wa

I second that.

pkra · 2016-04-20T10:17:18Z

I've moved the question of returning an object to #219.

So from my questions, this leaves only the question regarding the xlink prefixes added in https://github.com/mathjax/MathJax-node/blob/master/lib/mj-single.js#L587-L593.

I'm not sure if they're still needed (or sensible, we don't add the namespace, do we?) but this is more a 1/10 for me.

So unless we change that, the PR is ready.

pkra · 2016-04-29T07:00:52Z

Blargh. I had posted the mock module on the wrong issues so @dpvc left his thorough review there as well. Check #206, comments 2--4. Will repost when I come back to this.

pkra · 2016-05-25T08:47:00Z

So here's my simple example (erroneously posted to #206) addressing (I hope) the comments from @dpvc (from that thread). Again, this is meant to be a simple example for people to build on, not a complete replacement for the batik integration. For example, I could easily imagine people wanting to increase the dimensions of the png (svg2png doesn't have any settings so you'd have to increase the dimensions manually which is somewhat tricky since MathJax provides ex values).

Anyway, I hope this might be a sufficient example for now.

var mjAPI = require("mj-single.js");
var svg2png = require('svg2png');
mjAPI.start();

function createPNG(result, callback){
  var sourceBuffer = new Buffer(result.svg, "utf-8");
  svg2png(sourceBuffer).then(function(buffer){
    result.png = "data:image/png;base64," + buffer.toString('base64');
    return callback(result);
  })
};

exports.math2png = function(options, callback){
  var mjConfig = options.config;
  var typesetOptions = options.typeset;

  // make sure SVG output will be generated and disable mml and html
  typesetOptions.svg = true;
  typesetOptions.mml = false;
  typesetOptions.html = false;

  mjAPI.config(mjConfig);
  mjAPI.typeset(typesetOptions, function(result){
    if (result.errors) return result.errors;
    createPNG(result, callback);
  });
}

Usage:

var math2png = require("math2png.js").math2png;
var options = {
  config: {
  },
  typeset: {
    math: "x^2",
    format: "TeX"
  },
  png: {}
}
math2png(options, function(result){
  console.log(result.png);
  console.log("style='" + result.style + "width:" + result.width + "; height:" + result.height +";'");
});

Note: this depends on the changes from PR #213 (width and height).

dpvc · 2016-05-27T14:09:27Z

A couple of small comments, here.

Some configuration options for mjAPI.config() must be set before mjAPI.start() is called (e.g., the MathJax, extensions, and fontURL options), so you will not be able to specify these options in your setup, since mjAPI.start() is performed immediately when the module is loaded. Since mjAPI.typeset() will perform a start() automatically, you could just leave out the start() all together, which would allow you to configure all the options. Note, however, that once the first math2png() is called, you will not be able to change those options. It might be better not to pass config options to math2png but rather export another function for configuring the module.

Your options variable has a png block, but that is never used in your code. Perhaps you have that for future use?

In math2png, when you set typesetOptions.*, you are not just changing the local version, but also the original object passed in by the user. So your routine modifies the user's object. That is not an expected result of calling your function, so it might be better to make a copy of the typeset options that were passed to you instead, and then modify that with your values for svg, mml, etc.

In your callback to the typeset() call, if there are errors, you return result.errors. This does nothing, however, since the return value of the typeset() callback is never used (this was in partially corrected code in my point 5, but I pointed out that this was not useful in point 7). Furthermore, this means the user's callback will never get called; but since the user is relying on the callback for his own synchronization with your code, that is probably a bad idea. You probably want

    if (result.errors) {
      callback(result);
    } else {
      createPNG(result, callback);
    }

and let the user check for result.errors (or for result.png being null). Or you could do callback({errors: result.errors}) if you won't want to give the full results object.

Alternatively, your callback could be passed a success/failure value in addition to the results object. Or you could require two callbacks, one for success and one for failure.

pkra · 2016-05-30T08:08:15Z

Thanks for the comments!

First off, let me move the code to a gist at https://gist.github.com/pkra/c60098af5c1d8c37473416caad0418f6

Some configuration options for mjAPI.config() must be set before mjAPI.start() is called (e.g., the MathJax, extensions, and fontURL options), so you will not be able to specify these options in your setup, since mjAPI.start() is performed immediately when the module is loaded. Since mjAPI.typeset() will perform a start() automatically, you could just leave out the start() all together, which would allow you to configure all the options. Note, however, that once the first math2png() is called, you will not be able to change those options. It might be better not to pass config options to math2png but rather export another function for configuring the module.

I guess I wasn't really aware that some options will "stick" (though it was clear as soon as I read it). Questions:

Are MathJax, extensions, and fontURL the only relevant options here? (I'd like to start documenting things a bit better.)
It seems to me we might want to add a reset (and stop) method for these for v1.0. What do you think?

Your options variable has a png block, but that is never used in your code. Perhaps you have that for future use?

No, it was just a copied from an older version. (Though I expect that svg-to-png converters might want those or a more advanced module that allows scaling would need options like this.)

it might be better to make a copy of the typeset options that were passed to you instead,

Ouch; fixed.

this does nothing, however, since the return value of the typeset() callback is never used (this was in partially corrected code in my point 5, but I pointed out that this was not useful in point 7)

Another Ouch -- fixed. Sorry for missing that among the earlier comments.

dpvc · 2016-06-22T18:13:14Z

==> Merged.

pkra · 2016-11-22T08:59:17Z

I've created another wrapper based on svg2png that can be used as a drop-in for mathjax-node to generate PNG content.

federicosan · 2020-01-09T20:45:42Z

Hi, I am having trouble getting the SVG output from MathJax-node to be included into pdf files with pdfmake that uses svg-to-PDFKIT, so I thought of making the SVG output into a png, I've noticed MathJax-node does not support this anymore, could you please point out how this could be done? Would I need to pipe MathJax-node's result into some other function that converts it into PNG, would you know any that would work with MathJax-node's outputs? Thank you

dpvc · 2020-01-09T21:18:20Z

@federicosan, the link in the comment just above yours gives the sag-to-png functionality.

Alternatively, the issue you are probably facing is the <use> elements in the svg output. Try using the --nocache option to prevent the inclusion of <use> elements. That worked for me in the pdfmake playground.

pkra added the Feature Request label Apr 11, 2016

pkra added this to the v1.0 milestone Apr 11, 2016

This was referenced Apr 11, 2016

[discussion] major changes #191

Closed

[WIP] Modularization of PNG generation #174

Closed

Allow change of svg-to-PNG converter [was: Why not use "svg2png" , pure node without JAVA depend] #59

Closed

pkra mentioned this issue Apr 15, 2016

Remove support for PNG generation #213

Merged

pkra mentioned this issue Apr 20, 2016

return an object not a serialization #219

Closed

pkra mentioned this issue Apr 29, 2016

move mj-page out of mathjax-node #206

Closed

pkra mentioned this issue May 25, 2016

[discussion] re-organizing the examples in /bin #208

Closed

pkra added the Ready for Review label May 26, 2016

pkra modified the milestones: v1.0, What comes next May 26, 2016

pkra mentioned this issue May 26, 2016

[mj-single] passing data through to the result object #239

Closed

dpvc added the Fixed label Jun 22, 2016

dpvc removed the Ready for Review label Jun 22, 2016

dpvc closed this as completed Jun 22, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

move PNG generation out of mathjax-node #205

move PNG generation out of mathjax-node #205

pkra commented Apr 11, 2016

pkra commented Apr 14, 2016

pkra commented Apr 14, 2016

pkra commented Apr 15, 2016

dpvc commented Apr 17, 2016 •

edited

dpvc commented Apr 17, 2016

pkra commented Apr 18, 2016 •

edited by dpvc

dpvc commented Apr 18, 2016

pkra commented Apr 18, 2016

pkra commented Apr 20, 2016

pkra commented Apr 29, 2016

pkra commented May 25, 2016

dpvc commented May 27, 2016

pkra commented May 30, 2016

dpvc commented Jun 22, 2016

pkra commented Nov 22, 2016

federicosan commented Jan 9, 2020

dpvc commented Jan 9, 2020

move PNG generation out of mathjax-node #205

move PNG generation out of mathjax-node #205

Comments

pkra commented Apr 11, 2016

pkra commented Apr 14, 2016

pkra commented Apr 14, 2016

pkra commented Apr 15, 2016

dpvc commented Apr 17, 2016 • edited

dpvc commented Apr 17, 2016

pkra commented Apr 18, 2016 • edited by dpvc

dpvc commented Apr 18, 2016

pkra commented Apr 18, 2016

pkra commented Apr 20, 2016

pkra commented Apr 29, 2016

pkra commented May 25, 2016

dpvc commented May 27, 2016

pkra commented May 30, 2016

dpvc commented Jun 22, 2016

pkra commented Nov 22, 2016

federicosan commented Jan 9, 2020

dpvc commented Jan 9, 2020

dpvc commented Apr 17, 2016 •

edited

pkra commented Apr 18, 2016 •

edited by dpvc