MathML Unificator – Generalizes mathematical formulae for structural unification

MathML Unificator is a tool that performs structural unification of MathML formulae, as proposed by Růžička, Sojka, and Líška (2016).

Usage

File encoding on Windows

On Windows, the default file encoding is a single-byte encoding that depends on the system language. To make sure the JVM uses UTF-8, start it with the command-line argument -Dfile.encoding=UTF-8 as follows:

java -Dfile.encoding=UTF-8 -jar mathml-unificator.jar

Be aware, however, that the default Windows command-line shell handles Unicode poorly in its default configuration. Try the Lucida Console font together with an appropriate code page setting for the shell (for example, chcp 65001 selects UTF-8).
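To see why the encoding matters, the following minimal sketch checks which charset a JVM actually uses and whether the placeholder symbol ◍ (U+25CD) from the unified output survives encoding. The class EncodingCheck and its methods are hypothetical names used for illustration only; they are not part of MathML Unificator, and ISO-8859-1 merely stands in for a typical single-byte encoding.

```java
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of MathML Unificator.
public class EncodingCheck {

    /** Returns true when the JVM default charset is UTF-8. */
    public static boolean defaultIsUtf8() {
        return StandardCharsets.UTF_8.equals(Charset.defaultCharset());
    }

    /** Encodes the text with the given charset and checks that decoding restores it. */
    public static boolean survivesRoundTrip(String text, Charset charset) {
        byte[] bytes = text.getBytes(charset); // unmappable characters are replaced
        return text.equals(new String(bytes, charset));
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        System.out.println("file.encoding   = " + System.getProperty("file.encoding"));
        System.out.println("default charset = " + Charset.defaultCharset());
        System.out.println("default is UTF-8: " + defaultIsUtf8());

        // The placeholder symbol cannot be represented in a single-byte encoding
        // such as ISO-8859-1, but it can be represented in UTF-8.
        String placeholder = "\u25CD";
        System.out.println("ISO-8859-1 round trip: "
                + survivesRoundTrip(placeholder, StandardCharsets.ISO_8859_1));
        System.out.println("UTF-8 round trip:      "
                + survivesRoundTrip(placeholder, StandardCharsets.UTF_8));

        // A stream that always writes UTF-8, regardless of the platform default.
        PrintStream out = new PrintStream(System.out, true, "UTF-8");
        out.println("placeholder symbol: " + placeholder);
    }
}
```

Running the class once with and once without -Dfile.encoding=UTF-8 shows whether the argument changes the reported charset on a given machine. Whether the symbol is displayed correctly still depends on the shell font and code page.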

Example

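Executing the following command derives a series of four increasingly general formulae from an example formula. The output contains the original formula followed by the four generalizations, each marked with its uni:unification-level attribute; the symbol ◍ stands in for the parts that have been generalized: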

$ java -jar mathml-unificator.jar -p sample-data/single-formula.xml
<?xml version="1.0" encoding="UTF-8"?>
<unified-math xmlns="http://mir.fi.muni.cz/mathml-unification/">
    <math xmlns="http://www.w3.org/1998/Math/MathML">
        <msup>
            <mi>a</mi>
            <mn>2</mn>
        </msup>
        <mo>+</mo>
        <mfrac>
            <msqrt>
                <mi>b</mi>
            </msqrt>
            <mi>c</mi>
        </mfrac>
    </math>
    <math xmlns:uni="http://mir.fi.muni.cz/mathml-unification/"
        uni:unification-level="1" uni:unification-max-level="4" xmlns="http://www.w3.org/1998/Math/MathML">
        <msup>
            <mi>a</mi>
            <mn>2</mn>
        </msup>
        <mo>+</mo>
        <mfrac>
            <msqrt>
                <mi>◍</mi>
            </msqrt>
            <mi>c</mi>
        </mfrac>
    </math>
    <math xmlns:uni="http://mir.fi.muni.cz/mathml-unification/"
        uni:unification-level="2" uni:unification-max-level="4" xmlns="http://www.w3.org/1998/Math/MathML">
        <msup>
            <mi>◍</mi>
            <mi>◍</mi>
        </msup>
        <mo>+</mo>
        <mfrac>
            <mi>◍</mi>
            <mi>◍</mi>
        </mfrac>
    </math>
    <math xmlns:uni="http://mir.fi.muni.cz/mathml-unification/"
        uni:unification-level="3" uni:unification-max-level="4" xmlns="http://www.w3.org/1998/Math/MathML">
        <mi>◍</mi>
        <mo>+</mo>
        <mi>◍</mi>
    </math>
    <math xmlns:uni="http://mir.fi.muni.cz/mathml-unification/"
        uni:unification-level="4" uni:unification-max-level="4" xmlns="http://www.w3.org/1998/Math/MathML">
        <mi>◍</mi>
        <mo>◍</mo>
        <mi>◍</mi>
    </math>
</unified-math>
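Downstream code may want to read this output back. The following sketch, which uses only the standard Java DOM API, collects the unification level of every math element in a unified-math document and treats a formula without the attribute (the original one) as level 0. The class name UnifiedMathReader and the level-0 convention are assumptions made for this illustration, not part of MathML Unificator.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical reader for the output shown above, not part of MathML Unificator.
public class UnifiedMathReader {

    static final String MATHML_NS = "http://www.w3.org/1998/Math/MathML";
    static final String UNI_NS = "http://mir.fi.muni.cz/mathml-unification/";

    /** Returns the unification level of each math element, in document order. */
    public static List<Integer> readLevels(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true); // required for the *NS lookups below
            DocumentBuilder builder = factory.newDocumentBuilder();
            Document doc = builder.parse(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            NodeList maths = doc.getElementsByTagNameNS(MATHML_NS, "math");
            List<Integer> levels = new ArrayList<>();
            for (int i = 0; i < maths.getLength(); i++) {
                Element math = (Element) maths.item(i);
                // getAttributeNS returns an empty string when the attribute is absent.
                String level = math.getAttributeNS(UNI_NS, "unification-level");
                levels.add(level.isEmpty() ? 0 : Integer.parseInt(level));
            }
            return levels;
        } catch (Exception e) {
            throw new IllegalStateException("Cannot read unified-math document", e);
        }
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the path to a file holding the saved tool output.
        String xml = new String(Files.readAllBytes(Paths.get(args[0])), StandardCharsets.UTF_8);
        System.out.println("unification levels: " + readLevels(xml));
    }
}
```

The parser is made namespace-aware because the math elements and the uni: attributes live in different namespaces, so looking them up by namespace URI avoids depending on the particular prefixes used in the output.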

Citing MathML Unificator

Text

RŮŽIČKA, Michal, Petr SOJKA and Martin LÍŠKA. Math Indexer and Searcher under the Hood: Fine-Tuning Query Expansion and Unification Strategies. In Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies. Tokyo: National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda-ku, Tokyo 101-8430 Japan, 2016. 7 pp.

BibTeX

@inproceedings{RuzickaSojkaLiska16Math,
     author = "Michal R\r{u}\v{z}i\v{c}ka and Petr Sojka and Martin L{\'\i}\v{s}ka",
      title = "{Math Indexer and Searcher under the Hood: Fine-tuning Query
                Expansion and Unification Strategies.}",
  booktitle = "{Proceedings of the 12th NTCIR Conference on Evaluation of
                Information Access Technologies}",
     editor = "{Noriko Kando et al.}",
      pages = "331--337",
       year = 2016,
}