TensorFlow-2 implementation of Im2Latex deep learning model for latex code generation from images of mathematical expressions described in HarvardNLP paper "What You Get Is What You See: A Visual Markup Decompiler"
What You Get Is What You See: A Visual Markup Decompiler
Yuntian Deng, Anssi Kanervisto, and Alexander M. Rush
http://arxiv.org/pdf/1609.04938v1.pdf
This is a general-purpose, deep learning-based system to decompile an image into presentational markup. For example, we can infer the LaTeX or HTML source from a rendered image.
Source im2latex-100k dataset has been preprocessed and resized as suitable for the model. Download the data from this link and move to "images" folder before training.
Step to train and evaluate the model has been given in the notebooks "im2latex_train.ipynb" and "im2latex_test.ipynb" respectively
- Validation dataset (10340 images): 84.44%
- Test dataset (9340 images): 84.30%
- Tensorflow 2.8.0
- Numpy 1.21.6