ContentMine Fork of the WWMM svg2xml Package
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
demos
src
.gitignore
.hgignore
.travis.yml
Book1.xlsx
GETTING_STARTED.txt
LICENSE.txt
README.md
README.txt
TABLE.md
TODO.txt
architecture.txt
figures.html
pom.xml
semanticDoc.txt

README.md

#SVG2XML

See README.txt for previous intro

AMI-tables

This package has major enhancements in 2016-11...2017-02 [onwards] due to the AMI-EPPI project. The goal is to extract HTML tables with high precision / recall. we assume the input SVG is the putput of PDF2SVG. Currently we assume per-page and per-table input. The examples in current development are tables already excised (snipped manually with Inkscape), so the problem is reduced to something known to be a table.

The details are in TABLE.md