
Metrics #12

Open
petermr opened this issue Jan 27, 2017 · 0 comments

petermr commented Jan 27, 2017

This issue will track our metrics. Please contribute your thoughts by replying to this issue and keep the theme restricted to Metrics.

Our intention is to assess the achievement of this project using blinded testing, in which the final evaluation keeps the methods and corpus secret from the developers.

There are several metrics which can be used: for some we can use the standard "recall + precision"; others may use "accuracy"; and yet others a "Likert-like" scale (L).
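As a concrete illustration of the first two metric families, the standard definitions could be sketched as below (a minimal sketch; the function names and counting scheme are my own, not part of the project):

```python
def precision(tp, fp):
    """Fraction of predicted items that are correct: tp / (tp + fp)."""
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(tp, fn):
    """Fraction of gold-standard items that were found: tp / (tp + fn)."""
    return tp / (tp + fn) if (tp + fn) else 0.0

def accuracy(tp, tn, fp, fn):
    """Fraction of all decisions (positive and negative) that are correct."""
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 0.0
```

A Likert-like scale (L) would instead be an ordinal human judgement (e.g. 1–5) and cannot be reduced to these counts.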

  1. Identification of tables in articles. This is formally out of scope - the software will be presented with the tables.
  2. Classification of table type. We may develop methods for detecting table type, but may also require that the tool be told it.
  3. Identification of sections (title, header, body, footer; optional and in any order). This will only be relevant to tables which humans agree have this structure.
  4. Title. L?
  5. Header structure. Identification of column names, and column trees.
  6. Header content. L? Will include wrapping, bleeding.
  7. Body structure. May include subtables, possibly guessed or possibly template-driven. Metrics on number of cells missed, or with corrupt content.
  8. Footer content. L?
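For item 7, one possible way to count cells missed or with corrupt content is to compare predicted cells against a gold standard as sets. This is a hypothetical sketch; the `(row, column, content)` tuple representation is an assumption, not the project's agreed format:

```python
def evaluate_cells(gold, predicted):
    """Compare a predicted set of table cells against a gold standard.

    Cells are (row, column, content) tuples; a cell counts as correct
    only if its position and content both match exactly, so a corrupt
    cell shows up as one miss (fn) plus one spurious cell (fp).
    """
    tp = len(gold & predicted)   # cells recovered exactly
    fp = len(predicted - gold)   # spurious or corrupt cells
    fn = len(gold - predicted)   # cells missed
    p = tp / (tp + fp) if (tp + fp) else 0.0
    r = tp / (tp + fn) if (tp + fn) else 0.0
    return {"precision": p, "recall": r, "missed": fn, "spurious": fp}
```

A stricter variant could score content similarity per cell instead of exact match, which would separate "missed" from "corrupt".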