Skip to content
Mike Caprio edited this page Oct 3, 2017 · 20 revisions

Convert PDFs of Evolutionary Trees into Data

Background

There is a large body of scientific literature that contains taxonomies in the form of tree structures. The paleontologists need the images of the trees contained in the PDFs converted into nested data.

Solutions

  • Machine vision, character recognition, turn a tree into a data structure
  • Create a crowdsourcing system to allow people to easily transcribe nested tree structures into the formats below

Resources