Skip to content

Table is not extracted and some text order was wrong for this PDF  #138

@bbfrog

Description

@bbfrog

hi, I am testing converting this pdf to markdown text:
https://www.jacionline.org/action/showPdf?pii=S0091-6749%2822%2901181-2

There are two problems: the second problem is more important. Thanks!

  1. All tables in this PDF (Page 4, 6, 9) are not extracted as markdown format.
  2. The text order in page 4 is wrong: The right panel text (starts with "relationship. The multiple..") is before the left panel text (starts with "with CSU was investigated..."). Page 7 has the same problem.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions