
How to Improve Table Extraction #87

@ehildebrandtrojo

I am working with a large set of historical tables and need to extract their rows and columns. I ran various layout models from the Model Zoo, but the only ones that give me interesting results are the HJDataset models. Even so, the model does not consistently identify the different features in the table (image attached). For instance, it detects a couple of rows/columns but not all of the ones present in the image (e.g. columns of numbers are detected well in the first table but not in subsequent tables).
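
For context, a run like the one described might look roughly like the sketch below. It is an assumption rather than the exact setup used here: the `faster_rcnn_R_50_FPN_3x` config and label map are the ones listed for HJDataset in the LayoutParser Model Zoo, while the 0.5 score threshold, the helper names `detect_layout` and `count_by_type`, and any image path passed in are hypothetical.

```python
from collections import Counter

# Label map listed for the HJDataset models in the LayoutParser Model Zoo.
HJ_LABEL_MAP = {
    1: "Page Frame",
    2: "Row",
    3: "Title Region",
    4: "Text Region",
    5: "Title",
    6: "Subtitle",
    7: "Other",
}


def detect_layout(image_path, score_thresh=0.5):
    """Run an HJDataset detector on one image (hypothetical helper).

    Needs layoutparser, detectron2 and opencv installed; the imports are
    kept local so the counting helper below works without them.
    """
    import cv2
    import layoutparser as lp

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    image = image[..., ::-1]  # OpenCV loads BGR; convert to RGB

    model = lp.Detectron2LayoutModel(
        "lp://HJDataset/faster_rcnn_R_50_FPN_3x/config",
        # score_thresh is an assumed value, not taken from the issue
        extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", score_thresh],
        label_map=HJ_LABEL_MAP,
    )
    return model.detect(image)


def count_by_type(blocks):
    """Count detected blocks per label (any objects with a .type attribute)."""
    return Counter(block.type for block in blocks)
```

Comparing the output of `count_by_type` across pages is one way to show where rows or columns are being missed, for example a high "Row" count on the first table and a much lower one on later tables.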

Any advice/suggestions on how to best proceed?

[Attached image: LayoutParserDetection]

Labels: bug
