Skip to content
This repository has been archived by the owner on Apr 15, 2024. It is now read-only.

Unable to differentiate between newline and wrapped text for a table in pdf #320

Open
haritas-crest opened this issue Aug 25, 2022 · 1 comment

Comments

@haritas-crest
Copy link

haritas-crest commented Aug 25, 2022

There are 2 different hashes present in attached pdf file but while parsing, PDF Miner separates both a new line and wrapped hash text with ‘\n’ which makes it difficult to handle while extracting hashes from a file.

@haritas-crest
Copy link
Author

haritas-crest commented Aug 26, 2022

Attaching pdf file
updated_Hash_Test.pdf

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant