Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tabula was not read pdf data correctly issue #73

Closed
3 tasks done
mahendranatarajan opened this issue Jan 8, 2018 · 2 comments
Closed
3 tasks done

Tabula was not read pdf data correctly issue #73

mahendranatarajan opened this issue Jan 8, 2018 · 2 comments

Comments

@mahendranatarajan
Copy link

mahendranatarajan commented Jan 8, 2018

Summary of your issue

Hi all,
I am using tabula package for reading data from pdf by using readpdf function. in the function the tabula was wrongly read the pdf while in first page table was continues in second page.How can i fix that issue.

Environment

Write and check your environment.

  • Python 2.7.14
  • java 9.0.1
  • Ubuntu 17.04

What did you do when you faced the problem?

In the following pdf the first page has a 0 to 13 columns ,tabula reads all the columns coorectly

pdf1

in page 2 pdf also has a 0 to 13 columns but tabula does not read the data correctly

pdf2

first page output of column 0 to 6

first_table_columns

first page output of column 7 to 13

first_table_columns_

in second page table continution the tabula read only 0 to 6 only !!! what can i do please help me to fix that issue...

next_page_same_table_less_columns

@mahendranatarajan mahendranatarajan changed the title Tabula readpdf output was not read pdf correctly Tabula was not read pdf data correctly issue Jan 8, 2018
@chezou
Copy link
Owner

chezou commented Jan 8, 2018

Please DO NOT IGNORE the issue template. Unfortunately, I can't see your code, and without knowing that, I can say nothing.

One last thing before I close is 1) set guess=False, lattice=True option or 2) use tabula-java directly.
See also: #58 (comment)

If you open further issues without complying the template, I will close without comments.

@chezou chezou closed this as completed Jan 8, 2018
@mahendranatarajan
Copy link
Author

mahendranatarajan commented Jan 8, 2018

@chezou why are u closed this issue ? still i cant get the solution for this issue...???

Repository owner locked as too heated and limited conversation to collaborators Jan 8, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants