Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AttributeError: 'module' object has no attribute 'read_pdf' #40

Closed
4 tasks
zlqs1985 opened this issue Jun 11, 2017 · 4 comments
Closed
4 tasks

AttributeError: 'module' object has no attribute 'read_pdf' #40

zlqs1985 opened this issue Jun 11, 2017 · 4 comments

Comments

@zlqs1985
Copy link

Summary of your issue

When importing the read_pdf method from tabula-py using
from tabula import read_pdf
as the example demonstrated
It shows the following error message
AttributeError: 'module' object has no attribute 'read_pdf'

Environment

anaconda python 2.1.12 + tabula 0.9.0
Write and check your environment.

  • python --version: ? anaconda python 2.1.12
  • java -version: ? java 1.8.0_111
  • OS and it's version: ? windows 10
  • Your PDF URL:

What did you do when you faced the problem?

//write here

Example code:

paste your core code

Output:

paste your output

What did you intend to be?

@chezou
Copy link
Owner

chezou commented Jun 11, 2017

Cloud uou put the whole error messages and appropriate output of python --version? In short, could you follow the issue template?

After filling the template in details, could you reopen it?

@chezou chezou closed this as completed Jun 11, 2017
@Wind1002
Copy link

Wind1002 commented Jul 11, 2017

Facing a similar issue. I'm attaching the code and the error below.

import tabula as tb
import pandas as pd

df = tb.read_pdf("path to pdf")
tb.convert_into("path to pdf","output.csv",output_format="csv")

Error

AttributeError Traceback (most recent call last)
in ()
1 #df = pd.DataFrame()
----> 2 df = tb.read_pdf("path to pdf")
3 tb.convert_into("path to pdf",output_format="csv")

AttributeError: module 'tabula' has no attribute 'read_pdf'

I need to extract the tables from the pdf. My python version is
Python 3.6.1 :: Anaconda 4.4.0 (64-bit)

java version "1.8.0_131"
Java(TM) SE Runtime Environment (build 1.8.0_131-b11)
Java HotSpot(TM) 64-Bit Server VM (build 25.131-b11, mixed mode)

@priyanshuchauhan
Copy link

@Wind1002


from tabula import wrapper
wrapper.convert_into("path to pdf","output.csv",output_format="csv")

@Wind1002
Copy link

Wind1002 commented Jul 18, 2017

Thanks for the help.
Got a new error.

File "C:\ProgramData\Anaconda3\lib\site-packages\tabula\util.py", line 7

^
SyntaxError: invalid syntax

Repository owner locked and limited conversation to collaborators Jul 18, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants