Can't read the embedded font #31
Comments
It seems you're hitting this bug. The master branch of tabula-java has already been upgraded to PDFBox 2.0.0, so it should be fixed after the next tabula-java release.
Thanks. I googled it too. The package works fine despite that bug. Thank you so much for your package; it has saved me a ton of time.
As I mentioned in #27, we can now set Java options for tabula-py.
Thank you - the large file works with java_options=["-Xmx256m"]
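For anyone landing here later, here is a minimal sketch of passing a JVM heap limit through java_options, as in the comment above. The helper names heap_option and read_large_pdf are hypothetical, and 256m is only the value reported to work in this thread; a different file may need a different limit.

```python
def heap_option(size):
    """Build the JVM max-heap flag, e.g. "256m" -> "-Xmx256m"."""
    return "-Xmx{}".format(size)


def read_large_pdf(path, heap="256m", **kwargs):
    """Read tables from a PDF with an explicit JVM heap limit.

    Hypothetical wrapper; assumes tabula-py and Java are installed.
    """
    # Imported lazily so heap_option can be used without tabula-py present.
    import tabula

    return tabula.read_pdf(path, java_options=[heap_option(heap)], **kwargs)
```

Calling read_large_pdf("report.pdf") would then be roughly equivalent to tabula.read_pdf("report.pdf", java_options=["-Xmx256m"]); the file name is a placeholder.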
@GGPay I released tabula-py v1.0.0. Could you try it?
Hi
I've got a problem when I try to read one of my PDFs. Can you take a look and tell me where I went wrong?
python --version: 2.7
java:
Your PDF URL: https://drive.google.com/file/d/0B0MZAdjMKP0Sbjcyc3Y3RDVMNlk/view?usp=sharing
OS and its version: Windows
Output:
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Can't read the embedded font Arial-BoldMT
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Using font Arial Bold instead
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Can't read the embedded font ArialMT
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Using font Arial instead
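The messages above are logged at INFO level and come in pairs: an embedded font that could not be read, followed by the font used in its place. As a rough sketch (not part of tabula-py; font_substitutions is a hypothetical helper, and the patterns assume the exact message wording shown in this log), the pairs can be pulled out of captured output like this:

```python
import re

# Patterns for the two INFO messages shown in the log above.
MISSING = re.compile(r"INFO: Can't read the embedded font (.+)")
FALLBACK = re.compile(r"INFO: Using font (.+) instead")


def font_substitutions(log_text):
    """Return (embedded_font, fallback_font) pairs found in the log text."""
    pairs = []
    pending = None  # embedded font still waiting for its fallback line
    for line in log_text.splitlines():
        line = line.strip()
        missing = MISSING.match(line)
        if missing:
            pending = missing.group(1)
            continue
        fallback = FALLBACK.match(line)
        if fallback and pending is not None:
            pairs.append((pending, fallback.group(1)))
            pending = None
    return pairs


sample_log = """\
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Can't read the embedded font Arial-BoldMT
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Using font Arial Bold instead
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Can't read the embedded font ArialMT
May 18, 2017 2:56:53 PM org.apache.pdfbox.pdmodel.font.PDCIDFontType2Font getawtFont
INFO: Using font Arial instead
"""

substitutions = font_substitutions(sample_log)
```

For the log in this issue, that gives Arial-BoldMT replaced by Arial Bold, and ArialMT replaced by Arial.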