Weird characters parsed from PDF #88

Closed
LCD344 opened this issue Nov 20, 2015 · 2 comments

LCD344 commented Nov 20, 2015

I've been trying to extract the text from a file like this one: https://pdonline.brisbane.qld.gov.au/masterviewui/cache/VvxFDshJTm.pdf

All the text I get back is weird "�" characters. I tried UTF-8 decoding, but it didn't work. Is there anything that can be done about this?

@davispuh

Try PR #257; maybe it's fixed there.

Contributor

Connum commented Sep 29, 2020

The linked PDF is no longer available, and Brisbane City Council seems to be so strict about who may access their data, and under which conditions, that I didn't even dare to accept their terms/disclaimer from abroad in order to get another PDF from there. 😅
Unless @LCD344 would like to provide a working PDF to test this 5 years on, I'd suggest closing this issue.

k00ni added the invalid label Sep 30, 2020
k00ni closed this as completed Sep 30, 2020