-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RAIS/CAGED - microdata #1
Labels
Comments
Their data seems to have an encoding problem. Here is the error message when I try to read it in pandas: |
Try to enconde with 'latin1'. It usually solves this problem. And use
python 3 that has a better encoding system.
About the issues, it may be a good ideia to slipt them.
About the PDF, is it readable? If it is, there are good python libraries
and softwares to parse table data on pfs
On Sat, Mar 18, 2017, 6:05 PM João Marcos Gris ***@***.***> wrote:
Their data seems to have an encoding problem. Here is the error message
when I try to read it in pandas:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xea in position 27:
invalid continuation byte
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#1 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/ATCfVCdM8L0VWlPRCpbzTMkK9ZmKoHFtks5rnEcggaJpZM4MfN83>
.
--
João Carabetta / Data Developer
<https://htmlsig.com/t/000001CA95SE>
[image: Facebook] <https://htmlsig.com/t/000001CA95SE> [image: LinkedIn]
<https://htmlsig.com/t/000001CC9BV9> [image: Github]
<https://htmlsig.com/t/000001C5STHP>
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
dados
The text was updated successfully, but these errors were encountered: