Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Misdetects iso8859-1 as windows-1251 (cyrillic) #24

Open
kov opened this issue Mar 17, 2014 · 2 comments
Open

Misdetects iso8859-1 as windows-1251 (cyrillic) #24

kov opened this issue Mar 17, 2014 · 2 comments

Comments

@kov
Copy link

kov commented Mar 17, 2014

>>> chardet.detect('"ULTIMA ATUALIZACAO";"17/03/2014 04:01"\r\n"ANO";"MES";"SENADOR";"TIPO_DESPESA";"CNPJ_CPF";"FORNECEDOR";"DOCUMENTO";"DATA";"DETALHAMENTO";"VALOR_REEMBOLSADO"\r\n"2011";"1";"ACIR GURGACZ";"Aluguel de im\xf3veis para escrit\xf3rio pol\xedtico, compreendendo despesas concernentes a eles.";"05.914.650/0001-66";"CERON - CENTRAIS EL\xc9TRICAS DE ROND\xd4NIA S.A.";"45216633";"11/01/11";"";"47,65"\r\n"2011";"1";"ACIR GURGACZ";"Aluguel de im\xf3veis para escrit\xf3rio pol\xedtico, compreendendo despesas concernentes a eles.";"05.914.650/0001-66";"CERON - CENTRAIS EL\xc9TRICAS DE ROND\xd4NIA S.A.";"4542061";"18/01/11";"";"196,67"\r\n"2011";"1";"ACIR GURGACZ";"Aluguel de im\xf3veis para escrit\xf3rio pol\xedtico, compreendendo despesas concernentes a eles.";"004.948.028-63";"GILBERTO PISELO DO NASCIMENTO";"01";"12/01/11";"";"5000"\r\n"2011";"1";"ACIR GURGACZ";"Aluguel de im\xf3veis para escrit\xf3rio pol\xedtico, compreendendo despesas concernentes a eles.";"76.535.764/0001-43";"OI BRASIL TELECOM S.A.";"963011";"14/01/11";"";"480,59"\r\n"2011";"1";"ACIR GURGACZ";"Aquisi\xe7\xe3o de ma')
{'confidence': 0.99, 'encoding': 'windows-1251'}
@simonsmiley
Copy link

I'm affected by more or less the same bug, it misdetects it as MacCyrillic
If someone tells me how to go about this bug, I'm happy to try to debug it

@erickwilder
Copy link

Similar problem here too. Files from Germany being recognised as Russian.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants