Skip to content

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 20355: ordinal not in range(128) #10312

@yarikoptic

Description

@yarikoptic

While building a fresh package (out of snapshot of master at bc7d48f, since the last release had way too many failures) for Debian I am running into

======================================================================
ERROR: test_file_like (pandas.io.tests.test_html.TestReadHtml)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/tests/test_html.py", line 284, in test_file_like
    df1 = self.read_html(f, '.*Water.*')
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/tests/test_html.py", line 92, in read_html
    return read_html(*args, **kwargs)
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/html.py", line 865, in read_html
    parse_dates, tupleize_cols, thousands, attrs, encoding)
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/html.py", line 728, in _parse
    raise_with_traceback(retained)
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/compat/__init__.py", line 708, in raise_with_traceback
    raise exc.with_traceback(traceback)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 20355: ordinal not in range(128)

======================================================================
ERROR: test_string (pandas.io.tests.test_html.TestReadHtml)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/tests/test_html.py", line 275, in test_string
    data = f.read()
  File "/usr/lib/python3.4/encodings/ascii.py", line 26, in decode
    return codecs.ascii_decode(input, self.errors)[0]
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 20355: ordinal not in range(128)

======================================================================
ERROR: test_string_io (pandas.io.tests.test_html.TestReadHtml)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/tmp/buildd/pandas-0.16.1+git97-gbc7d48f/debian/tmp/usr/lib/python3/dist-packages/pandas/io/tests/test_html.py", line 264, in test_string_io
    data1 = StringIO(f.read())
  File "/usr/lib/python3.4/encodings/ascii.py", line 26, in decode
    return codecs.ascii_decode(input, self.errors)[0]
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 20355: ordinal not in range(128)

when building with Python 3.4.3+ . Locale is set to C while running tests

Metadata

Metadata

Assignees

No one assigned

    Labels

    Testingpandas testing functions or related to the test suiteUnicodeUnicode strings

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions