encoding error in transform #412

@chinyeungli

Traceback (most recent call last):
  File "C:\Users\thoma\git\aboutcode-toolkit\bin\about-script.py", line 11, in <module>
    load_entry_point('aboutcode-toolkit', 'console_scripts', 'about')()
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\click\core.py", line 722, in __call__
    return self.main(*args, **kwargs)
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\click\core.py", line 697, in main
    rv = self.invoke(ctx)
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\click\core.py", line 1066, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\click\core.py", line 895, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\click\core.py", line 535, in invoke
    return callback(*args, **kwargs)
  File "c:\users\thoma\git\aboutcode-toolkit\src\attributecode\cmd.py", line 460, in transform
    errors = transform_csv_to_csv(location, output, transformer)
  File "c:\users\thoma\git\aboutcode-toolkit\src\attributecode\transform.py", line 50, in transform_csv_to_csv
    column_names, data, errors = transform_data(rows, transformer)
  File "c:\users\thoma\git\aboutcode-toolkit\src\attributecode\transform.py", line 72, in transform_data
    column_names = next(rows)
  File "c:\users\thoma\git\aboutcode-toolkit\src\attributecode\transform.py", line 266, in read_csv_rows
    for row in reader:
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\site-packages\backports\csv.py", line 394, in __next__
    lineobj = next(self.input_iter)
  File "C:\Users\thoma\git\aboutcode-toolkit\lib\codecs.py", line 314, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xa9 in position 892: invalid start byte

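Running the transform command on my CSV fails with the encoding error above.
It is triggered by special characters such as ¥ and © in the input file. The file does not appear to be valid UTF-8 (byte 0xa9 is © in Latin-1 and Windows-1252), but it is being decoded as UTF-8.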
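
For illustration, here is a minimal sketch that reproduces the same kind of failure and shows one possible workaround. It assumes the input CSV was saved as Windows-1252, which the traceback does not confirm, and `read_csv_rows_with_fallback` is a hypothetical helper, not a function from aboutcode-toolkit.

```python
import csv
import io


def read_csv_rows_with_fallback(raw_bytes, encodings=("utf-8", "cp1252")):
    """Hypothetical helper: decode CSV bytes by trying each encoding in order.

    Returns (encoding_used, rows). The fallback list is an assumption;
    a wrong guess silently produces wrong characters, so an explicit
    encoding supplied by the user is generally safer.
    """
    last_error = None
    for encoding in encodings:
        try:
            text = raw_bytes.decode(encoding)
        except UnicodeDecodeError as exc:
            # Remember the failure and try the next candidate encoding.
            last_error = exc
            continue
        # newline="" lets the csv module handle line endings itself.
        return encoding, list(csv.reader(io.StringIO(text, newline="")))
    raise last_error


# Sample data containing (c) U+00A9 and yen U+00A5, saved as Windows-1252.
sample = "name,copyright\nfoo,\u00a9 2018 Example \u00a5\n".encode("cp1252")

try:
    sample.decode("utf-8")
except UnicodeDecodeError as exc:
    # 0xa9 is a UTF-8 continuation byte, so it cannot start a character.
    print("utf-8 failed:", exc.reason)

encoding, rows = read_csv_rows_with_fallback(sample)
print(encoding, rows[1])
```

Re-saving the CSV as UTF-8 before running the transform avoids the problem without any code change.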
