You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
JulianCologne
changed the title
read_ndjson: RuntimeError: BindingsError: "InternalError(TapeError) at character 0 ('ï')"read_ndjson: Add encoding parameter
Mar 26, 2024
JulianCologne
changed the title
read_ndjson: Add encoding parameterread_ndjsonread_json: Add encoding parameter
Mar 26, 2024
Checks
Reproducible example
data.json (encoding=UTF-8-BOM)
Log output
RuntimeError: BindingsError: "InternalError(TapeError) at character 0 ('ï')"
Issue description
read_ndjson
errors with different encodings and there is no option to speficy encodingMany popular etl tools like azure DataFactory write json files with UTF-8-BOM encoding so this is happens quite a lot.
IMO every time you interact with a file the encoding is important to specify. Pandas also has this parameter.
Expected behavior
should allow encoding parameter and read data into dataframe
Installed versions
The text was updated successfully, but these errors were encountered: