UnicodeDecodeError when using athena.read_sql_query #1156
Comments
I was able to narrow the issue down to a particular column in the Athena database table. Since I am calling the function Is it possible to pass an encoding parameter to this function, which would then be used while reading the file (ex:
Hi @Chintan-D - thanks for reaching out. What is the data type of the column in Athena?
varchar
Has the bug been fixed?
@cotrariello84 Yes, the fix was released in 2.16.0.
Hi @malachi-constant, I still have the bug in version 2.17.0. I had to use pyathena to work around it, but I'd like to use wrangler.
See comment
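On the earlier question about passing an encoding parameter: the idea can be illustrated with the standard library alone. This is only a sketch, not awswrangler's actual code path. The `read_rows` helper and the sample payload are hypothetical, and it assumes the query result is a UTF-8 CSV file.

```python
import csv
import io

# Hypothetical CSV payload. The curly quotes U+201C/U+201D encode to the
# UTF-8 byte sequences E2 80 9C and E2 80 9D.
raw = "id,comment\n1,she said \u201chello\u201d\n".encode("utf-8")


def read_rows(payload: bytes, encoding: str = "utf-8") -> list:
    """Decode the payload with an explicit encoding instead of the OS default."""
    text = io.TextIOWrapper(io.BytesIO(payload), encoding=encoding, newline="")
    return list(csv.DictReader(text))


rows = read_rows(raw)  # decodes cleanly because the encoding is stated explicitly
```

Calling `read_rows(raw, encoding="cp1252")` on the same bytes raises `UnicodeDecodeError`, which is why an explicit encoding matters on a platform whose default is not UTF-8.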
Describe the bug
Hi,
I am using the latest version of the awswrangler library to extract a bunch of tables from Athena. For one of my tables, the function athena.read_sql_query fails with the error:
UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 230232: character maps to <undefined>
Here is the part of the code that raises this error:
df = wr.athena.read_sql_query(query, database=database, boto3_session=session, ctas_approach=False)
Code otherwise works fine for other tables.
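A likely reason only this table fails, assuming the result data is UTF-8 and is being decoded with the Windows default code page (cp1252): byte 0x9d has no mapping in cp1252, and it appears in UTF-8 text whenever a right curly quote (U+201D) is present. The snippet below is a minimal, standalone illustration of that mismatch. The sample string and the `try_decode` helper are made up for the example.

```python
# U+201D (right double quotation mark) is E2 80 9D in UTF-8.
RIGHT_QUOTE = "\u201d"
data = f"value {RIGHT_QUOTE}".encode("utf-8")


def try_decode(payload: bytes, encoding: str) -> str:
    """Return the decoded text, or the error message if decoding fails."""
    try:
        return payload.decode(encoding)
    except UnicodeDecodeError as exc:
        return f"{type(exc).__name__}: {exc}"


as_utf8 = try_decode(data, "utf-8")     # decodes to the original text
as_cp1252 = try_decode(data, "cp1252")  # fails on byte 0x9d with a 'charmap' error
print(as_utf8)
print(as_cp1252)
```

Tables whose text columns contain only ASCII decode identically under both encodings, which would explain why the other tables work.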
Here is the detailed error trace:
How to Reproduce
Expected behavior
No response
Your project
No response
Screenshots
No response
Environment
OS
Windows
Python version
3.9.2
AWS DataWrangler version
2.14.0
Additional context
No response