Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TypeError: 'coroutine' object is not iterable when reading parquet partitions via s3fs >= 0.5 with pyarrow #26849

Closed
asfimport opened this issue Dec 15, 2020 · 1 comment

Comments

@asfimport
Copy link
Collaborator

asfimport commented Dec 15, 2020

Trying to read partitioned parquet files using updated s3fs >= 0.5 (using via ``dask), and got this error:

 

410 files = set()
 411 
--> 412 for key in list(self.fs._ls(path, refresh=refresh)):
 413 path = key['Key']
 414 if key['StorageClass'] == 'DIRECTORY':

TypeError: 'coroutine' object is not iterable

coming from

for key in list(self.fs._ls(path, refresh=refresh)):
 

 

Seems related to switching s3fs to asyncio in https://github.com/dask/s3fs/pull/336.

Reporter: Ivan Necas

Related issues:

Note: This issue was originally created as ARROW-10921. Please see the migration documentation for further details.

@asfimport
Copy link
Collaborator Author

Joris Van den Bossche / @jorisvandenbossche:
[~inecas] thanks for the report! It's good to have an issue with this error, so others can find it. But so as you already noted yourself (on the PR), this is a duplicate of ARROW-10433 and fixed on master (to be released in pyarrow 3.0.0 in January). For now the best solution is indeed to pin s3fs to <0.5).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant