Hello,

I have been experimenting with the bz2 decompression functionality in the repo's master branch, which isn't part of your latest release. When a bz2-compressed file is read, the decompression seems to happen on only one worker node. Is it possible to parallelise the decompression of externally compressed files?
Thanks in advance for your response.
yivanova88 changed the title from "Decompression on sas7bdat.bz2 file is not distributed across worker nodes" to "Decompression of sas7bdat.bz2 file is not distributed across worker nodes" on Mar 17, 2020
Based on #50, this seems to be expected. bz2 is indeed splittable, but we need to locate page boundaries within SAS files. The easiest workaround is probably to decompress and parse as separate steps; both should be parallelizable.
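For illustration, the "decompress separately" step could be sketched roughly as below. This is only a minimal sketch using the Python standard library, not something from this repo: the helper names `decompress_bz2` and `decompress_all`, the thread pool, and the `max_workers` default are all assumptions. It parallelises across several `.bz2` files rather than within a single file; the resulting plain `.sas7bdat` files could then be read with the library's usual reader, which can split them.

```python
import bz2
import shutil
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def decompress_bz2(src, dest_dir):
    """Stream-decompress one .bz2 file into dest_dir and return the output path.

    Hypothetical helper: 'data.sas7bdat.bz2' becomes 'data.sas7bdat'.
    """
    src = Path(src)
    if src.suffix != ".bz2":
        raise ValueError(f"expected a .bz2 file, got {src.name}")
    dest = Path(dest_dir) / src.with_suffix("").name
    # Copy in 1 MiB chunks so the whole file never has to fit in memory.
    with bz2.open(src, "rb") as fin, open(dest, "wb") as fout:
        shutil.copyfileobj(fin, fout, 1024 * 1024)
    return dest


def decompress_all(sources, dest_dir, max_workers=4):
    """Decompress several .bz2 files concurrently, one task per file.

    Results are returned in the same order as `sources`.
    """
    Path(dest_dir).mkdir(parents=True, exist_ok=True)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda s: decompress_bz2(s, dest_dir), sources))
```

On a cluster, the same per-file function could be run as a distributed task instead of in a local pool; that choice depends on where the compressed files live.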