-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Delta lake fails to import (in Python) #7695
Comments
Michal Kurka commented: It doesn’t work when a slash is inserted at the end of the directory name (eg. by user or by python - hence the workaround). The regular expression that filters out the log files needs to be revised in order to work with/without slash. |
Neema Mashayekhi commented: Good point. The error show the crc and json paths but converted all the forward slashes and colon to underscore:{{"dbfs:/mnt/delta/events3/_delta_log/00000000000000000000.crc"}} -> {{dbfs__mnt_delta_events3__delta_log_00000000000000000000.crc}} {noformat}H2OResponseError: Server error water.exceptions.H2OIllegalArgumentException: |
JIRA Issue Migration Info Jira Issue: PUBDEV-7951 Linked PRs from JIRA Attachments From Jira Attachment Name: Screen Shot 2021-01-27 at 1.31.28 PM.png |
Delta lake file import was added to H2O (https://h2oai.atlassian.net/browse/PUBDEV-7923), but it fails for Python API
key is to disable the workaround in Python and it will start working:
H2OFrame.__LOCAL_EXPANSION_ON_SINGLE_IMPORT__ = False
The text was updated successfully, but these errors were encountered: