-
Notifications
You must be signed in to change notification settings - Fork 364
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support HDFS in Python package #1358
Comments
Unfortunately, HDFS driver requires JAVA, and one of the core interests is to not depend on that. so closing this as we right now see no way to provide this without java. |
I see you are working on a Rust implementation for hdfs (No JAVA!) in https://github.com/Kimahriman/hdfs-native and an object store implementation in https://github.com/datafusion-contrib/hdfs-native-object-store Are you also working on integrating your work into delta-rs? I am experimenting with that myself (https://github.com/SchutteJan/delta-rs) and I am curious what your plans are. |
Yeah I do have a branch I've been working on, just been lazy on getting it cleaned up and making a PR. The biggest limitation right now is the libgssapi dependency, which adds a dynamic link and requires some changes to how the wheels would be built. I think I'll try to get the Rust side finished up and make a PR with an option to build a custom Python wheel. I'm currently working on an update to do all gssapi/kerberos things via runtime dynamic loading with |
Description
Add support for HDFS in Python now that it's supported in Rust.
Use Case
Using deltalake Python package to read Delta tables in HDFS
Related Issue(s)
#300
The text was updated successfully, but these errors were encountered: