Support HDFS in Python package #1358

Closed
Kimahriman opened this issue May 12, 2023 · 4 comments
Labels
enhancement New feature or request

Comments

@Kimahriman
Contributor

Description

Add support for HDFS in Python now that it's supported in Rust.

Use Case
Using deltalake Python package to read Delta tables in HDFS

Related Issue(s)
#300
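
For illustration, the use case above might look roughly like the sketch below. It assumes a `deltalake` build that can resolve `hdfs://` URIs, which nothing in this thread confirms. The helper `hdfs_table_uri`, the NameNode host, the port, and the table path are all hypothetical; only `DeltaTable` and `to_pyarrow_table` are existing `deltalake` calls.

```python
def hdfs_table_uri(namenode: str, port: int, table_path: str) -> str:
    """Build an hdfs:// URI for a table directory (hypothetical helper)."""
    return f"hdfs://{namenode}:{port}/{table_path.lstrip('/')}"


def read_delta_table(table_uri: str):
    """Load a Delta table and return it as a PyArrow table.

    Assumption: the installed deltalake build can resolve hdfs:// URIs.
    """
    # Imported lazily so the URI helper works without deltalake installed.
    from deltalake import DeltaTable

    return DeltaTable(table_uri).to_pyarrow_table()


# Hypothetical NameNode host, port, and table path.
uri = hdfs_table_uri("namenode.example.com", 8020, "/warehouse/events")
print(uri)
```

Calling `read_delta_table(uri)` would only succeed against a reachable cluster and a build with HDFS support, so it is left out of the snippet.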

Kimahriman added the enhancement label May 12, 2023
@roeap
Collaborator

roeap commented Jan 28, 2024

Unfortunately, the HDFS driver requires Java, and one of our core interests is to not depend on that. Closing this, as right now we see no way to provide HDFS support without Java.

roeap closed this as not planned Jan 28, 2024
@SchutteJan
Contributor

@Kimahriman

I see you are working on a Rust implementation of HDFS (no Java!) at https://github.com/Kimahriman/hdfs-native and an object store implementation at https://github.com/datafusion-contrib/hdfs-native-object-store

Are you also working on integrating your work into delta-rs? I am experimenting with that myself (https://github.com/SchutteJan/delta-rs) and I am curious what your plans are.

@Kimahriman
Contributor Author

> Are you also working on integrating your work into delta-rs? I am experimenting with that myself (https://github.com/SchutteJan/delta-rs) and I am curious what your plans are.

Yeah, I do have a branch I've been working on, I've just been slow to clean it up and open a PR. The biggest limitation right now is the libgssapi dependency, which adds a dynamic link and requires some changes to how the wheels would be built. I think I'll try to get the Rust side finished up and make a PR with an option to build a custom Python wheel. I'm currently working on an update to do all GSSAPI/Kerberos things via runtime dynamic loading with libloading. If I can get that working, it should simplify building wheels and cross compilation, and theoretically it could be added to the published wheels if it's accepted.
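
To make the difference between a link-time dependency and runtime dynamic loading concrete, here is a minimal Python sketch of the same idea using `ctypes` from the standard library. It is only an analogy for what libloading does in Rust, not the code from the branch mentioned above. The helper `load_optional_library` and the library name `gssapi_krb5` are assumptions chosen for illustration.

```python
import ctypes
import ctypes.util
from typing import Optional


def load_optional_library(name: str) -> Optional[ctypes.CDLL]:
    """Try to load a shared library at run time; return None if it is absent."""
    path = ctypes.util.find_library(name)
    if path is None:
        return None
    try:
        return ctypes.CDLL(path)
    except OSError:
        # The library was located but could not be loaded.
        return None


# Assumed library name; the actual name varies by platform and Kerberos distribution.
gssapi = load_optional_library("gssapi_krb5")
if gssapi is None:
    print("GSSAPI not found; Kerberos authentication would be unavailable")
else:
    print("GSSAPI loaded at run time")
```

Because the library is looked up only when the program runs, the package itself can be built and installed on machines that do not have it, which is the property that would simplify wheel builds.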

@Kimahriman
Contributor Author

#2612
