New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[C++] Add filesystem / IO implementation for Google Cloud Storage #17070
Comments
Antoine Pitrou / @pitrou: |
Wes McKinney / @wesm: https://github.com/googleapis/google-cloud-cpp Note that this library began after I opened this issue originally so it's good that we waited https://github.com/googleapis/google-cloud-cpp/graphs/contributors Probably this will need to be added to conda-forge. We'll have to have libcurl in our build toolchain also... |
Lei (Eddy) Xu: We are very interested into this feature. |
Wes McKinney / @wesm: |
Zhuo Peng / @brills: Micah might know more.
https://googleapis.dev/cpp/google-cloud-storage/latest/ seems to be the documentation for https://googleapis.github.io/google-cloud-cpp/ ? |
Micah Kornfield / @emkornfield: |
Frank Natividad: I'm confirming that the Cloud Storage library in https://github.com/googleapis/google-cloud-cpp is the current state of the art.
Cheers |
Antoine Pitrou / @pitrou: |
Frank Natividad: |
Antoine Pitrou / @pitrou: |
Wes McKinney / @wesm: |
Frank Natividad:
|
Antoine Pitrou / @pitrou: |
Clark Zinzow: I'm mostly interested in realizing the benefits of ARROW-8031 in Ray, but if no one plans on taking this issue in the next few weeks, I could try to find a few spare cycles to take on this issue (following the patterns set by Antoine's S3 implementation) and 8031 thereafter. |
Antoine Pitrou / @pitrou: |
Wes McKinney / @wesm: |
Clark Zinzow: |
Wes McKinney / @wesm: https://github.com/apache/arrow/blob/master/cpp/src/arrow/filesystem/s3fs.cc If you want to work on a filesystem implementation for GCS without dealing with the packaging / toolchain issues, you are welcome to do that also. At some point all of this work (the filesystem wrapper and thirdparty toolchain support) has to be done properly so that we can package and deploy the software all the places it needs to go. |
Clark Zinzow: If that is the case, I'm asking if I (or someone else, if they are interested) could take on the packaging/toolchain issues ARROW-8147 and ARROW-8148, and once those are finished, I could work on the GCS external store implementation for Plasma. This would leave the much larger effort around the GCS filesystem implementation for later. Does that make sense? And is my judgement of the actual GCS filesystem implementation not being a prerequisite for the GCS external store implementation for Plasma correct? |
Wes McKinney / @wesm: |
Clark Zinzow: |
Wes McKinney / @wesm: |
Clark Zinzow: |
Antoine Pitrou / @pitrou: |
Wes McKinney / @wesm: https://github.com/TileDB-Inc/TileDB/blob/dev/tiledb/sm/filesystem/gcs.cc |
Wenbing Bai: |
Wes McKinney / @wesm: |
Carlos O'Ryan / @coryan:
I am interested in helping with the GCS C++ Filesystem implementation. I will start going through the documentation on how to contribute to the project, that may take me a few days. If there are questions or suggestions in the interim, please do not hesitate to ask.
|
Micah Kornfield / @emkornfield: |
See example jumping off point
https://github.com/tensorflow/tensorflow/tree/master/tensorflow/core/platform/cloud
Reporter: Wes McKinney / @wesm
Assignee: Carlos O'Ryan / @coryan
Related issues:
Note: This issue was originally created as ARROW-1231. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: