Is there any plan to provide a Java SDK for CacheLib? #288

Closed
openinx opened this issue Feb 4, 2024 · 2 comments

openinx commented Feb 4, 2024

Hi guys.

I think CacheLib is quite an interesting library that solves the problem of combining RAM and SSD caches. I noticed it because I'm considering how to keep recently queried datasets on local SSD to speed up Apache Spark queries. (I think the commercial Databricks product has implemented this feature; please see [1].) That is, the Spark executor would store chunks (for example 1 MB or 512 KB each) on its local SSD once a query has read the column data from remote object storage (AWS S3 or Azure Blob Storage).
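To make the idea concrete, here is a rough Java sketch of the read-through behavior described above. It does not use CacheLib or any Spark API: `LocalChunkCache`, `RemoteReader`, and `chunkIndexFor` are hypothetical names made up for illustration, the remote read is a stand-in for a ranged GET against object storage, and there is no eviction or size limit, which is exactly the part a real cache library would have to provide.

```java
import java.io.IOException;
import java.net.URLEncoder;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical read-through chunk cache backed by a local directory (e.g. on SSD). */
final class LocalChunkCache {

    /** Hypothetical stand-in for a ranged read against remote object storage. */
    interface RemoteReader {
        byte[] read(String objectKey, long offset, int length) throws IOException;
    }

    private final Path cacheDir;
    private final int chunkSize;
    private final RemoteReader remote;

    LocalChunkCache(Path cacheDir, int chunkSize, RemoteReader remote) {
        this.cacheDir = cacheDir;
        this.chunkSize = chunkSize;
        this.remote = remote;
    }

    /** Maps a byte offset inside an object to the index of the chunk that holds it. */
    static long chunkIndexFor(long byteOffset, int chunkSize) {
        return byteOffset / chunkSize;
    }

    /** Returns the chunk from local disk if present, otherwise fetches and stores it. */
    byte[] readChunk(String objectKey, long chunkIndex) throws IOException {
        Path file = cacheDir.resolve(fileName(objectKey, chunkIndex));
        if (Files.exists(file)) {
            return Files.readAllBytes(file); // cache hit: served from local disk
        }
        // Cache miss: read the chunk's byte range from remote storage.
        byte[] data = remote.read(objectKey, chunkIndex * chunkSize, chunkSize);
        Files.createDirectories(cacheDir);
        // Write to a temp file first so readers never see a half-written chunk.
        Path tmp = Files.createTempFile(cacheDir, "chunk", ".tmp");
        Files.write(tmp, data);
        Files.move(tmp, file, StandardCopyOption.REPLACE_EXISTING);
        return data;
    }

    /** Encodes the object key so it is safe to use as a single file name. */
    private static String fileName(String objectKey, long chunkIndex) throws IOException {
        return URLEncoder.encode(objectKey, "UTF-8") + "-" + chunkIndex + ".chunk";
    }
}
```

With a 1 MB chunk size, a read at byte offset 3,500,000 would map to chunk index 3, and only the first access to that chunk would go to remote storage.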

After reading the CacheLib paper [2], I think this project may be the perfect open-source candidate for maintaining the cached chunks. So my questions are:

A. Is there any plan to provide a Java SDK for this library? Or do you know of anyone else who has tried to build one? It's okay if nobody is currently working on this; I think I can try to implement the Java SDK myself if we finally decide to choose CacheLib :-)

B. Is there any plan to support compiling on macOS? This is just a minor question, since our team currently uses macOS for development. It's okay if there is no plan to support macOS; we can also build the Java SDK on Linux.

  1. https://docs.databricks.com/en/optimizations/disk-cache.html
  2. https://www.usenix.org/system/files/osdi20-berg.pdf

Best regards.

@jiayuebao
Contributor

Hi @openinx: Thanks for reaching out!

A. We don't have plans to provide a Java SDK for now.

B. For compiling on macOS, the code is available at: https://github.com/agordon/CacheLib/tree/macos-v40 (note that this is the macos-v40 branch, not the “main” branch). Could you try the following and see if it works? It was introduced a long time ago, so I'm not sure whether it still works:

$ git clone --branch macos-v40 https://github.com/agordon/CacheLib.git
$ cd CacheLib
$ ./contrib/macos-01-brew.sh

Install dependencies:

$ ./contrib/macos-02-build-dependencies.sh

Build CacheLib:

$ ./contrib/macos-03-build-cachelib.sh

@openinx
Author

openinx commented Feb 18, 2024

Got it, thanks @jiayuebao for the info.

The macOS build link is really helpful!
