-
Notifications
You must be signed in to change notification settings - Fork 28.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-13057][SQL] Add benchmark codes and the performance results for implemented compression schemes for InMemoryRelation #10965
Conversation
Test build #50254 has finished for PR 10965 at commit
|
Test build #50546 has finished for PR 10965 at commit
|
The size of in-memory columnar cache is much bigger than parquet data on disk because Spark uses simpler compression algorithms than parquet does in Since spark-sql already has |
I tried to use
|
@rxin Could you give me any comment on this? |
cc @nongli is this useful? |
@nongli ping |
the benchmark infra is updated, I think we need to rerun it and update the results. |
The benchmark LGTM and I think this is useful. @maropu Before you make significant changes to this, can you write up what you plan to do? |
@nongli Okay, I'll let you know the plan first. plz give me some time to look around similar codes in |
cc58f20
to
fab3fb2
Compare
Test build #50979 has finished for PR 10965 at commit
|
Thanks - I've merged this in master. |
This pr adds benchmark codes for in-memory cache compression to make future developments and discussions more smooth.