Skip to content
This repository has been archived by the owner on Jan 11, 2021. It is now read-only.

Integration with parquet-format #82

Merged
merged 15 commits into from
Apr 14, 2018
Merged

Conversation

sadikovi
Copy link
Collaborator

@sadikovi sadikovi commented Apr 10, 2018

This PR adds parquet-format dependency and removes parquet_thrift.
Uses parquet-format 2.4.0.

With this patch, users of the crate will no longer need Thrift to build the code.

@sadikovi
Copy link
Collaborator Author

This is WIP at the moment, I just want to test if travis passes with my changes, and we need to discuss if this is an appropriate strategy!

@sunchao
Copy link
Owner

sunchao commented Apr 11, 2018

Thanks @sadikovi . I agree on the benefits of including parquet.rs, but I'm a little hesitant mainly because both parquet-mr and parquet-cpp doesn't do it. This may cause issues if we want to merge into Apache in future.

I'm wondering if there's any workaround on the docs.rs issue. I'll also try to explore on my end before deciding which choice to take.

@sadikovi sadikovi changed the title Include parquet.rs and update build WIP Include parquet.rs and update build Apr 11, 2018
@sadikovi
Copy link
Collaborator Author

I completely understand your concern. I was just thinking that we could have more people using it and/or building on top of it, if crate was self-contained (you would only need Thrift for development). But, I also agree, that it would be better to generate, which makes me feel confused at this point:)

Some build step fails on this PR, so coveralls is not reported. I will see if I can do anything about it.

@sunchao
Copy link
Owner

sunchao commented Apr 11, 2018

Some build step fails on this PR, so coveralls is not reported. I will see if I can do anything about it.
Seems like a transient issue - let me re-launch the build.

Another approach I'm thinking about is to add a feature flag to fetch the parquet.rs from the release page. This can be used to skip the Thrift installation, and enable docs.rs. I have a WIP diff - could you take a look?

@sadikovi
Copy link
Collaborator Author

I will close this for now, and reopen if we decide to go this route. Current discussion is on #84.

@sadikovi sadikovi closed this Apr 11, 2018
@sadikovi sadikovi deleted the include-parquet-rs branch April 11, 2018 07:41
@sadikovi sadikovi restored the include-parquet-rs branch April 12, 2018 02:00
@sadikovi sadikovi reopened this Apr 12, 2018
@sadikovi sadikovi changed the title WIP Include parquet.rs and update build [WIP] Test integration with parquet-format Apr 12, 2018
@sadikovi
Copy link
Collaborator Author

Opened this PR to test integration with parquet-format.

@sunchao
Copy link
Owner

sunchao commented Apr 12, 2018

@sadikovi : the parquet-format crate is published.

@sadikovi
Copy link
Collaborator Author

@sunchao Thank you! I will update the pull request.

@coveralls
Copy link

coveralls commented Apr 12, 2018

Coverage Status

Coverage remained the same at 94.774% when pulling 7fcc9fc on sadikovi:include-parquet-rs into 5e4758c on sunchao:master.

@sadikovi sadikovi changed the title [WIP] Test integration with parquet-format Integration with parquet-format Apr 13, 2018
@sadikovi
Copy link
Collaborator Author

@sunchao I fixed this PR, had to add sudo: required for kcov problem. Could you review? Thanks!

@@ -1,3 +1,5 @@
dist: trusty
sudo: required
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm. why we don't need this before?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks like a regression in travis. I am not sure why we didn't need it before. I will spend some time figuring out why.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I checked the travis docs. It looks like whenever you explicitly use sudo make or any sudo <command>, travis will assume sudo: required flag. This is mentioned here: https://docs.travis-ci.com/user/reference/overview/#For-a-particular-.travis.yml-configuration

When sudo is required, then build runs on GCE, otherwise in container on EC2. Here is the comparison:

So we have been running with sudo: required all this time, because of this line https://github.com/sadikovi/parquet-rs/blob/master/.travis.yml#L18.

So now we just explicitly state that we require sudo (for kcov issue on travis).

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see. Nice finding! Thanks for the detailed explanation.

@@ -36,13 +36,13 @@ extern crate brotli;
extern crate flate2;
extern crate rand;
extern crate x86intrin;
extern crate parquet_format;
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe we can also remove some dependencies that were moved to parquet-format-rs?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I will remove them. Thanks.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed them, now lib.rs should contain only parquet dependencies.

Copy link
Owner

@sunchao sunchao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! You might need to rebase first since I just merged #81

@@ -1,3 +1,5 @@
dist: trusty
sudo: required
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see. Nice finding! Thanks for the detailed explanation.

@sadikovi
Copy link
Collaborator Author

@sunchao I rebased. Could you have a look again? Thanks!

@sunchao sunchao merged commit b48b415 into sunchao:master Apr 14, 2018
@sunchao
Copy link
Owner

sunchao commented Apr 14, 2018

Merged. Thanks @sadikovi !

@sadikovi
Copy link
Collaborator Author

Thank you very much @sunchao! I hope this change will make it easier to use parquet-rs crate in different projects!

@sadikovi sadikovi deleted the include-parquet-rs branch April 14, 2018 00:07
@sunchao sunchao mentioned this pull request Apr 14, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants