Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pyarrow]: sample for performing aggregation on arrow table using gandiva in pyarrow #5532

Closed
joetl opened this issue Sep 27, 2019 · 5 comments

Comments

@joetl
Copy link

joetl commented Sep 27, 2019

Can I get a sample to perform aggregation on arrow table in pyarrow using gandiva. Like sum(x) group by y order by z

@joetl joetl changed the title Python: sample for performing aggregation on arrow table using gandiva in pyarrow [pyarrow]: sample for performing aggregation on arrow table using gandiva in pyarrow Sep 27, 2019
@emkornfield
Copy link
Contributor

As far as I know Gandiva does provide group by or order by functionality. It is focused on expression evaluation (things that appear in the select statement)

@emkornfield
Copy link
Contributor

Cc @pravindra

@pravindra
Copy link
Contributor

pravindra commented Sep 28, 2019

@emkornfield is correct. gandiva does not support group-by/order-by functionality currently. It's limited to projection/filter use cases.

@joetl
Copy link
Author

joetl commented Sep 28, 2019

I am seeing cpp files for aggregate, sum, mean, aggregate in compute sub-module of arrow cop implementation..

https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/aggregate.cc
https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/sum.cc

@wesm
Copy link
Member

wesm commented Sep 28, 2019

Let's move the discussion either to JIRA or dev@arrow.apache.org.

A number of developers are interested in implementing hash aggregations in C++ but the work is not done yet.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants