Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle naming inconsistencies between SQL, DataFrame API and struct names #25835

Closed
asfimport opened this issue Aug 18, 2020 · 1 comment
Closed

Comments

@asfimport
Copy link

Currently, we have naming inconsistencies between the different APIs that make it a bit confusing. The typical example atm is

df.where().to_plan?.explain() shows a "Selection" in the plan when "Selection" in SQL and many other query languages is a projection, not a filter.

Other examples:

name: Selection
SQL: WHERE
DF: filter
name: Aggregation
SQL: GROUP BY
DF: aggregate
name: Projection
SQL: SELECT
DF: select,select_columns

I suggest that we align them with a common notation, preferably aligned with other more common query languages.

I am assigning this to you @andygrove as you are probably the only person that can take a decision on this.

Reporter: Jorge Leitão / @jorgecarleitao
Assignee: Andy Grove / @andygrove

PRs and other links:

Note: This issue was originally created as ARROW-9788. Please see the migration documentation for further details.

@asfimport
Copy link
Author

Andy Grove / @andygrove:
Issue resolved by pull request 7998
#7998

@asfimport asfimport added this to the 2.0.0 milestone Jan 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants