Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset #40173

Closed
wants to merge 1 commit into from

Conversation

amaliujia
Copy link
Contributor

What changes were proposed in this pull request?

Add groupBy(col1: String, cols: String*) to Scala client Dataset API.

Why are the changes needed?

API coverage

Does this PR introduce any user-facing change?

NO

How was this patch tested?

UT

@amaliujia
Copy link
Contributor Author

@hvanhovell

Copy link
Contributor

@hvanhovell hvanhovell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@@ -1990,14 +2019,14 @@ class Dataset[T] private[sql] (val sparkSession: SparkSession, private[sql] val
viewName: String,
replace: Boolean,
global: Boolean): Unit = {
val command = session.newCommand { builder =>
val command = sparkSession.newCommand { builder =>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah yes, I guess I broke the build...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Or we use this PR to fix the build, depending on you.

@hvanhovell
Copy link
Contributor

Merging.

hvanhovell pushed a commit that referenced this pull request Feb 26, 2023
### What changes were proposed in this pull request?

Add `groupBy(col1: String, cols: String*)` to Scala client Dataset API.

### Why are the changes needed?

API coverage

### Does this PR introduce _any_ user-facing change?

NO
### How was this patch tested?

UT

Closes #40173 from amaliujia/2nd_groupby.

Authored-by: Rui Wang <rui.wang@databricks.com>
Signed-off-by: Herman van Hovell <herman@databricks.com>
(cherry picked from commit a4c12fc)
Signed-off-by: Herman van Hovell <herman@databricks.com>
snmvaughan pushed a commit to snmvaughan/spark that referenced this pull request Jun 20, 2023
### What changes were proposed in this pull request?

Add `groupBy(col1: String, cols: String*)` to Scala client Dataset API.

### Why are the changes needed?

API coverage

### Does this PR introduce _any_ user-facing change?

NO
### How was this patch tested?

UT

Closes apache#40173 from amaliujia/2nd_groupby.

Authored-by: Rui Wang <rui.wang@databricks.com>
Signed-off-by: Herman van Hovell <herman@databricks.com>
(cherry picked from commit a4c12fc)
Signed-off-by: Herman van Hovell <herman@databricks.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants