Skip to content

Comments

[SPARK-40079] Add Imputer inputCols validation for empty input case#37518

Closed
WeichenXu123 wants to merge 1 commit intoapache:masterfrom
WeichenXu123:imputer-param-validation
Closed

[SPARK-40079] Add Imputer inputCols validation for empty input case#37518
WeichenXu123 wants to merge 1 commit intoapache:masterfrom
WeichenXu123:imputer-param-validation

Conversation

@WeichenXu123
Copy link
Contributor

Signed-off-by: Weichen Xu weichen.xu@databricks.com

What changes were proposed in this pull request?

Add Imputer inputCols validation for empty input case

Why are the changes needed?

If Imputer inputCols is empty, the fit works fine but when saving model, error will be raised:

AnalysisException:
Datasource does not support writing empty or nested empty schemas.
Please make sure the data schema has at least one or more column(s).

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Unit test.

Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
@github-actions github-actions bot added the ML label Aug 15, 2022
WeichenXu123 added a commit that referenced this pull request Aug 15, 2022
Signed-off-by: Weichen Xu <weichen.xudatabricks.com>

### What changes were proposed in this pull request?
Add Imputer inputCols validation for empty input case

### Why are the changes needed?
If Imputer inputCols is empty, the `fit` works fine but when saving model, error will be raised:

>
AnalysisException:
Datasource does not support writing empty or nested empty schemas.
Please make sure the data schema has at least one or more column(s).

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
Unit test.

Closes #37518 from WeichenXu123/imputer-param-validation.

Authored-by: Weichen Xu <weichen.xu@databricks.com>
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
(cherry picked from commit 87094f8)
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
WeichenXu123 added a commit that referenced this pull request Aug 15, 2022
Signed-off-by: Weichen Xu <weichen.xudatabricks.com>

### What changes were proposed in this pull request?
Add Imputer inputCols validation for empty input case

### Why are the changes needed?
If Imputer inputCols is empty, the `fit` works fine but when saving model, error will be raised:

>
AnalysisException:
Datasource does not support writing empty or nested empty schemas.
Please make sure the data schema has at least one or more column(s).

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
Unit test.

Closes #37518 from WeichenXu123/imputer-param-validation.

Authored-by: Weichen Xu <weichen.xu@databricks.com>
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
(cherry picked from commit 87094f8)
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
WeichenXu123 added a commit that referenced this pull request Aug 15, 2022
Signed-off-by: Weichen Xu <weichen.xudatabricks.com>

### What changes were proposed in this pull request?
Add Imputer inputCols validation for empty input case

### Why are the changes needed?
If Imputer inputCols is empty, the `fit` works fine but when saving model, error will be raised:

>
AnalysisException:
Datasource does not support writing empty or nested empty schemas.
Please make sure the data schema has at least one or more column(s).

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
Unit test.

Closes #37518 from WeichenXu123/imputer-param-validation.

Authored-by: Weichen Xu <weichen.xu@databricks.com>
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
(cherry picked from commit 87094f8)
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
@WeichenXu123
Copy link
Contributor Author

Merged to master / branch-3.3 / branch-3.2 / branch-3.1

sunchao pushed a commit to sunchao/spark that referenced this pull request Jun 2, 2023
Signed-off-by: Weichen Xu <weichen.xudatabricks.com>

### What changes were proposed in this pull request?
Add Imputer inputCols validation for empty input case

### Why are the changes needed?
If Imputer inputCols is empty, the `fit` works fine but when saving model, error will be raised:

>
AnalysisException:
Datasource does not support writing empty or nested empty schemas.
Please make sure the data schema has at least one or more column(s).

### Does this PR introduce _any_ user-facing change?
No.

### How was this patch tested?
Unit test.

Closes apache#37518 from WeichenXu123/imputer-param-validation.

Authored-by: Weichen Xu <weichen.xu@databricks.com>
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
(cherry picked from commit 87094f8)
Signed-off-by: Weichen Xu <weichen.xu@databricks.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants