Skip to content

[SPARK-42364][PS][TESTS] Split 'pyspark.pandas.tests.test_dataframe'#39915

Closed
zhengruifeng wants to merge 1 commit intoapache:masterfrom
zhengruifeng:ps_split_test_df
Closed

[SPARK-42364][PS][TESTS] Split 'pyspark.pandas.tests.test_dataframe'#39915
zhengruifeng wants to merge 1 commit intoapache:masterfrom
zhengruifeng:ps_split_test_df

Conversation

@zhengruifeng
Copy link
Contributor

@zhengruifeng zhengruifeng commented Feb 7, 2023

What changes were proposed in this pull request?

Split 'pyspark.pandas.tests.test_dataframe'

Why are the changes needed?

'pyspark.pandas.tests.test_dataframe' is too large: more than 7K lines, may take >15min sometimes

Does this PR introduce any user-facing change?

No, test-only

How was this patch tested?

updated UT

in my local env: 175 sec -> 95 sec + 102 sec

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM.

dongjoon-hyun pushed a commit that referenced this pull request Feb 7, 2023
### What changes were proposed in this pull request?
Split 'pyspark.pandas.tests.test_dataframe'

### Why are the changes needed?
'pyspark.pandas.tests.test_dataframe' is too large: more than 7K lines, may take >15min sometimes

### Does this PR introduce _any_ user-facing change?
No, test-only

### How was this patch tested?
updated UT

in my local env: 175 sec -> 95 sec + 102 sec

Closes #39915 from zhengruifeng/ps_split_test_df.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit 58b6535)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
@zhengruifeng
Copy link
Contributor Author

thank you @dongjoon-hyun @HyukjinKwon for the reviews!

@zhengruifeng zhengruifeng deleted the ps_split_test_df branch February 7, 2023 08:50
snmvaughan pushed a commit to snmvaughan/spark that referenced this pull request Jun 20, 2023
### What changes were proposed in this pull request?
Split 'pyspark.pandas.tests.test_dataframe'

### Why are the changes needed?
'pyspark.pandas.tests.test_dataframe' is too large: more than 7K lines, may take >15min sometimes

### Does this PR introduce _any_ user-facing change?
No, test-only

### How was this patch tested?
updated UT

in my local env: 175 sec -> 95 sec + 102 sec

Closes apache#39915 from zhengruifeng/ps_split_test_df.

Authored-by: Ruifeng Zheng <ruifengz@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
(cherry picked from commit 58b6535)
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants