Skip to content

Conversation

@zhengruifeng
Copy link
Contributor

@zhengruifeng zhengruifeng commented Sep 22, 2022

What changes were proposed in this pull request?

Remove pyspark.pandas.ml

Why are the changes needed?

pyspark.pandas.ml is no longer needed, since we implemented correlations based on Spark SQL:

  1. pearson corrleation implemented in [SPARK-40399][PS] Make pearson correlation in DataFrame.corr support missing values and min_periods  #37845
  2. spearman corrleation implemented [SPARK-40421][PS] Make spearman correlation in DataFrame.corr support missing values and min_periods #37874

Does this PR introduce any user-facing change?

No

How was this patch tested?

updated suites

address comments
@HyukjinKwon
Copy link
Member

@zhengruifeng it would be great to link which PR or JIRA removed them though.

@HyukjinKwon
Copy link
Member

Merged to master.

@zhengruifeng zhengruifeng deleted the ps_del_ml branch September 23, 2022 00:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants