-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-36617][PYTHON] Fix type hints for approxQuantile to support multi-column version
#33880
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
ok to test |
|
Could you also add |
approxQuantile to support multi-column versionapproxQuantile to support multi-column version
|
Could you check your actions tab (https://github.com/carylee/spark/actions) directly (you might see something like described in this comment)? |
Update both `DataFrame.approxQuantile` and `DataFrameStatFunctions.approxQuantile` to support overloaded definitions when multiple columns are supplied.
|
ok to test |
zero323
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, subject to passing Jenkins tests.
|
test this please |
|
@zero323 It looks like the tests passed? https://github.com/carylee/spark/actions/runs/1191206733 |
HyukjinKwon
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM2. I will leave it to @zero323.
|
ok to test |
|
I think it's good to go and merge since GitHub Actions passed. This change shouldn't break any of tests anyway. |
…multi-column version ### What changes were proposed in this pull request? Update both `DataFrame.approxQuantile` and `DataFrameStatFunctions.approxQuantile` to support overloaded definitions when multiple columns are supplied. ### Why are the changes needed? The current type hints don't support the multi-column signature, a form that was added in Spark 2.2 (see [the approxQuantile docs](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.DataFrame.approxQuantile.html).) This change was also introduced to pyspark-stubs (zero323/pyspark-stubs#552). zero323 asked me to open a PR for the upstream change. ### Does this PR introduce _any_ user-facing change? This change only affects type hints - it brings the `approxQuantile` type hints up to date with the actual code. ### How was this patch tested? Ran `./dev/lint-python`. Closes #33880 from carylee/master. Authored-by: Cary Lee <[email protected]> Signed-off-by: zero323 <[email protected]> (cherry picked from commit 37f5ab0) Signed-off-by: zero323 <[email protected]>
…multi-column version ### What changes were proposed in this pull request? Update both `DataFrame.approxQuantile` and `DataFrameStatFunctions.approxQuantile` to support overloaded definitions when multiple columns are supplied. ### Why are the changes needed? The current type hints don't support the multi-column signature, a form that was added in Spark 2.2 (see [the approxQuantile docs](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.DataFrame.approxQuantile.html).) This change was also introduced to pyspark-stubs (zero323/pyspark-stubs#552). zero323 asked me to open a PR for the upstream change. ### Does this PR introduce _any_ user-facing change? This change only affects type hints - it brings the `approxQuantile` type hints up to date with the actual code. ### How was this patch tested? Ran `./dev/lint-python`. Closes #33880 from carylee/master. Authored-by: Cary Lee <[email protected]> Signed-off-by: zero323 <[email protected]> (cherry picked from commit 37f5ab0) Signed-off-by: zero323 <[email protected]>
|
Merged to master, branch-3.2 and branch-3.1. |
|
Thanks everyone! |
…multi-column version ### What changes were proposed in this pull request? Update both `DataFrame.approxQuantile` and `DataFrameStatFunctions.approxQuantile` to support overloaded definitions when multiple columns are supplied. ### Why are the changes needed? The current type hints don't support the multi-column signature, a form that was added in Spark 2.2 (see [the approxQuantile docs](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.DataFrame.approxQuantile.html).) This change was also introduced to pyspark-stubs (zero323/pyspark-stubs#552). zero323 asked me to open a PR for the upstream change. ### Does this PR introduce _any_ user-facing change? This change only affects type hints - it brings the `approxQuantile` type hints up to date with the actual code. ### How was this patch tested? Ran `./dev/lint-python`. Closes apache#33880 from carylee/master. Authored-by: Cary Lee <[email protected]> Signed-off-by: zero323 <[email protected]> (cherry picked from commit 37f5ab0) Signed-off-by: zero323 <[email protected]>

What changes were proposed in this pull request?
Update both
DataFrame.approxQuantileandDataFrameStatFunctions.approxQuantileto support overloaded definitions when multiple columns are supplied.Why are the changes needed?
The current type hints don't support the multi-column signature, a form that was added in Spark 2.2 (see the approxQuantile docs.) This change was also introduced to pyspark-stubs (zero323/pyspark-stubs#552). @zero323 asked me to open a PR for the upstream change.
Does this PR introduce any user-facing change?
This change only affects type hints - it brings the
approxQuantiletype hints up to date with the actual code.How was this patch tested?
Ran
./dev/lint-python.