[SPARK-2515][mllib] Chi Squared test #1733
QA tests have started for PR 1733. This patch merges cleanly. |
|
QA results for PR 1733: |
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
@mengxr @jkbradley @falaki |
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
remove space between |
Shall we call it chiSqTest (following R's naming)? We need "test" in the method name because χ² is also a distribution. I think chiSqTest may be better than chiSquaredTest because the test is also called the chi-square test, without the "d".
chiSqTest sounds good.
|
QA results for PR 1733: |
documentation
|
QA results for PR 1733: |
|
I think we should either allow user to input the raw observations or use |
|
@dorx I checked R's implementation and finally figured out what is going on.
I want to suggest the following APIs: |
|
The previous proposal may be hard to implement in Python. Another solution would be to separate the goodness-of-fit test from the independence test, e.g., We can also add direct RDD support, which may be unnecessary: Since we only support |
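To make the distinction between the two tests concrete, here is a minimal plain-Python sketch of Pearson's chi-squared statistic for each case. It is not the API proposed in this PR: the function names `goodness_of_fit` and `independence` are hypothetical, the inputs are plain lists rather than MLlib vectors or matrices, and p-values are left out.

```python
def goodness_of_fit(observed, expected=None):
    """Return (statistic, degrees_of_freedom) for a goodness-of-fit test.

    Hypothetical helper: `observed` holds category counts. If `expected`
    is omitted, a uniform distribution over the categories is assumed.
    """
    n = len(observed)
    total = float(sum(observed))
    if expected is None:
        expected = [total / n] * n
    else:
        # Rescale so the expected counts sum to the observed total.
        scale = total / sum(expected)
        expected = [e * scale for e in expected]
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return statistic, n - 1


def independence(table):
    """Return (statistic, degrees_of_freedom) for an independence test.

    Hypothetical helper: `table` is a contingency table given as a list
    of rows. Expected counts come from the row and column totals.
    """
    row_sums = [sum(row) for row in table]
    col_sums = [sum(col) for col in zip(*table)]
    total = float(sum(row_sums))
    statistic = 0.0
    for i, row in enumerate(table):
        for j, o in enumerate(row):
            e = row_sums[i] * col_sums[j] / total
            statistic += (o - e) ** 2 / e
    return statistic, (len(row_sums) - 1) * (len(col_sums) - 1)


print(goodness_of_fit([10, 20, 30]))  # (10.0, 2)
print(independence([[10, 20], [20, 10]]))
```

The two functions take differently shaped inputs (a vector of counts versus a contingency table), which is one reason to keep them as separate entry points. This sketch does not guard against zero expected counts.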
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
QA results for PR 1733: |
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
QA results for PR 1733: |
could be initialized as a null
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
QA results for PR 1733: |
p-value should be 0 here (strongly against the null)
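The code this comment refers to is not shown here, so the following is only an illustration of why an extreme statistic implies a p-value of 0. It assumes 2 degrees of freedom, where the chi-squared upper-tail probability has the closed form exp(-x/2). The function name `chi_sq_p_value_df2` is hypothetical.

```python
import math


def chi_sq_p_value_df2(statistic):
    """Upper-tail p-value for a chi-squared statistic with 2 degrees of freedom.

    For df = 2 the survival function is exactly exp(-x / 2).
    """
    return math.exp(-statistic / 2.0)


print(chi_sq_p_value_df2(0.0))       # 1.0: no evidence against the null
print(chi_sq_p_value_df2(math.inf))  # 0.0: strongest evidence against the null
```

As the statistic grows without bound, the p-value tends to 0, which matches the reading "strongly against the null".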
|
QA tests have started for PR 1733. This patch merges cleanly. |
|
QA results for PR 1733: |
Author: Doris Xin <[email protected]>

Closes #1733 from dorx/chisquare and squashes the following commits:

cafb3a7 [Doris Xin] fixed p-value for extreme case.
d286783 [Doris Xin] Merge branch 'master' into chisquare
e95e485 [Doris Xin] reviewer comments.
7dde711 [Doris Xin] ChiSqTestResult renaming and changed to Class
80d03e2 [Doris Xin] Reviewer comments.
c39eeb5 [Doris Xin] units passed with updated API
e90d90a [Doris Xin] Merge branch 'master' into chisquare
7eea80b [Doris Xin] WIP
d64c2fb [Doris Xin] Merge branch 'master' into chisquare
5686082 [Doris Xin] facelift
bc7eb2e [Doris Xin] unit passed; still need docs and some refactoring
50703a5 [Doris Xin] merge master
4e4e361 [Doris Xin] WIP
e6b83f3 [Doris Xin] reviewer comments
3d61582 [Doris Xin] input names
706d436 [Doris Xin] Added API for RDD[Vector]
6598379 [Doris Xin] API and code structure.
ff17423 [Doris Xin] WIP

(cherry picked from commit 32638b5)
Signed-off-by: Xiangrui Meng <[email protected]>
|
LGTM. Merged into both master and branch-1.1. Thanks! |