[SPARK-8598] [MLlib] Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs #6994
Closed
27 commits
13dfe4d
created test result class for ks test
c659ea1
created KS test class
4da189b
added user facing ks test functions
ce8e9a1
added kstest testing in HypothesisTestSuite
b9cff3a
made small changes to pass style check
f6951b6
changed style and some comments based on feedback from pull request
c18dc66
removed ksTestOpt from API and changed comments in HypothesisTestSuit…
16b5c4c
renamed dat to data and eliminated recalc of RDD size by sharing as a…
0b5e8ec
changed KS one sample test to perform just 1 distributed pass (in add…
4b8ba61
fixed off by 1/N in cases when post-constant adjustment ecdf is above…
6a4784f
specified what distributions are available for the convenience method…
992293b
Style changes as per comments and added implementation note explainin…
3f81ad2
renamed ks1 sample test for clarity
9c0f1af
additional style changes incorporated and added documentation to mlli…
1226b30
reindent multi-line lambdas, prior interpretation of style guide was w…
9026895
addressed style changes, correctness change to simpler approach, and …
3288e42
addressed style changes, correctness change to simpler approach, and …
e760ebd
line length changes to fit style check
7e66f57
copied implementation note to public api docs, and added @see for lin…
a4bc0c7
changed ksTest(data, distName) to ksTest(data, distName, params*) aft…
2ec2aa6
initialize to stdnormal when no params passed (and log). Change unit …
1bb44bd
style and doc changes. Factored out ks test into 2 separate tests
a48ae7b
refactor code to account for serializable RealDistribution. Reuse tes…
1f56371
changed ksTest in public API to kolmogorovSmirnovTest for clarity
0d0c201
kstTest -> kolmogorovSmirnovTest in statistics.md
bbb30b1
renamed KSTestResult to KolmogorovSmirnovTestResult, to stay consiste…
08834f4
Merge remote-tracking branch 'upstream/master'
kolmogorov smirnov -> Kolmogorov-Smirnov
Could you also add a unit test to compare with R? Here is one example:
https://github.com/apache/spark/blob/master/mllib/src/test/scala/org/apache/spark/ml/classification/LogisticRegressionSuite.scala#L214
Added a comparison using R's ks.test (from the stats package).
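
For readers unfamiliar with the quantity being compared, here is a minimal sketch of the one-sample, two-sided Kolmogorov-Smirnov statistic in plain Python. It is not the PR's implementation and does not use the Spark API; the names `ks_statistic`, `std_normal_cdf`, and `uniform_cdf` are hypothetical and chosen for illustration only. The check at the end uses a value that can be verified by hand; a real reference test would instead hard-code the statistic reported by R's `ks.test` for the same data.

```python
import math


def std_normal_cdf(x: float) -> float:
    """CDF of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def uniform_cdf(x: float) -> float:
    """CDF of the Uniform(0, 1) distribution."""
    return min(1.0, max(0.0, x))


def ks_statistic(sample, cdf) -> float:
    """One-sample, two-sided KS statistic (hypothetical helper).

    Returns the largest absolute gap between the empirical CDF of
    `sample` and the theoretical `cdf`.
    """
    xs = sorted(sample)
    n = len(xs)
    if n == 0:
        raise ValueError("sample must be non-empty")
    d = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x, so the gap
        # must be checked on both sides of the jump.
        d = max(d, (i + 1) / n - f, f - i / n)
    return d


# Hand-checkable case: the gaps just after each point are
# 1/3 - 0.1, 2/3 - 0.4 and 1 - 0.7, so the statistic is 0.3.
d_uniform = ks_statistic([0.1, 0.4, 0.7], uniform_cdf)
print(d_uniform)
```

Checking both sides of each jump is what avoids an off-by-1/n error in the statistic. The p-value is deliberately left out of this sketch, since exact and asymptotic methods differ between libraries.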