-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-47547][CORE] Add BloomFilter V2 and use it as default
#50933
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
ishnagy
wants to merge
46
commits into
apache:master
from
ishnagy-patches:SPARK-47547_bloomfilter_fpp_degradation
Closed
Changes from 4 commits
Commits
Show all changes
46 commits
Select commit
Hold shift + click to select a range
3c5a843
SPARK-47547 BloomFilter fpp degradation: addressing the int32 truncation
ishnagy 08cbfeb
SPARK-47547 BloomFilter fpp degradation: fixing test data repetition …
ishnagy e3cb08e
SPARK-47547 BloomFilter fpp degradation: scrambling the high 32bytes …
ishnagy c4e3f58
SPARK-47547 BloomFilter fpp degradation: random distribution fpp test
ishnagy 1a0b66f
SPARK-47547 BloomFilter fpp degradation: javadoc for test methods, ch…
ishnagy d912b66
SPARK-47547 BloomFilter fpp degradation: make seed serialization back…
ishnagy f589e2c
SPARK-47547 BloomFilter fpp degradation: counting discarded odd items…
ishnagy f597c76
SPARK-47547 BloomFilter fpp degradation: refactoring FPP counting log…
ishnagy 4ea633d
SPARK-47547 BloomFilter fpp degradation: checkstyle fix
ishnagy 6696106
SPARK-47547 BloomFilter fpp degradation: fix test bug
ishnagy b75e187
SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…
ishnagy 2d8a9f1
SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…
ishnagy 4a30794
SPARK-47547 BloomFilter fpp degradation: parallelization friendly tes…
ishnagy d9d6980
SPARK-47547 BloomFilter fpp degradation: addressing concerns around d…
ishnagy 39a46c9
SPARK-47547 BloomFilter fpp degradation: cut down test cases to decre…
ishnagy 7f235e7
Merge branch 'master' into SPARK-47547_bloomfilter_fpp_degradation
ishnagy 16be3a9
SPARK-47547 BloomFilter fpp degradation: revert creating a new SlowTe…
ishnagy e91b5ca
SPARK-47547 BloomFilter fpp degradation: disable progress logging by …
ishnagy 897c1d4
SPARK-47547 BloomFilter fpp degradation: adjust tolerance and fail on…
ishnagy 013bfe4
SPARK-47547 BloomFilter fpp degradation: make V1/V2 distinction in Bl…
ishnagy 6d44c1e
SPARK-47547 BloomFilter fpp degradation: scrambling test input withou…
ishnagy 925bf12
SPARK-47547 BloomFilter fpp degradation: parallelizing BloomFilter re…
ishnagy 6f28882
SPARK-47547 BloomFilter fpp degradation: add seed to equals/hashCode
ishnagy ed6caac
SPARK-47547 BloomFilter fpp degradation: checkstyle fix
ishnagy 7d4ef74
SPARK-47547 BloomFilter fpp degradation: remove dependency between lo…
ishnagy c52ead3
Merge branch 'master' into SPARK-47547_bloomfilter_fpp_degradation
ishnagy 0ab8276
SPARK-47547 BloomFilter fpp degradation: running /dev/scalafmt
ishnagy d2477bf
SPARK-47547 BloomFilter fpp degradation: javadoc comment for the V2 enum
ishnagy 413c4fe
SPARK-47547 BloomFilter fpp degradation: reindent with 2 spaces
ishnagy 4599fcb
SPARK-47547 BloomFilter fpp degradation: (recover empty line in Bloom…
ishnagy 1ee2e13
SPARK-47547 BloomFilter fpp degradation: JEP-361 style switches
ishnagy c501b2a
SPARK-47547 BloomFilter fpp degradation: removing Objects::equals
ishnagy 1f5cfb6
SPARK-47547 BloomFilter fpp degradation: add missing seed comparison …
ishnagy f60d55f
SPARK-47547 BloomFilter fpp degradation: checkstyle
ishnagy 0314963
SPARK-47547 BloomFilter fpp degradation: BloomFilterBase abstract par…
ishnagy f2df338
SPARK-47547 BloomFilter fpp degradation: pull up long and byte hashin…
ishnagy 4aaff83
SPARK-47547 BloomFilter fpp degradation: checkstyle
ishnagy e214bd7
SPARK-47547 BloomFilter fpp degradation: removing unnecessary line wr…
ishnagy 99f7343
SPARK-47547 BloomFilter fpp degradation: moving junit-pioneer version…
ishnagy 58e3066
SPARK-47547 BloomFilter fpp degradation: (empty line juggling)
ishnagy c06cb38
SPARK-47547 BloomFilter fpp degradation: pull up common hash scatteri…
ishnagy b99ef3a
SPARK-47547 BloomFilter fpp degradation: (empty line juggling)
ishnagy ce3ad76
SPARK-47547 BloomFilter fpp degradation: remove redundant default cas…
ishnagy 626e459
SPARK-47547 BloomFilter fpp degradation: properly capitalize InputStr…
ishnagy b0f5b45
SPARK-47547 BloomFilter fpp degradation: indenting method parameters …
ishnagy 6849dbe
SPARK-47547 BloomFilter fpp degradation: removing junit-pioneer from …
ishnagy File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For the management of dependency versions, they should be placed in the parent
pom.xml. However, ifTestSparkBloomFiltercan be removed from the current pr, then it seems that this dependency is no longer needed either.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll defer addressing this, until we decide what should happen with
TestSparkBloomFilter.(remove & move the versions under managed dependencies)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we keep the dependency then please move the version to the main pom.