[Swift 4.2] Bloom Filter #759

SpacyRicochet · 2018-10-04T18:52:15Z

Checklist

I've looked at the contribution guidelines.
This pull request is complete and ready for review.

Description

* Remove the top code snippet, as per instructions. * Remove use of deprecated `characters` property.

* Adds a section about another Bloom filter approach, using only a single hashing function with Swift 4.2's `Hasher`. * Adds links to some more documentation and a blog post implementing the Bloom filter in this way. * Adds my name as updater.

SpacyRicochet · 2018-10-11T12:04:42Z

References #748.

kelvinlauKL · 2018-10-23T04:53:22Z

Bloom Filter/README.markdown

-*Written for Swift Algorithm Club by Jamil Dhanani. Edited by Matthijs Hollemans.*
+## Another approach
+
+Another approach to create different hashes of an element for use in the Bloom filter, is to use the same hash function for every iteration, but combine it with different random numbers. This can help, because finding good hashing functions is hard, but combining them is equally non-trivial.


This part is unclear for me.

My understanding of hash functions is that it should be deterministic -- "hello world!" should hash to the same value during the lifetime of a single program.

If you combine the result of the hash function with different random numbers, wouldn't that just be akin to producing random numbers?

For the Bloom filter, the actual hash isn't important, as long as it is consistent and a proper hash. So if you guarantee that you acquire the hash for the same object in the same way during the lifetime of the Bloom filter, you're fine.

This approach would create random numbers at the initialisation of the Bloom filter, and use them consistently as modifiers on an object's hash, effectively creating a new hash function per random number. So the hash would still be same for object A during the lifetime of that Bloom filter.

Make sense? The linked blog post about it is probably clearer.

Ah, I see what you're getting at. I read this as "everything time you use the hash function, combine it with a random number."

I propose we change this paragraph to the following:

In the previous section, you learnt about how using multiple different hash functions can help reduce the chance of collisions in the bloom filter. However, good hash functions are difficult to design. A simple alternative to multiple hash functions is to use a set of random numbers.

As an example, let's say a bloom filter wants to hash each element 15 times during insertion. Instead of using 15 different hash functions, you can rely on just one hash function. The hash value can then be offset by 15 different values to form the indices for flipping. This bloom filter would initialize a set of 15 random numbers ahead of time and use these values during each insertion.

Let me know if that sounds fine with you. If this is okay, I'll go ahead and make the change to merge it in.

Updated the README.

kelvinlauKL · 2018-11-11T23:31:37Z

Thanks @SpacyRicochet!

SpacyRicochet added 5 commits October 4, 2018 20:49

Update project with Xcode 10.

8943b09

Remove use of deprecated characters property.

06cdc9f

Xcode 10 automated changes.

6ef7a6e

Update for Swift 4.2.

4201fb4

* Remove the top code snippet, as per instructions. * Remove use of deprecated `characters` property.

Add note about Hasher in README.

849309c

* Adds a section about another Bloom filter approach, using only a single hashing function with Swift 4.2's `Hasher`. * Adds links to some more documentation and a blog post implementing the Bloom filter in this way. * Adds my name as updater.

SpacyRicochet changed the title ~~[WIP] [Swift 4.2] Bloom Filter~~ [Swift 4.2] Bloom Filter Oct 4, 2018

kelvinlauKL reviewed Oct 23, 2018

View reviewed changes

Update README according to comments.

f37fb19

kelvinlauKL merged commit 0e4bd5f into kodecocodes:master Nov 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Swift 4.2] Bloom Filter #759

[Swift 4.2] Bloom Filter #759

Uh oh!

SpacyRicochet commented Oct 4, 2018 •

edited

Loading

Uh oh!

SpacyRicochet commented Oct 11, 2018

Uh oh!

kelvinlauKL Oct 23, 2018

Uh oh!

SpacyRicochet Oct 23, 2018

Uh oh!

kelvinlauKL Oct 28, 2018

Uh oh!

SpacyRicochet Nov 1, 2018

Uh oh!

kelvinlauKL commented Nov 11, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Swift 4.2] Bloom Filter #759

[Swift 4.2] Bloom Filter #759

Uh oh!

Conversation

SpacyRicochet commented Oct 4, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Description

Uh oh!

SpacyRicochet commented Oct 11, 2018

Uh oh!

kelvinlauKL Oct 23, 2018

Choose a reason for hiding this comment

Uh oh!

SpacyRicochet Oct 23, 2018

Choose a reason for hiding this comment

Uh oh!

kelvinlauKL Oct 28, 2018

Choose a reason for hiding this comment

Uh oh!

SpacyRicochet Nov 1, 2018

Choose a reason for hiding this comment

Uh oh!

kelvinlauKL commented Nov 11, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SpacyRicochet commented Oct 4, 2018 •

edited

Loading