Suppose you are a poor orkish messenger and have $n$ rings to distribute to the $n$ elven-kings under the sky. Of course each is supposed to get their own, but you’re lazy and the nasty elves all look the same anyway.
Due to your extensive experience with ~~chaos~~ randomization you quickly see that throwing rings at elves randomly is in expectation just as good. The number of rings each elf receives is distributed as $\mathrm{Bin}(n, 1/n)$, so the expectation is one. You might also know that binomial random variables are pretty strongly concentrated around their expectation, and you don’t think the dependencies between the elves will change anything about that.
Your boss, however, would like some more arguments for why your approach is good enough. Let’s try to compute the maximum number of rings any elf will get.
The knowledgeable reader probably recognizes this as a classical balls-into-bins setting, with all the immediate applications to scheduling, hashing, and so forth. From here on I will use the more standard nomenclature.
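Before doing any math, we can watch the process in action. Here is a minimal simulation sketch (the function names, seed, and trial counts are my own illustrative choices):

```python
import math
import random
from collections import Counter

def max_load(n: int, rng: random.Random) -> int:
    """Throw n balls into n bins uniformly at random; return the maximum load."""
    loads = Counter(rng.randrange(n) for _ in range(n))
    return max(loads.values())

if __name__ == "__main__":
    rng = random.Random(0)
    for n in (10**3, 10**4, 10**5, 10**6):
        observed = max(max_load(n, rng) for _ in range(3))  # best of a few trials
        print(f"n={n:>7}: max load {observed}, "
              f"ln n / ln ln n = {math.log(n) / math.log(math.log(n)):.1f}")
```

The maximum load grows very slowly and tracks the $\ln n/\ln\ln n$ column up to a small constant factor; the rest of this post is about arguing that this is no accident.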
This seems like a difficult problem to tackle, but it turns out we can use the first moment method[^1] on appropriately defined random variables.
As is so often the case, ‘appropriately’ defining things gets you halfway to a solution. The first moment method allows us to show that a random variable will be zero asymptotically almost surely. In our application we want to show that there will be no bins with a large number of balls, so let’s define $X_k$ to count the number of bins that have $k$ or more balls.
As we already observed, the number of balls in any fixed bin is binomially distributed, so it is not so difficult to bound the probability that a bin contains $k$ or more balls. To make things easier, we will use the Poisson approximation[^2] and cheat a little with the constant factors involved:

$$\Pr[\text{bin } i \text{ contains } \geq k \text{ balls}] \approx \Pr[\mathrm{Po}(1) \geq k] = \sum_{j \geq k} \frac{e^{-1}}{j!} \leq \frac{2}{k!} \leq 2\left(\frac{e}{k}\right)^{k}.$$
We used Stirling to approximate $k! \geq (k/e)^k$. By defining an indicator $X_k^{(i)}$ for every bin $i$ we see that the expectation is just $\mathbb{E}[X_k] = \sum_i \Pr[X_k^{(i)} = 1] \leq 2n\left(\frac{e}{k}\right)^k$. Clearly this expectation goes to zero for $k = c \ln n/\ln\ln n$ for some suitable $c$, depending on the exact values of the constants in the exponent.
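To spell out why this choice of $k$ works (a back-of-the-envelope check; the constant bookkeeping here is deliberately generous): for $k = c\ln n/\ln\ln n$ we have $\ln k = (1-o(1))\ln\ln n$, so $k\ln k = (1-o(1))\,c\ln n$ while $k$ itself is only $o(\ln n)$, and therefore

$$\ln \mathbb{E}[X_k] \leq \ln 2 + \ln n + k - k\ln k = \ln n - (1-o(1))\,c\ln n + o(\ln n) \;\longrightarrow\; -\infty$$

for any fixed $c > 1$. Markov’s inequality then turns this into $\Pr[X_k \geq 1] \leq \mathbb{E}[X_k] \to 0$, which is exactly the first moment method.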
Our sloppy calculations show that the number of bins with more than $c\ln n/\ln\ln n$ balls is zero a.a.s., and hence the maximum load is sublogarithmic with high probability. The interested reader might want to show, using a variance calculation, that this bound is tight. Not bad for such a simple algorithm! If you’re feeling adventurous, you can try the exercise at the end of this blog post to see how much a simple twist can improve the algorithm. Perhaps I’ll write a bit about that in a different post.
You might be a little disappointed at how slowly the expectation goes to zero, but rest assured that this is just a symptom of our weak tools. After all, we used nothing more advanced than Markov’s inequality! It is entirely possible (and indeed a common textbook example) to derive much stronger bounds by applying Chernoff-type bounds to a series of appropriately conditioned Poisson random variables; see for example chapter 5 of the excellent book *Probability and Computing* by Mitzenmacher and Upfal.
Intuitively you could do a little better by distributing the rings in rounds: in each round you pick two elves at random and give a ring to the one who currently has fewer. Can you still compute the maximum number of rings a single elf will get?
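Before you put pen to paper, here is a small empirical teaser (again my own illustrative code, with arbitrary seed and sizes, not part of the exercise):

```python
import random

def one_choice(n: int, rng: random.Random) -> int:
    """Throw each of n rings at a uniformly random elf; return the maximum load."""
    loads = [0] * n
    for _ in range(n):
        loads[rng.randrange(n)] += 1
    return max(loads)

def two_choice(n: int, rng: random.Random) -> int:
    """Each round, pick two random elves and give the ring to the less burdened one."""
    loads = [0] * n
    for _ in range(n):
        i, j = rng.randrange(n), rng.randrange(n)
        loads[i if loads[i] <= loads[j] else j] += 1
    return max(loads)

if __name__ == "__main__":
    rng = random.Random(1)
    for n in (10**4, 10**5, 10**6):
        print(f"n={n:>7}: one choice {one_choice(n, rng)}, "
              f"two choices {two_choice(n, rng)}")
```

The improvement you will see is far more than a constant factor, which is a strong hint that the exercise needs a different idea than the union bound above.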
[^1]: Recently discussed in relation to triangles in random graphs.
[^2]: I told you the Poisson distribution is useful in this post.