# Sperner's theorem

## Statement of the theorem

Sperner's theorem as originally stated is a result about set systems. Suppose that you want to find the largest collection of sets [math]\mathcal{A}[/math] such that no set in [math]\mathcal{A}[/math] is a proper subset of any other. Then the best you can do is choose all the sets of some fixed size---and of course the best size to pick is [math]\lfloor n/2\rfloor[/math], since the binomial coefficient [math]\binom nm[/math] is maximized when [math]m=\lfloor n/2\rfloor.[/math]

Sperner's theorem is closely related to the density Hales-Jewett theorem: in fact, it is nothing other than DHJ(2) with the best possible bound. To see this, we associate each set [math]A\subset[n][/math] with its characteristic function (that is, the sequence that is 0 outside A and 1 in A). If we have a pair of sets [math]A\subset B,[/math] then the two sequences form a combinatorial line in [math][2]^n.[/math] For example, if n=6 and A and B are the sets [math]\{2,3\}[/math] and [math]\{2,3,4,6\}[/math], then we get the combinatorial line that consists of the two points 011000 and 011101, which we can denote by 011*0* (so the wildcard set is [math]\{4,6\}[/math]).

## Proof of the theorem

There are several proofs, but perhaps the most enlightening is a very simple averaging argument that proves a stronger result. Let [math]\mathcal{A}[/math] be a collection of subsets of [n]. For each k, let [math]\delta_k[/math] denote the density of [math]\mathcal{A}[/math] in the kth layer of the cube: that is, it is the number of sets in [math]\mathcal{A}[/math] of size k, divided by [math]\binom nk.[/math] The equal-slices measure of [math]\mathcal{A}[/math] is defined to be [math]\delta_0+\dots+\delta_n.[/math]

Now the equal-slices measure of [math]\mathcal{A}[/math] is easily seen to be equal to the following quantity. Let [math]\pi[/math] be a random permutation of [n], let [math]U_0,U_1,U_2\dots,U_n[/math] be the sets [math]\emptyset, \{\pi(1)\},\{\pi(1),\pi(2)\},\dots,[n],[/math] and let [math]\mu(\mathcal{A})[/math] be the expected number of the sets [math]U_i[/math] that belong to [math]\mathcal{A}.[/math] This is the same by linearity of expectation and the fact that the probability that [math]U_k[/math] belongs to [math]\mathcal{A}[/math] is [math]\delta_k.[/math]

Therefore, if the equal-slices measure of [math]\mathcal{A}[/math] is greater than 1, then the expected number of sets [math]U_k[/math] in [math]\mathcal{A}[/math] is greater than 1, so there must exist a permutation for which it is at least 2, and that gives us a pair of sets with one contained in the other.

To see that this implies Sperner's theorem, one just has to make the simple observation that a set with equal-slices measure at most 1 must have cardinality at most [math]\binom n{\lfloor n/2\rfloor}.[/math] (If n is odd, so that there are two middle layers, then it is not quite so obvious that to have an extremal set you must pick one or other of the layers, but this is the case.) This stronger version of the statement is called the *LYM inequality*

### Strong version

An alternative argument deduces the multidimensional Sperner theorem from the density Hales-Jewett theorem. We can think of [math][2]^n[/math] as [math][2^k]^{n/k}.[/math] If we do so and apply DHJ(2^k) and translate back to [math][2]^n,[/math] then we find that we have produced a k-dimensional combinatorial subspace. This is obviously a much more sophisticated proof, since DHJ(2^k) is a very hard result, but it gives more information, since the wildcard sets turn out to have the same size. A sign that this strong version is genuinely strong is that it implies Szemerédi's theorem. For instance, suppose you take as your set [math]\mathcal{A}[/math] the set of all sequences such that the number of 0s plus the number of 1s in even places plus twice the number of 1s in odd places belongs to some dense set in [math][3n].[/math] Then if you have a 2D subspace with both wildcard sets of size d, one wildcard set consisting of odd numbers and the other of even numbers (which this proof gives), then this implies that in your dense set of integers you can find four integers of the form a, a+d, a+2d, a+d+2d, which is an arithmetic progression of length 4.

One can also prove the above strong form of Sperner theorem by using the multidimensional Szemerédi theorem which has combinatorial proofs. (reference!) It states that large dense high dimensional grids contain corners. Given a dense subset of [math][2]^n,[/math] denoted by [math]\mathcal{A}[/math]. We can suppose that the elements of [math]\mathcal{A}[/math] are of size about [math]\frac{n}{2}\pm C\sqrt{n}.[/math] Take a random permutation of [n]. An element of [math]\mathcal{A}[/math] is “[math]d-[/math]nice” after the permutation if it consists of [math]d[/math] intervals, each of length between [math]\frac{n}{2d}\pm C\sqrt{n}/2,[/math] and each interval begins at position [math]id[/math] for some [math]0\leq i\lt \frac{n}{d}.[/math] (Suppose that [math]d[/math] divides [math]n[/math]) Any [math]d-[/math]nice set can be represented as a point in a [math]d-[/math]dimensional [math][C\sqrt{n}]^d[/math] cube. The sets represented by the vertices of an axis-parallel [math]d-[/math]dimensional cube in [math][C\sqrt{n}]^d[/math] form a subspace with equal sized wildcard sets. Finding a cube is clearly more difficult than finding a corner, but it's existence in dense sets also follows from the multidimensional Szemerédi theorem. All what we need is to show that the expected number of the [math]d-[/math]nice elements is [math]c\sqrt{n}^d[/math] where c only depends on the density of [math]\mathcal{A}[/math]. For a typical [math]m-[/math]element subset of [math]\mathcal{A}[/math] the probability that it is [math]d-[/math]nice after the permutation is about [math]\binom{n}{m}^{-1}\sqrt{n}^{d-1}.[/math] The sum for elements of [math]\mathcal{A}[/math] with size between [math]\frac{n}{2}\pm C\sqrt{n},[/math] gives that the expected number of the [math]d-[/math]nice elements is [math]c\sqrt{n}^d,[/math] so there is a cube if n is large enough.

## Further remarks

The k=3 generalisation of the LYM inequality is the hyper-optimistic conjecture.

Sperner's theorem is also related to the Kruskal-Katona theorem.