Sperner's theorem: Difference between revisions

From Polymath Wiki
Jump to navigationJump to search
New page: '''Sperner's theorem''': Any line-free subset of <math>[2]^n</math> has cardinality at most <math>\binom{n}{\lfloor n/2\rfloor}</math>. It implies the k=2 version of the [[density Ha...
 
Deleted incorrect multidim version
 
(16 intermediate revisions by 6 users not shown)
Line 1: Line 1:
'''Sperner's theorem''': Any [[line-free]] subset of <math>[2]^n</math> has cardinality at most <math>\binom{n}{\lfloor n/2\rfloor}</math>.
==Statement of the theorem==


It implies the k=2 version of the [[density Hales-Jewett theorem]].
Sperner's theorem as originally stated is a result about set systems. Suppose that you want to find the largest collection of sets <math>\mathcal{A}</math> such that no set in <math>\mathcal{A}</math> is a proper subset of any other. Then the best you can do is choose all the sets of some fixed size---and of course the best size to pick is <math>\lfloor n/2\rfloor</math>, since the binomial coefficient <math>\binom nm</math> is maximized when <math>m=\lfloor n/2\rfloor.</math>


A stronger version of Sperner's theorem is  
Sperner's theorem is closely related to the density Hales-Jewett theorem: in fact, it is nothing other than DHJ(2) with the best possible bound. To see this, we associate each set <math>A\subset[n]</math> with its characteristic function (that is, the sequence that is 0 outside A and 1 in A). If we have a pair of sets <math>A\subset B,</math> then the two sequences form a [[combinatorial line]] in <math>[2]^n.</math> For example, if n=6 and A and B are the sets <math>\{2,3\}</math> and <math>\{2,3,4,6\}</math>, then we get the combinatorial line that consists of the two points 011000 and 011101, which we can denote by 011*0* (so the wildcard set is <math>\{4,6\}</math>).


'''LYM inequality''': Any [[line-free]] subset of <math>[2]^n</math> has [[equal-slices measure]] at most 1.
==Proof of the theorem==


The k=3 generalisation of this inequality is the [[hyper-optimistic conjecture]].
There are several proofs, but perhaps the most enlightening is a very simple averaging argument that proves a stronger result. Let <math>\mathcal{A}</math> be a collection of subsets of [n]. For each k, let <math>\delta_k</math> denote the density of <math>\mathcal{A}</math> in the kth [[slice|layer]] of the cube: that is, it is the number of sets in <math>\mathcal{A}</math> of size k, divided by <math>\binom nk.</math> The [[equal-slices measure]] of <math>\mathcal{A}</math> is defined to be <math>\delta_0+\dots+\delta_n.</math>


The LYM inequality can be proven as follows.  Randomly shuffle the n indices and then consider the intersection of A with the strings <math>0^i 1^{n-i}</math> for <math>i=0,\ldots,n</math>.  As A is line-free, at most one of these strings lie in A. Averaging over all choice of shuffles we obtain the claim.
Now the equal-slices measure of <math>\mathcal{A}</math> is easily seen to be equal to the following quantity. Let <math>\pi</math> be a random permutation of [n], let <math>U_0,U_1,U_2\dots,U_n</math> be the sets <math>\emptyset, \{\pi(1)\},\{\pi(1),\pi(2)\},\dots,[n],</math> and let <math>\mu(\mathcal{A})</math> be the expected number of the sets <math>U_i</math> that belong to <math>\mathcal{A}.</math> This is the same by linearity of expectation and the fact that the probability that <math>U_k</math> belongs to <math>\mathcal{A}</math> is <math>\delta_k.</math>


* [[http://en.wikipedia.org/wiki/Sperner%27s_theorem The Wikipedia entry for Sperner's theorem]]
Therefore, if the equal-slices measure of <math>\mathcal{A}</math> is greater than 1, then the expected number of sets <math>U_k</math> in <math>\mathcal{A}</math> is greater than 1, so there must exist a permutation for which it is at least 2, and that gives us a pair of sets with one contained in the other.
* [[http://en.wikipedia.org/wiki/Lubell-Yamamoto-Meshalkin_inequality The Wikipedia entry for the LYM inequality]]
 
To see that this implies Sperner's theorem, one just has to make the simple observation that a set with equal-slices measure at most 1 must have cardinality at most <math>\binom n{\lfloor n/2\rfloor}.</math> (If n is odd, so that there are two middle layers, then it is not quite so obvious that to have an extremal set you must pick one or other of the layers, but this is the case.) This stronger version of the statement is called the <em>LYM inequality</em>
 
 
 
===Strong version===
 
An alternative argument deduces the multidimensional Sperner theorem from the density Hales-Jewett theorem. We can think of <math>[2]^n</math> as <math>[2^k]^{n/k}.</math> If we do so and apply DHJ(2^k) and translate back to <math>[2]^n,</math> then we find that we have produced a k-dimensional combinatorial subspace. This is obviously a much more sophisticated proof, since DHJ(2^k) is a very hard result, but it gives more information, since the wildcard sets turn out to have the same size. A sign that this strong version is genuinely strong is that it implies Szemer&eacute;di's theorem. For instance, suppose you take as your set <math>\mathcal{A}</math> the set of all sequences such that the number of 0s plus the number of 1s in even places plus twice the number of 1s in odd places belongs to some dense set in <math>[3n].</math> Then if you have a 2D subspace with both wildcard sets of size d, one wildcard set consisting of odd numbers and the other of even numbers (which this proof gives), then this implies that in your dense set of integers you can find four integers of the form a, a+d, a+2d, a+d+2d, which is an arithmetic progression of length 4.
 
One can also prove the above strong form of Sperner theorem  by using the multidimensional Szemer&eacute;di theorem which has combinatorial proofs. (reference!) It states that large dense high dimensional grids contain [[corners]]. Given a dense subset of <math>[2]^n,</math> denoted by <math>\mathcal{A}</math>. We can suppose that the elements of <math>\mathcal{A}</math> are of size about <math>\frac{n}{2}\pm C\sqrt{n}.</math> Take a random permutation of [n]. An element of <math>\mathcal{A}</math> is “<math>d-</math>nice” after the permutation if it consists of <math>d</math> intervals, each of length between <math>\frac{n}{2d}\pm C\sqrt{n}/2,</math> and each interval begins at position <math>id</math> for some <math>0\leq i< \frac{n}{d}.</math> (Suppose that <math>d</math> divides <math>n</math>) Any <math>d-</math>nice set can be represented as a point in a <math>d-</math>dimensional <math>[C\sqrt{n}]^d</math> cube. The sets represented by the vertices of an axis-parallel <math>d-</math>dimensional cube in <math>[C\sqrt{n}]^d</math> form a subspace with equal sized wildcard sets. Finding a cube is clearly more difficult than finding a corner, but it's existence in dense sets also follows from the multidimensional Szemer&eacute;di theorem. All what we need is to show that the expected number of the <math>d-</math>nice elements is <math>c\sqrt{n}^d</math> where c only depends on the density of <math>\mathcal{A}</math>. For a typical <math>m-</math>element subset of <math>\mathcal{A}</math> the probability that it is <math>d-</math>nice after the permutation is about <math>\binom{n}{m}^{-1}\sqrt{n}^{d-1}.</math> The sum for elements of <math>\mathcal{A}</math> with size between <math>\frac{n}{2}\pm C\sqrt{n},</math> gives that the expected number of the <math>d-</math>nice elements is <math>c\sqrt{n}^d,</math> so there is a cube if n is large enough.
 
==Further remarks==
 
The k=3 generalisation of the LYM inequality is the [[hyper-optimistic conjecture]].
 
Sperner's theorem is also related to the [[Kruskal-Katona theorem]].
 
* [http://en.wikipedia.org/wiki/Sperner%27s_theorem The Wikipedia entry for Sperner's theorem]
* [http://en.wikipedia.org/wiki/Lubell-Yamamoto-Meshalkin_inequality The Wikipedia entry for the LYM inequality]

Latest revision as of 05:15, 17 March 2009

Statement of the theorem

Sperner's theorem as originally stated is a result about set systems. Suppose that you want to find the largest collection of sets [math]\displaystyle{ \mathcal{A} }[/math] such that no set in [math]\displaystyle{ \mathcal{A} }[/math] is a proper subset of any other. Then the best you can do is choose all the sets of some fixed size---and of course the best size to pick is [math]\displaystyle{ \lfloor n/2\rfloor }[/math], since the binomial coefficient [math]\displaystyle{ \binom nm }[/math] is maximized when [math]\displaystyle{ m=\lfloor n/2\rfloor. }[/math]

Sperner's theorem is closely related to the density Hales-Jewett theorem: in fact, it is nothing other than DHJ(2) with the best possible bound. To see this, we associate each set [math]\displaystyle{ A\subset[n] }[/math] with its characteristic function (that is, the sequence that is 0 outside A and 1 in A). If we have a pair of sets [math]\displaystyle{ A\subset B, }[/math] then the two sequences form a combinatorial line in [math]\displaystyle{ [2]^n. }[/math] For example, if n=6 and A and B are the sets [math]\displaystyle{ \{2,3\} }[/math] and [math]\displaystyle{ \{2,3,4,6\} }[/math], then we get the combinatorial line that consists of the two points 011000 and 011101, which we can denote by 011*0* (so the wildcard set is [math]\displaystyle{ \{4,6\} }[/math]).

Proof of the theorem

There are several proofs, but perhaps the most enlightening is a very simple averaging argument that proves a stronger result. Let [math]\displaystyle{ \mathcal{A} }[/math] be a collection of subsets of [n]. For each k, let [math]\displaystyle{ \delta_k }[/math] denote the density of [math]\displaystyle{ \mathcal{A} }[/math] in the kth layer of the cube: that is, it is the number of sets in [math]\displaystyle{ \mathcal{A} }[/math] of size k, divided by [math]\displaystyle{ \binom nk. }[/math] The equal-slices measure of [math]\displaystyle{ \mathcal{A} }[/math] is defined to be [math]\displaystyle{ \delta_0+\dots+\delta_n. }[/math]

Now the equal-slices measure of [math]\displaystyle{ \mathcal{A} }[/math] is easily seen to be equal to the following quantity. Let [math]\displaystyle{ \pi }[/math] be a random permutation of [n], let [math]\displaystyle{ U_0,U_1,U_2\dots,U_n }[/math] be the sets [math]\displaystyle{ \emptyset, \{\pi(1)\},\{\pi(1),\pi(2)\},\dots,[n], }[/math] and let [math]\displaystyle{ \mu(\mathcal{A}) }[/math] be the expected number of the sets [math]\displaystyle{ U_i }[/math] that belong to [math]\displaystyle{ \mathcal{A}. }[/math] This is the same by linearity of expectation and the fact that the probability that [math]\displaystyle{ U_k }[/math] belongs to [math]\displaystyle{ \mathcal{A} }[/math] is [math]\displaystyle{ \delta_k. }[/math]

Therefore, if the equal-slices measure of [math]\displaystyle{ \mathcal{A} }[/math] is greater than 1, then the expected number of sets [math]\displaystyle{ U_k }[/math] in [math]\displaystyle{ \mathcal{A} }[/math] is greater than 1, so there must exist a permutation for which it is at least 2, and that gives us a pair of sets with one contained in the other.

To see that this implies Sperner's theorem, one just has to make the simple observation that a set with equal-slices measure at most 1 must have cardinality at most [math]\displaystyle{ \binom n{\lfloor n/2\rfloor}. }[/math] (If n is odd, so that there are two middle layers, then it is not quite so obvious that to have an extremal set you must pick one or other of the layers, but this is the case.) This stronger version of the statement is called the LYM inequality


Strong version

An alternative argument deduces the multidimensional Sperner theorem from the density Hales-Jewett theorem. We can think of [math]\displaystyle{ [2]^n }[/math] as [math]\displaystyle{ [2^k]^{n/k}. }[/math] If we do so and apply DHJ(2^k) and translate back to [math]\displaystyle{ [2]^n, }[/math] then we find that we have produced a k-dimensional combinatorial subspace. This is obviously a much more sophisticated proof, since DHJ(2^k) is a very hard result, but it gives more information, since the wildcard sets turn out to have the same size. A sign that this strong version is genuinely strong is that it implies Szemerédi's theorem. For instance, suppose you take as your set [math]\displaystyle{ \mathcal{A} }[/math] the set of all sequences such that the number of 0s plus the number of 1s in even places plus twice the number of 1s in odd places belongs to some dense set in [math]\displaystyle{ [3n]. }[/math] Then if you have a 2D subspace with both wildcard sets of size d, one wildcard set consisting of odd numbers and the other of even numbers (which this proof gives), then this implies that in your dense set of integers you can find four integers of the form a, a+d, a+2d, a+d+2d, which is an arithmetic progression of length 4.

One can also prove the above strong form of Sperner theorem by using the multidimensional Szemerédi theorem which has combinatorial proofs. (reference!) It states that large dense high dimensional grids contain corners. Given a dense subset of [math]\displaystyle{ [2]^n, }[/math] denoted by [math]\displaystyle{ \mathcal{A} }[/math]. We can suppose that the elements of [math]\displaystyle{ \mathcal{A} }[/math] are of size about [math]\displaystyle{ \frac{n}{2}\pm C\sqrt{n}. }[/math] Take a random permutation of [n]. An element of [math]\displaystyle{ \mathcal{A} }[/math] is “[math]\displaystyle{ d- }[/math]nice” after the permutation if it consists of [math]\displaystyle{ d }[/math] intervals, each of length between [math]\displaystyle{ \frac{n}{2d}\pm C\sqrt{n}/2, }[/math] and each interval begins at position [math]\displaystyle{ id }[/math] for some [math]\displaystyle{ 0\leq i\lt \frac{n}{d}. }[/math] (Suppose that [math]\displaystyle{ d }[/math] divides [math]\displaystyle{ n }[/math]) Any [math]\displaystyle{ d- }[/math]nice set can be represented as a point in a [math]\displaystyle{ d- }[/math]dimensional [math]\displaystyle{ [C\sqrt{n}]^d }[/math] cube. The sets represented by the vertices of an axis-parallel [math]\displaystyle{ d- }[/math]dimensional cube in [math]\displaystyle{ [C\sqrt{n}]^d }[/math] form a subspace with equal sized wildcard sets. Finding a cube is clearly more difficult than finding a corner, but it's existence in dense sets also follows from the multidimensional Szemerédi theorem. All what we need is to show that the expected number of the [math]\displaystyle{ d- }[/math]nice elements is [math]\displaystyle{ c\sqrt{n}^d }[/math] where c only depends on the density of [math]\displaystyle{ \mathcal{A} }[/math]. For a typical [math]\displaystyle{ m- }[/math]element subset of [math]\displaystyle{ \mathcal{A} }[/math] the probability that it is [math]\displaystyle{ d- }[/math]nice after the permutation is about [math]\displaystyle{ \binom{n}{m}^{-1}\sqrt{n}^{d-1}. }[/math] The sum for elements of [math]\displaystyle{ \mathcal{A} }[/math] with size between [math]\displaystyle{ \frac{n}{2}\pm C\sqrt{n}, }[/math] gives that the expected number of the [math]\displaystyle{ d- }[/math]nice elements is [math]\displaystyle{ c\sqrt{n}^d, }[/math] so there is a cube if n is large enough.

Further remarks

The k=3 generalisation of the LYM inequality is the hyper-optimistic conjecture.

Sperner's theorem is also related to the Kruskal-Katona theorem.