<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://michaelnielsen.org/polymath/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Olof</id>
	<title>Polymath Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://michaelnielsen.org/polymath/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Olof"/>
	<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Special:Contributions/Olof"/>
	<updated>2026-04-07T18:10:15Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.42.3</generator>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4037</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4037"/>
		<updated>2011-02-08T04:13:15Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* The nd-estimate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on a translate of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In other words, if the large spectrum of a set is somewhat concentrated in some subspace then one can find a density increment on a translate of the annihilator of that subspace.&lt;br /&gt;
&lt;br /&gt;
==Proof==&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the scaled indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case of the conclusion, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate. &lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument (&amp;lt;math&amp;gt;\delta \sim \alpha \sim 1/n^{1+\epsilon}&amp;lt;/math&amp;gt;). Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper. (Note: for this type of conclusion to hold we need to know that the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; is quite large, which follows from the assumption that we may make in the &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; context that &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has no particularly large non-trivial Fourier coefficients.)&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4036</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4036"/>
		<updated>2011-02-08T04:12:45Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* The nd-estimate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on a translate of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In other words, if the large spectrum of a set is somewhat concentrated in some subspace then one can find a density increment on the annihilator of that subspace.&lt;br /&gt;
&lt;br /&gt;
==Proof==&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the scaled indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case of the conclusion, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate. &lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument (&amp;lt;math&amp;gt;\delta \sim \alpha \sim 1/n^{1+\epsilon}&amp;lt;/math&amp;gt;). Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper. (Note: for this type of conclusion to hold we need to know that the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; is quite large, which follows from the assumption that we may make in the &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; context that &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has no particularly large non-trivial Fourier coefficients.)&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4035</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4035"/>
		<updated>2011-02-08T03:53:58Z</updated>

		<summary type="html">&lt;p&gt;Olof: Clarifying certain points&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
In other words, if the large spectrum of a set is somewhat concentrated in some subspace then one can find a density increment on the annihilator of that subspace.&lt;br /&gt;
&lt;br /&gt;
==Proof==&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the scaled indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case of the conclusion, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate. &lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument (&amp;lt;math&amp;gt;\delta \sim \alpha \sim 1/n^{1+\epsilon}&amp;lt;/math&amp;gt;). Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper. (Note: for this type of conclusion to hold we need to know that the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; is quite large, which follows from the assumption that we may make in the &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; context that &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has no particularly large non-trivial Fourier coefficients.)&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4018</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4018"/>
		<updated>2011-02-07T03:52:55Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* The nd-estimate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the scaled indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case of the conclusion, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate.&lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument. (Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.)&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper.&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4017</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4017"/>
		<updated>2011-02-07T03:52:08Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* The nd-estimate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the scaled indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate.&lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument. (Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.)&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper.&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4015</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4015"/>
		<updated>2011-02-07T00:24:19Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Comparison with other results about the large spectrum of a set */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Let &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate.&lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in the large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large; formally, the largest independent set has size at most &amp;lt;math&amp;gt;C\log(\alpha^{-1}) \delta^{-2}&amp;lt;/math&amp;gt;. Unfortunately this statement becomes trivial with the parameters needed for the Bateman-Katz argument. (Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.)&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper.&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Section_3:_the_%22nd-estimate%22&amp;diff=4010</id>
		<title>Section 3: the &quot;nd-estimate&quot;</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Section_3:_the_%22nd-estimate%22&amp;diff=4010"/>
		<updated>2011-02-06T20:26:08Z</updated>

		<summary type="html">&lt;p&gt;Olof: Section 3: the &amp;quot;nd-estimate&amp;quot; moved to BK:Section 3: Easier to refer to&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#REDIRECT [[BK:Section 3]]&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4009</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4009"/>
		<updated>2011-02-06T20:26:08Z</updated>

		<summary type="html">&lt;p&gt;Olof: Section 3: the &amp;quot;nd-estimate&amp;quot; moved to BK:Section 3: Easier to refer to&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Suppose &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate.&lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large. Unfortunately, with the parameters needed for the Bateman-Katz paper, Chang&#039;s theorem reduces to a trivial statement. (Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.)&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper.&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=4008</id>
		<title>Improving the bounds for Roth&#039;s theorem</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=4008"/>
		<updated>2011-02-06T20:25:33Z</updated>

		<summary type="html">&lt;p&gt;Olof: Updating link names&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is a wiki for Polymath6, a project that aims to look at recent results of Bateman and Katz and of Sanders, and attempt to combine the ideas that go into them in order to break the log N-barrier in Roth&#039;s theorem. I hope that soon there will be expositions of their proofs, as well as of much of the background material needed to understand them. The wiki has only just been started, so has very little material on it so far.&lt;br /&gt;
&lt;br /&gt;
==Background material==&lt;br /&gt;
&lt;br /&gt;
[[Sketch of the Roth/Meshulam argument for cap sets]]&lt;br /&gt;
&lt;br /&gt;
[[Basic facts about Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
[[Sketch of Bourgain&#039;s first argument using Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
==The Bateman-Katz argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 3|Section 3: the &amp;quot;nd-estimate&amp;quot;]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 4|Section 4: the large spectrum contains many additive quadruples]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 5|Section 5: the large spectrum of a non-incrementing set is not additively smoothing]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 6|Section 6: the structure of additively non-smoothing sets]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 7|Section 7: the structure of the large spectrum of a non-incrementing cap set]]&lt;br /&gt;
&lt;br /&gt;
[[BK:Section 8|Section 8: using the structure of the large spectrum to obtain density increments]]&lt;br /&gt;
&lt;br /&gt;
==Sanders&#039;s argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[The Croot-Sisask lemma]]&lt;br /&gt;
&lt;br /&gt;
[[The Katz-Koester lemma]]&lt;br /&gt;
&lt;br /&gt;
[[Sanders&#039;s local version of Chang&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
==Annotated bibliography==&lt;br /&gt;
&lt;br /&gt;
*Michael Bateman and Nets Katz, [http://arxiv.org/pdf/1101.5851v1 New bounds on cap sets]. The paper that obtains an improvement to the Roth/Meshulam bound for cap sets.&lt;br /&gt;
&lt;br /&gt;
*Tom Sanders, [http://arxiv.org/abs/1011.0104  On Roth&#039;s theorem on progressions]. The paper that gets within loglog factors of the 1/logN density barrier in Roth&#039;s theorem.&lt;br /&gt;
&lt;br /&gt;
*Ernie Croot and Olof Sisask, [http://arxiv.org/abs/1003.2978  A probabilistic technique for finding almost-periods of convolutions]. The paper that introduced a technique that is central to Sanders&#039;s proof.&lt;br /&gt;
&lt;br /&gt;
*Nets Katz and Paul Koester, [http://arxiv.org/abs/0802.4371  On additive doubling and energy], Another paper that introduced a method that Sanders used and that is related to some of the ideas in the Bateman-Katz paper as well.&lt;br /&gt;
&lt;br /&gt;
==Links to more informal discussions of the problem==&lt;br /&gt;
&lt;br /&gt;
*[http://polymathprojects.org/  Blog post by Tom Sanders about the ideas and difficulties involved] &lt;br /&gt;
&lt;br /&gt;
*[http://polymathprojects.files.wordpress.com/2011/02/polymath-3.pdf  The way an argument might go], A document written by Nets Katz, summarizing discussions he had with Olof Sisask about the possibilities for combining the Bateman-Katz and Sanders papers.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4007</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4007"/>
		<updated>2011-02-06T20:22:29Z</updated>

		<summary type="html">&lt;p&gt;Olof: Added relation to the Parseval bound, Chang&amp;#039;s theorem and a few words on the significance. More work needed.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
==The nd-estimate==&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:Suppose &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
:* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
:* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
==Comparison with other results about the large spectrum of a set==&lt;br /&gt;
The main ingredient in deriving the nd-estimate is Parseval&#039;s identity. This identity also has the following useful consequence: letting &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; be as above, we have&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \delta^2 \alpha^2 \leq \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{1_A}(\gamma)|^2 = \mathbb{E}_x 1_A(x)^2 = \alpha&amp;lt;/math&amp;gt;,&lt;br /&gt;
whence&lt;br /&gt;
:&amp;lt;math&amp;gt;|\Delta| \leq \alpha^{-1} \delta^{-2}&amp;lt;/math&amp;gt;,&lt;br /&gt;
which should be compared to the bound on &amp;lt;math&amp;gt;| \Delta \cap V^{\perp} |&amp;lt;/math&amp;gt; given by the nd-estimate.&lt;br /&gt;
&lt;br /&gt;
There is another useful result about the large spectrum of a set known as Chang&#039;s theorem. Informally, this says that the largest size of a linearly independent set in large spectrum &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; cannot be too large. Unfortunately, with the parameters needed for the Bateman-Katz paper, Chang&#039;s theorem reduces to a trivial statement. (Nevertheless, there is [http://arxiv.org/abs/math/0605689 a generalization of Chang&#039;s theorem due to Shkredov] that gives a lower bound for the number of additive &amp;lt;math&amp;gt;(2m)&amp;lt;/math&amp;gt;-tuples in the large spectrum of a set, which is used in [[BK:Section 4|Section 4]] of the Bateman-Katz paper.)&lt;br /&gt;
&lt;br /&gt;
By contrast, the nd-estimate is something like a statement in the opposite direction: it says that there are quite a lot of linearly independent characters in &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, or else there is a density increment. Specifically, if we have picked &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt; from &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt;, then&lt;br /&gt;
:&amp;lt;math&amp;gt;| \Delta \cap \langle \gamma_1, \ldots, \gamma_d \rangle | \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;&lt;br /&gt;
unless we get a density increment on a (particular) subspace of codimension at most &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;.&lt;br /&gt;
For suitable parameter choices, this says that there are a lot of characters in the large spectrum that are linearly independent of &amp;lt;math&amp;gt;\gamma_1, \ldots, \gamma_d&amp;lt;/math&amp;gt;, which is very important in [[BK:Section 5|Section 5]] of the paper.&lt;br /&gt;
&lt;br /&gt;
==Relation to Lemma 2.8 in Sanders&#039;s paper==&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4006</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=4006"/>
		<updated>2011-02-06T19:48:32Z</updated>

		<summary type="html">&lt;p&gt;Olof: Rephrasing statement and shortening proof to illustrate link with Sanders&amp;#039;s Lemma 2.8 more clearly&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Parent page: [[Improving the bounds for Roth&#039;s theorem]]&lt;br /&gt;
&lt;br /&gt;
One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, an important part of which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points. We shall say later on why this is significant.&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results. For a subspace &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; we write &lt;br /&gt;
:&amp;lt;math&amp;gt;V^{\perp} = \{ \gamma \in \widehat{\mathbb{F}_3^n} : \gamma(x) = 1 \ \forall x \in V \}&amp;lt;/math&amp;gt;&lt;br /&gt;
for its annihilator (cf. [[Basic facts about Bohr sets|the section on Bohr sets]]).&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A \subset \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a set with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;0 \leq \delta, \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &lt;br /&gt;
:&amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{ 0_{\widehat{\mathbb{F}_3^n}} \}&amp;lt;/math&amp;gt;.&lt;br /&gt;
Suppose &amp;lt;math&amp;gt;V \leq \mathbb{F}_3^n&amp;lt;/math&amp;gt; be a subspace. Then&lt;br /&gt;
* either &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt; on &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt;,&lt;br /&gt;
* or &amp;lt;math&amp;gt;|\Delta \cap V^{\perp}| \leq 3\eta \delta^{-2}&amp;lt;/math&amp;gt;; in fact &amp;lt;math&amp;gt;\sum_{\gamma \in V^{\perp}} |\widehat{(1_A - \alpha)}(\gamma)|^2 \leq 3\eta \alpha^2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&#039;&#039;&#039;Proof&#039;&#039;&#039;:&lt;br /&gt;
Let us write &amp;lt;math&amp;gt;\mu_V = \frac{|\mathbb{F}_3^n|}{|V|}1_V&amp;lt;/math&amp;gt; for the indicator function of &amp;lt;math&amp;gt;V&amp;lt;/math&amp;gt; normalized so that &amp;lt;math&amp;gt;\mathbb{E}_x \mu_V(x) = 1&amp;lt;/math&amp;gt;. If &lt;br /&gt;
:&amp;lt;math&amp;gt;1_A*\mu_V(x) &amp;gt; \alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
for some &amp;lt;math&amp;gt;x \in \mathbb{F}_3^n&amp;lt;/math&amp;gt; then we are in the first case, so let us assume that &amp;lt;math&amp;gt;1_A*\mu_V \leq \alpha(1+\eta)&amp;lt;/math&amp;gt;. Write &amp;lt;math&amp;gt;f = 1_A - \alpha&amp;lt;/math&amp;gt; for the balanced function of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
:&amp;lt;math&amp;gt; | \Delta \cap V^{\perp} | \delta^2 \alpha^2 \leq \sum_{\gamma \in V^{\perp}} |\widehat{f}(\gamma)|^2 = \sum_{\gamma \in \widehat{\mathbb{F}_3^n}} |\widehat{f}(\gamma)|^2 |\widehat{\mu_V}(\gamma)|^2.&amp;lt;/math&amp;gt;&lt;br /&gt;
By Parseval&#039;s identity, this equals&lt;br /&gt;
:&amp;lt;math&amp;gt; \mathbb{E}_{x \in \mathbb{F}_3^n} f*\mu_V(x)^2 = \mathbb{E}_{x \in \mathbb{F}_3^n} 1_A*\mu_V(x)^2 - \alpha^2 \leq \alpha^2(2\eta + \eta^2),&amp;lt;/math&amp;gt;&lt;br /&gt;
which proves the result.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To be added:&lt;br /&gt;
* Statement of size bound on &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; from Parseval alone&lt;br /&gt;
* Statement of Chang&#039;s theorem&lt;br /&gt;
* Relation to Lemma 2.8 in Sanders&#039;s paper&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=3978</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=3978"/>
		<updated>2011-02-06T02:19:14Z</updated>

		<summary type="html">&lt;p&gt;Olof: Typo correction&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points.&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results.&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; be a subset of &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;\delta &amp;gt; 0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;0 \leq \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{0\}&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
# either there is a subspace of &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt; on which &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
# or &amp;lt;math&amp;gt;|\Delta \cap W| \leq \eta \delta^{-2}&amp;lt;/math&amp;gt; for each &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace &amp;lt;math&amp;gt;W \leq \widehat{\mathbb{F}_3^n}&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To be added:&lt;br /&gt;
* Proof&lt;br /&gt;
* Statement of size bound on &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; from Parseval alone&lt;br /&gt;
* Statement of Chang&#039;s theorem&lt;br /&gt;
* Relation to Lemma 2.8 in Sanders&#039;s paper&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=3977</id>
		<title>BK:Section 3</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=BK:Section_3&amp;diff=3977"/>
		<updated>2011-02-06T02:15:09Z</updated>

		<summary type="html">&lt;p&gt;Olof: Very rough initial sketch&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;One of the take-away results from Section 3 of the Bateman-Katz paper is Proposition 3.1, which is in some places referred to as the &amp;quot;nd-estimate&amp;quot;. The rough reason for this terminology is that it says that a set &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; in &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of density about &amp;lt;math&amp;gt;1/n&amp;lt;/math&amp;gt; either has a `good&#039; density increment on a subspace of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;, or else the &amp;lt;math&amp;gt;(1/n)&amp;lt;/math&amp;gt;-large spectrum of &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; intersects any &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace in at most about &amp;lt;math&amp;gt;nd&amp;lt;/math&amp;gt; points.&lt;br /&gt;
&lt;br /&gt;
Here is the precise result, stated in slightly different terms to the paper in order to illustrate how it relates to other results.&lt;br /&gt;
&lt;br /&gt;
:&#039;&#039;&#039;Proposition 1&#039;&#039;&#039; Let &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; be a subset of &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; with density &amp;lt;math&amp;gt;\alpha&amp;lt;/math&amp;gt;, and let &amp;lt;math&amp;gt;\delta &amp;gt; 0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;0 \leq \eta \leq 1&amp;lt;/math&amp;gt; be parameters. Set &amp;lt;math&amp;gt;\Delta = \{ \gamma \in \widehat{G} : | \widehat{1_A}(\gamma) | \geq \delta \alpha \} \setminus \{0\}&amp;lt;/math&amp;gt;. Then&lt;br /&gt;
# either there is a subspace of &amp;lt;math&amp;gt;\mathbb{F}_3^n&amp;lt;/math&amp;gt; of codimension &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt; on which &amp;lt;math&amp;gt;A&amp;lt;/math&amp;gt; has density at least &amp;lt;math&amp;gt;\alpha(1 + \eta)&amp;lt;/math&amp;gt;&lt;br /&gt;
# or &amp;lt;math&amp;gt;|\Delta \cap W| \leq \eta \alpha^{-1}&amp;lt;/math&amp;gt; for each &amp;lt;math&amp;gt;d&amp;lt;/math&amp;gt;-dimensional subspace &amp;lt;math&amp;gt;W \leq \widehat{\mathbb{F}_3^n}&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To be added:&lt;br /&gt;
* Proof&lt;br /&gt;
* Statement of size bound on &amp;lt;math&amp;gt;\Delta&amp;lt;/math&amp;gt; from Parseval alone&lt;br /&gt;
* Statement of Chang&#039;s theorem&lt;br /&gt;
* Relation to Lemma 2.8 in Sanders&#039;s paper&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3975</id>
		<title>Improving the bounds for Roth&#039;s theorem</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3975"/>
		<updated>2011-02-06T01:24:58Z</updated>

		<summary type="html">&lt;p&gt;Olof: Changing N to n in discussing the BK argument in order to avoid future confusion&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is a wiki for Polymath6, a project that aims to look at recent results of Bateman and Katz and of Sanders, and attempt to combine the ideas that go into them in order to break the log N-barrier in Roth&#039;s theorem. I hope that soon there will be expositions of their proofs, as well as of much of the background material needed to understand them. The wiki has only just been started, so has very little material on it so far.&lt;br /&gt;
&lt;br /&gt;
==Background material==&lt;br /&gt;
&lt;br /&gt;
[[Sketch of the Roth/Meshulam argument for cap sets]]&lt;br /&gt;
&lt;br /&gt;
[[Basic facts about Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
[[Sketch of Bourgain&#039;s first argument using Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==The Bateman-Katz argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[Section 3: the &amp;quot;nd-estimate&amp;quot;]]&lt;br /&gt;
&lt;br /&gt;
[[Section 4: the large spectrum contains many additive quadruples]]&lt;br /&gt;
&lt;br /&gt;
[[Section 5: the large spectrum of a non-incrementing set is not additively smoothing]]&lt;br /&gt;
&lt;br /&gt;
[[Section 6: the structure of additively non-smoothing sets]]&lt;br /&gt;
&lt;br /&gt;
[[Section 7: the structure of the large spectrum of a non-incrementing cap set]]&lt;br /&gt;
&lt;br /&gt;
[[Section 8: using the structure of the large spectrum to obtain density increments]]&lt;br /&gt;
&lt;br /&gt;
==Annotated bibliography==&lt;br /&gt;
&lt;br /&gt;
*Michael Bateman and Nets Katz, [http://arxiv.org/pdf/1101.5851v1 New bounds on cap sets]. The paper that obtains an improvement to the Roth/Meshulam bound for cap sets.&lt;br /&gt;
&lt;br /&gt;
*Tom Sanders, [http://arxiv.org/abs/1011.0104  On Roth&#039;s theorem on progressions]. The paper that gets within loglog factors of the 1/logN density barrier in Roth&#039;s theorem.&lt;br /&gt;
&lt;br /&gt;
*Ernie Croot and Olof Sisask, [http://arxiv.org/abs/1003.2978  A probabilistic technique for finding almost-periods of convolutions]. The paper that introduced a technique that is central to Sanders&#039;s proof.&lt;br /&gt;
&lt;br /&gt;
*Nets Katz and Paul Koester, [http://arxiv.org/abs/0802.4371  On additive doubling and energy], Another paper that introduced a method that Sanders used and that is related to some of the ideas in the Bateman-Katz paper as well.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3974</id>
		<title>Improving the bounds for Roth&#039;s theorem</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3974"/>
		<updated>2011-02-06T01:18:13Z</updated>

		<summary type="html">&lt;p&gt;Olof: Undo revision 3973 by Olof (Talk)&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is a wiki for Polymath6, a project that aims to look at recent results of Bateman and Katz and of Sanders, and attempt to combine the ideas that go into them in order to break the log N-barrier in Roth&#039;s theorem. I hope that soon there will be expositions of their proofs, as well as of much of the background material needed to understand them. The wiki has only just been started, so has very little material on it so far.&lt;br /&gt;
&lt;br /&gt;
==Background material==&lt;br /&gt;
&lt;br /&gt;
[[Sketch of the Roth/Meshulam argument for cap sets]]&lt;br /&gt;
&lt;br /&gt;
[[Basic facts about Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
[[Sketch of Bourgain&#039;s first argument using Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==The Bateman-Katz argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[Section 3: the &amp;quot;Nd-estimate&amp;quot;]]&lt;br /&gt;
&lt;br /&gt;
[[Section 4: the large spectrum contains many additive quadruples]]&lt;br /&gt;
&lt;br /&gt;
[[Section 5: the large spectrum of a non-incrementing set is not additively smoothing]]&lt;br /&gt;
&lt;br /&gt;
[[Section 6: the structure of additively non-smoothing sets]]&lt;br /&gt;
&lt;br /&gt;
[[Section 7: the structure of the large spectrum of a non-incrementing cap set]]&lt;br /&gt;
&lt;br /&gt;
[[Section 8: using the structure of the large spectrum to obtain density increments]]&lt;br /&gt;
&lt;br /&gt;
==Annotated bibliography==&lt;br /&gt;
&lt;br /&gt;
*Michael Bateman and Nets Katz, [http://arxiv.org/pdf/1101.5851v1 New bounds on cap sets]. The paper that obtains an improvement to the Roth/Meshulam bound for cap sets.&lt;br /&gt;
&lt;br /&gt;
*Tom Sanders, [http://arxiv.org/abs/1011.0104  On Roth&#039;s theorem on progressions]. The paper that gets within loglog factors of the 1/logN density barrier in Roth&#039;s theorem.&lt;br /&gt;
&lt;br /&gt;
*Ernie Croot and Olof Sisask, [http://arxiv.org/abs/1003.2978  A probabilistic technique for finding almost-periods of convolutions]. The paper that introduced a technique that is central to Sanders&#039;s proof.&lt;br /&gt;
&lt;br /&gt;
*Nets Katz and Paul Koester, [http://arxiv.org/abs/0802.4371  On additive doubling and energy], Another paper that introduced a method that Sanders used and that is related to some of the ideas in the Bateman-Katz paper as well.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3973</id>
		<title>Improving the bounds for Roth&#039;s theorem</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3973"/>
		<updated>2011-02-06T01:13:18Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* The Bateman-Katz argument */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is a wiki for Polymath6, a project that aims to look at recent results of Bateman and Katz and of Sanders, and attempt to combine the ideas that go into them in order to break the log N-barrier in Roth&#039;s theorem. I hope that soon there will be expositions of their proofs, as well as of much of the background material needed to understand them. The wiki has only just been started, so has very little material on it so far.&lt;br /&gt;
&lt;br /&gt;
==Background material==&lt;br /&gt;
&lt;br /&gt;
[[Sketch of the Roth/Meshulam argument for cap sets]]&lt;br /&gt;
&lt;br /&gt;
[[Basic facts about Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
[[Sketch of Bourgain&#039;s first argument using Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==The Bateman-Katz argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[Section 3: the &amp;quot;Nd-estimate&amp;quot;| the &amp;quot;Nd-estimate&amp;quot;]]&lt;br /&gt;
&lt;br /&gt;
[[Section 4: the large spectrum contains many additive quadruples| the large spectrum contains many additive quadruples]]&lt;br /&gt;
&lt;br /&gt;
[[Section 5: the large spectrum of a non-incrementing set is not additively smoothing| the large spectrum of a non-incrementing set is not additively smoothing]]&lt;br /&gt;
&lt;br /&gt;
[[Section 6: the structure of additively non-smoothing sets| the structure of additively non-smoothing sets]]&lt;br /&gt;
&lt;br /&gt;
[[Section 7: the structure of the large spectrum of a non-incrementing cap set| the structure of the large spectrum of a non-incrementing cap set]]&lt;br /&gt;
&lt;br /&gt;
[[Section 8: using the structure of the large spectrum to obtain density increments| using the structure of the large spectrum to obtain density increments]]&lt;br /&gt;
&lt;br /&gt;
==Annotated bibliography==&lt;br /&gt;
&lt;br /&gt;
*Michael Bateman and Nets Katz, [http://arxiv.org/pdf/1101.5851v1 New bounds on cap sets]. The paper that obtains an improvement to the Roth/Meshulam bound for cap sets.&lt;br /&gt;
&lt;br /&gt;
*Tom Sanders, [http://arxiv.org/abs/1011.0104  On Roth&#039;s theorem on progressions]. The paper that gets within loglog factors of the 1/logN density barrier in Roth&#039;s theorem.&lt;br /&gt;
&lt;br /&gt;
*Ernie Croot and Olof Sisask, [http://arxiv.org/abs/1003.2978  A probabilistic technique for finding almost-periods of convolutions]. The paper that introduced a technique that is central to Sanders&#039;s proof.&lt;br /&gt;
&lt;br /&gt;
*Nets Katz and Paul Koester, [http://arxiv.org/abs/0802.4371  On additive doubling and energy], Another paper that introduced a method that Sanders used and that is related to some of the ideas in the Bateman-Katz paper as well.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3972</id>
		<title>Improving the bounds for Roth&#039;s theorem</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Improving_the_bounds_for_Roth%27s_theorem&amp;diff=3972"/>
		<updated>2011-02-06T01:09:15Z</updated>

		<summary type="html">&lt;p&gt;Olof: Adding section headers for the BK argument; these may need to be revised as our understanding develops&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is a wiki for Polymath6, a project that aims to look at recent results of Bateman and Katz and of Sanders, and attempt to combine the ideas that go into them in order to break the log N-barrier in Roth&#039;s theorem. I hope that soon there will be expositions of their proofs, as well as of much of the background material needed to understand them. The wiki has only just been started, so has very little material on it so far.&lt;br /&gt;
&lt;br /&gt;
==Background material==&lt;br /&gt;
&lt;br /&gt;
[[Sketch of the Roth/Meshulam argument for cap sets]]&lt;br /&gt;
&lt;br /&gt;
[[Basic facts about Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
[[Sketch of Bourgain&#039;s first argument using Bohr sets]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==The Bateman-Katz argument==&lt;br /&gt;
&lt;br /&gt;
[[Overview]]&lt;br /&gt;
&lt;br /&gt;
[[Section 3: the &amp;quot;Nd-estimate&amp;quot;]]&lt;br /&gt;
&lt;br /&gt;
[[Section 4: the large spectrum contains many additive quadruples]]&lt;br /&gt;
&lt;br /&gt;
[[Section 5: the large spectrum of a non-incrementing set is not additively smoothing]]&lt;br /&gt;
&lt;br /&gt;
[[Section 6: the structure of additively non-smoothing sets]]&lt;br /&gt;
&lt;br /&gt;
[[Section 7: the structure of the large spectrum of a non-incrementing cap set]]&lt;br /&gt;
&lt;br /&gt;
[[Section 8: using the structure of the large spectrum to obtain density increments]]&lt;br /&gt;
&lt;br /&gt;
==Annotated bibliography==&lt;br /&gt;
&lt;br /&gt;
*Michael Bateman and Nets Katz, [http://arxiv.org/pdf/1101.5851v1 New bounds on cap sets]. The paper that obtains an improvement to the Roth/Meshulam bound for cap sets.&lt;br /&gt;
&lt;br /&gt;
*Tom Sanders, [http://arxiv.org/abs/1011.0104  On Roth&#039;s theorem on progressions]. The paper that gets within loglog factors of the 1/logN density barrier in Roth&#039;s theorem.&lt;br /&gt;
&lt;br /&gt;
*Ernie Croot and Olof Sisask, [http://arxiv.org/abs/1003.2978  A probabilistic technique for finding almost-periods of convolutions]. The paper that introduced a technique that is central to Sanders&#039;s proof.&lt;br /&gt;
&lt;br /&gt;
*Nets Katz and Paul Koester, [http://arxiv.org/abs/0802.4371  On additive doubling and energy], Another paper that introduced a method that Sanders used and that is related to some of the ideas in the Bateman-Katz paper as well.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3195</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3195"/>
		<updated>2010-07-09T08:28:53Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* First solution */ Updating solution in light of the compound moves section.&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
#* Note that, except for the first move, the coins in the left-most box here are only being used for Type 2 swaps, and otherwise compound move 1 is being applied to the second two boxes.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a,b \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;. This follows from the previous compound move together with a Type 2 swap.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 2&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt;, which is valid for &amp;lt;math&amp;gt;b=0&amp;lt;/math&amp;gt; as well. This follows from applying a Type 1 move to &amp;lt;math&amp;gt;[a,b]&amp;lt;/math&amp;gt; and then applying the previous compound move.&lt;br /&gt;
&lt;br /&gt;
The last two moves seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Let &amp;lt;math&amp;gt;T = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
We will be done if we can obtain the configuration &amp;lt;math&amp;gt;[0,0,0,T/4,0,0]&amp;lt;/math&amp;gt;, for then we can apply two compound move 1s to get the &amp;lt;math&amp;gt;[0,0,0,0,0,T]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
To get this configuration it is enough to get a configuration &amp;lt;math&amp;gt;[0,0,0,X,0,0]&amp;lt;/math&amp;gt; for some &amp;lt;math&amp;gt;X \geq T/4&amp;lt;/math&amp;gt;, because we can then apply Type 2 moves until the number of coins in the fourth box is reduced to &amp;lt;math&amp;gt;T/4&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Finally, one can obtain such a configuration using compound move 4. First note that we can get &amp;lt;math&amp;gt;[0,0, 140, 0, 0 ,0]&amp;lt;/math&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From this we get &amp;lt;math&amp;gt;[0,0, 139, 2, 0 ,0]&amp;lt;/math&amp;gt;, and then by applying compound move 4 139 times we get &amp;lt;math&amp;gt;[0,0, 0, X, 0 ,0]&amp;lt;/math&amp;gt; for some &amp;lt;math&amp;gt;X&amp;lt;/math&amp;gt; much, much bigger than &amp;lt;math&amp;gt;T/4&amp;lt;/math&amp;gt;, and so we are done.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3194</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3194"/>
		<updated>2010-07-09T08:14:01Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
#* Note that, except for the first move, the coins in the left-most box here are only being used for Type 2 swaps, and otherwise compound move 1 is being applied to the second two boxes.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a,b \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;. This follows from the previous compound move together with a Type 2 swap.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 2&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt;, which is valid for &amp;lt;math&amp;gt;b=0&amp;lt;/math&amp;gt; as well. This follows from applying a Type 1 move to &amp;lt;math&amp;gt;[a,b]&amp;lt;/math&amp;gt; and then applying the previous compound move.&lt;br /&gt;
&lt;br /&gt;
The last two moves seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3192</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3192"/>
		<updated>2010-07-08T20:14:38Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# By doing the same but with some intermediate Type 1 and 2 moves, one has &amp;lt;math&amp;gt;[a,b,0,0] \to [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt; for &amp;lt;math&amp;gt;a \geq 2&amp;lt;/math&amp;gt;, which is slightly stronger than the previous move applied twice if &amp;lt;math&amp;gt;b=0,1&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last two moves seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3191</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3191"/>
		<updated>2010-07-08T20:13:31Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# By doing the same but with some intermediate Type 1 and 2 moves, one has &amp;lt;math&amp;gt;[a,b,0,0] \to [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt; for &amp;lt;math&amp;gt;a \geq 2&amp;lt;/math&amp;gt;, which is slightly stronger than the previous move applied twice if &amp;lt;math&amp;gt;b=0,1&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last two moves seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
Slightly more general variants:&lt;br /&gt;
&lt;br /&gt;
# [a,b,c] \to [a-1, 2b+c]&lt;br /&gt;
# [a,b,c,d] \to [a-2, 2^{b+1}(2c+d), 0,0]&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3190</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3190"/>
		<updated>2010-07-08T20:08:31Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# In fact, by using intermediate Type 1 and 2 moves, one has &amp;lt;math&amp;gt;[a,b,0,0] \to [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt; for &amp;lt;math&amp;gt;a \geq 2&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last two moves seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3189</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3189"/>
		<updated>2010-07-08T20:03:40Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \to [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \to [0, b+2a, 0] \to [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \to [a-1,2,0] \to [a-1,0,4] \to [a-2,4,0] \to [a-2,0,8] \to \ldots \to [1, 0, 2^a] \to [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last move seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3188</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3188"/>
		<updated>2010-07-08T20:02:17Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \to [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \mapsto [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \to [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \to [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \to [1, 0, b+2a] \mapsto [0, b+2a, 0] \mapsto [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \to [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \mapsto [a-1,2,0] \mapsto [a-1,0,4] \mapsto [a-2,4,0] \mapsto [a-2,0,8] \mapsto \ldots \mapsto [1, 0, 2^a] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \to [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last move seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3187</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3187"/>
		<updated>2010-07-08T20:01:13Z</updated>

		<summary type="html">&lt;p&gt;Olof: /* Compound moves */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \mapsto [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \mapsto [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \mapsto [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \mapsto [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \mapsto [1, 0, b+2a] \mapsto [0, b+2a, 0] \mapsto [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \mapsto [a-1,2,0] \mapsto [a-1,0,4] \mapsto [a-2,4,0] \mapsto [a-2,0,8] \mapsto \ldots \mapsto [1, 0, 2^a] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for any &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \mapsto [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last move seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3186</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3186"/>
		<updated>2010-07-08T19:53:02Z</updated>

		<summary type="html">&lt;p&gt;Olof: Compound move update&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \mapsto [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \mapsto [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \mapsto [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \mapsto [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \mapsto [1, 0, b+2a] \mapsto [0, b+2a, 0] \mapsto [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# For &amp;lt;math&amp;gt;a \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,0,0] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \mapsto [a-1,2,0] \mapsto [a-1,0,4] \mapsto [a-2,4,0] \mapsto [a-2,0,8] \mapsto \ldots \mapsto [1, 0, 2^a] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, for &amp;lt;math&amp;gt;a, b \geq 1&amp;lt;/math&amp;gt; we have &amp;lt;math&amp;gt;[a,b,0,0] \mapsto [a-1, 2^b, 0, 0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last move seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
	<entry>
		<id>https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3185</id>
		<title>Imo 2010</title>
		<link rel="alternate" type="text/html" href="https://michaelnielsen.org/polymath/index.php?title=Imo_2010&amp;diff=3185"/>
		<updated>2010-07-08T19:49:31Z</updated>

		<summary type="html">&lt;p&gt;Olof: Uniformizing notation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This is the wiki page for the mini-polymath2 project, which seeks solutions to Question 5 of the 2010 International Mathematical Olympiad.   &lt;br /&gt;
&lt;br /&gt;
The project will start at [http://www.timeanddate.com/worldclock/fixedtime.html?year=2010&amp;amp;month=7&amp;amp;day=8&amp;amp;hour=16&amp;amp;min=0&amp;amp;sec=0&amp;amp;p1=0 16:00 UTC July 8], and is hosted at the [http://polymathprojects.org/ polymath blog].  A discussion thread is hosted at [http://terrytao.wordpress.com Terry Tao&#039;s blog].&lt;br /&gt;
&lt;br /&gt;
== Rules ==&lt;br /&gt;
&lt;br /&gt;
This project will follow the [http://polymathprojects.org/general-polymath-rules/ usual polymath rules].  In particular:&lt;br /&gt;
&lt;br /&gt;
* Everyone is welcome to participate, though people who have already seen an external solution to the problem should probably refrain from giving spoilers throughout the experiment.&lt;br /&gt;
* This is a team effort, not a race between individuals.  Rather than work for extended periods of time in isolation from the rest of the project, the idea is to come up with short observations (or to carry an observation of another participant further) and then report back what one gets to the rest of the team.  Partial results or even failures can be worth reporting.&lt;br /&gt;
* Participants are encouraged to update the wiki, or to summarise progress within threads, for the benefit of others.&lt;br /&gt;
&lt;br /&gt;
== Threads ==&lt;br /&gt;
&lt;br /&gt;
Discussion and planning:&lt;br /&gt;
&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/12/future-mini-polymath-project-2010-imo-q6/ Future mini-polymath project: 2010 IMO Q6?]  June 12, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/21/organising-mini-polymath2/ Organising mini-polymath2] June 21, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/06/27/mini-polymath2-start-time/ Mini-polymath2 start time], June 27, 2010.&lt;br /&gt;
* [http://terrytao.wordpress.com/2010/07/08/mini-polymath2-discussion-thread/ Mini-polymath2 discussion thread], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
Research:&lt;br /&gt;
&lt;br /&gt;
* [http://polymathprojects.org/2010/07/08/minipolymath2-project-imo-2010-q5/ Minipolymath2 project: IMO 2010 Q5], July 8 2010.&lt;br /&gt;
&lt;br /&gt;
== The question ==&lt;br /&gt;
&lt;br /&gt;
The question to be solved is Question 5 of the [http://www.imo-official.org/problems.aspx 2010 International Mathematical Olympiad]:&lt;br /&gt;
&lt;br /&gt;
: &#039;&#039;&#039;Problem&#039;&#039;&#039; In each of six boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5, B_6&amp;lt;/math&amp;gt; there is initially one coin. There are two types of operation allowed:&lt;br /&gt;
: &lt;br /&gt;
: &#039;&#039;Type 1:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq j \leq 5&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt; and add two coins to &amp;lt;math&amp;gt;B_{j+1}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: &#039;&#039;Type 2:&#039;&#039; Choose a nonempty box &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; with &amp;lt;math&amp;gt;1 \leq k \leq 4&amp;lt;/math&amp;gt;. Remove one coin from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; and exchange the contents of (possibly empty) boxes &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&lt;br /&gt;
: Determine whether there is a finite sequence of such operations that results in boxes &amp;lt;math&amp;gt;B_1, B_2, B_3, B_4, B_5&amp;lt;/math&amp;gt;  being empty and box &amp;lt;math&amp;gt;B_6&amp;lt;/math&amp;gt; containing exactly &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt; coins. (Note that &amp;lt;math&amp;gt;a^{b^c} := a^{(b^c)}&amp;lt;/math&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
== Observations and partial results ==&lt;br /&gt;
&lt;br /&gt;
* If the left-most box &amp;lt;math&amp;gt;B_1&amp;lt;/math&amp;gt; becomes empty, then it cannot ever become non-empty again.  Furthermore, the left-most box can never have more than one coin; it can be touched exactly once.&lt;br /&gt;
* Define the &#039;&#039;worth&#039;&#039; W of a state to be &amp;lt;math&amp;gt;W = B_6 + 2 B_5 + 4 B_4 + 8 B_3 + 16 B_2 + 32 B_1&amp;lt;/math&amp;gt;.  Then the initial worth is 63, the final desired worth is &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, and the Type 1 move does not affect the worth.  On the other hand, the Type 2 move increases the worth when &amp;lt;math&amp;gt;B_{j+2} - B_{j+1} \geq 4&amp;lt;/math&amp;gt;.&lt;br /&gt;
* Once one has a large number of coins in one of the first four boxes, say &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt;, one can apply the Type 2 move repeatedly to remove coins from &amp;lt;math&amp;gt;B_k&amp;lt;/math&amp;gt; while swapping &amp;lt;math&amp;gt;B_{k+1}&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_{k+2}&amp;lt;/math&amp;gt; repeatedly.  This suggests that it is relatively easy to remove coins from the system; the difficulty is in adding coins to the system.&lt;br /&gt;
* The total number of coins in the system is bounded.  Indeed, let &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; be the maximum number of coins that one can end up with starting with N boxes with at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; coins in them.  Thus for instance &amp;lt;math&amp;gt;f(1,\Sigma)=\Sigma&amp;lt;/math&amp;gt;.  By considering the times when one touches the left-most box, we can bound &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt; by at most &amp;lt;math&amp;gt;\Sigma&amp;lt;/math&amp;gt; iterations of the map &amp;lt;math&amp;gt;n \mapsto f(N-1,n)+2&amp;lt;/math&amp;gt; starting with &amp;lt;math&amp;gt;n=\Sigma&amp;lt;/math&amp;gt;.  This gives an Ackermann-type bound on &amp;lt;math&amp;gt;f(N,\Sigma)&amp;lt;/math&amp;gt;.  We need f(6,6) to be less than &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;, but this bound is likely to be too large.&lt;br /&gt;
&lt;br /&gt;
== Possible strategies ==&lt;br /&gt;
&lt;br /&gt;
* Split the problem into two pieces.  Part I: try to show the weaker result that the number of coins in the system can eventually be as large as &amp;lt;math&amp;gt;2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  Part II: Show that once one has a lot of coins, one can move to the final state where &amp;lt;math&amp;gt;B_1=\ldots=B_5=0&amp;lt;/math&amp;gt; and &amp;lt;math&amp;gt;B_6 = 2010^{2010^{2010}}&amp;lt;/math&amp;gt;.  &lt;br /&gt;
* Try to show that a quantity such as the worth increases or decreases in a controlled manner as one applies the Type 1 and Type 2 moves.&lt;br /&gt;
* We know that the first box can never contain more than one coin.  What can we say about the second box, third box, etc.?&lt;br /&gt;
** There may be a recursive formula for the maximal size of box &amp;lt;math&amp;gt;B_j&amp;lt;/math&amp;gt;, possibly requiring one to solve the five-box, four-box, etc. problems first.&lt;br /&gt;
* Work backwards?&lt;br /&gt;
* Try to completely solve the three-box problem (say) first: starting from &amp;lt;math&amp;gt;[X,Y,Z]&amp;lt;/math&amp;gt;, what is the most number of coins one can generate?&lt;br /&gt;
&lt;br /&gt;
== Compound moves ==&lt;br /&gt;
&lt;br /&gt;
Here we use Type 1 move &amp;lt;math&amp;gt;[a,b] \mapsto [a-1,b+2]&amp;lt;/math&amp;gt; and the Type 2 move &amp;lt;math&amp;gt;[a,b,c] \mapsto [a-1,c,b]&amp;lt;/math&amp;gt; to create more advanced moves.&lt;br /&gt;
&lt;br /&gt;
# We can create the move &amp;lt;math&amp;gt;[a,b] \mapsto [0,b+2a]&amp;lt;/math&amp;gt; from repeated application of Type 1.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[1,a,b] \mapsto [0,0,a+2b]&amp;lt;/math&amp;gt; by applying Type 2 once and then Type 1 b times.&lt;br /&gt;
#* Or, by using advanced move 1 first, the move &amp;lt;math&amp;gt;[1, a, b] \mapsto [1, 0, b+2a] \mapsto [0, b+2a, 0] \mapsto [0, 0, 2b+4a]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# We have &amp;lt;math&amp;gt;[a,0,0] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,0,0] \mapsto [a-1,2,0] \mapsto [a-1,0,4] \mapsto [a-2,4,0] \mapsto [a-2,0,8] \mapsto \ldots \mapsto [1, 0, 2^a] \mapsto [0,2^a,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
# Using the previous move, we have &amp;lt;math&amp;gt;[a,b,0,0] \mapsto [a-2, 2^{b+2}, 0, 0]&amp;lt;/math&amp;gt; via &amp;lt;math&amp;gt;[a,b,0,0] \mapsto [a,0,2^b,0] \mapsto [a,0,0,2^{b+1}] \mapsto [a-1,2,0,2^{b+1}] \mapsto [a-1,1,0,2^{b+2}] \mapsto [a-1,0,2^{b+2},0] \mapsto [a-2,2^{b+2},0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The last move seems to be the key to the solutions so far discovered, since it allows one to introduce an exponential at only a linear cost.&lt;br /&gt;
&lt;br /&gt;
== World records ==&lt;br /&gt;
&lt;br /&gt;
To make the second box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [1,0,3] \mapsto [0,3,0]&amp;lt;/math&amp;gt; places 3 coins in box 2.&lt;br /&gt;
&lt;br /&gt;
To make the third box as big as possible:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1] \mapsto [0,3,1] \mapsto [0,0,7]&amp;lt;/math&amp;gt; places 7 coins in box 3.  (Here we use advanced move 2).&lt;br /&gt;
&lt;br /&gt;
To make the fourth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;math&amp;gt;[1,1,1,1] \to [0,3,0,3] \to [0,2,2,3] \to [0,2,0,7] \to [0,1,7,0] \to [0,1,0,14]&amp;lt;/math&amp;gt; &amp;lt;math&amp;gt; \to [0,0,14,0] \to [0,0,0,28]&amp;lt;/math&amp;gt; gives 28 coins in box 4.&lt;br /&gt;
&lt;br /&gt;
To make the sixth box as big as possible&lt;br /&gt;
&lt;br /&gt;
* Using the fourth box move we have &amp;lt;math&amp;gt;[0,0,0,28,1,1]&amp;lt;/math&amp;gt;.  We can move &amp;lt;math&amp;gt;[28,1,1]&amp;lt;/math&amp;gt; to &amp;lt;math&amp;gt;[27,0,7]&amp;lt;/math&amp;gt; through Type 1; then using a variant of advanced move 3 we get &amp;lt;math&amp;gt;[0,0,7 * 2^{27}]&amp;lt;/math&amp;gt;, leading to &amp;lt;math&amp;gt;7 \times 2^{27}&amp;lt;/math&amp;gt; coins in the last box.&lt;br /&gt;
&lt;br /&gt;
* A new record: we can get &amp;lt;math&amp;gt;2^{192}&amp;lt;/math&amp;gt; in the 6th box by &amp;lt;math&amp;gt;[1,1,1,1,1,1]\to \to [0,0,7,1,1,1]\to [0,0,7,0,3,1] \to \to [0,0,1,0,3\times 2^6,1]\to[0,0,0,3\times 2^6,0,1]\to [0,0,0,0,0,2^{3\times 2^6}]&amp;lt;/math&amp;gt; where &amp;lt;math&amp;gt;\to\to&amp;lt;/math&amp;gt; are the special moves.&lt;br /&gt;
&lt;br /&gt;
== Completed solutions ==&lt;br /&gt;
&lt;br /&gt;
=== First solution ===&lt;br /&gt;
Fist, I introduce a move that follows from compound move 3:&lt;br /&gt;
&lt;br /&gt;
(1) &amp;lt;math&amp;gt; [N,M,0,0] \to [N-1, 1, 0, 2^{M+1}] \to [N-2,2^{M+1}, 0, 0 ] &amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
We can get &lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;math&amp;gt; [0,0, 140, 0, 0 ,0] &amp;lt;/math&amp;gt; as follows:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,2,2,2,2,3] \to [0,2,1,1,8,3] \to [0,2,1,1,0,19] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\to [0,1,1,36,0,0] \to [0,1,1,1,0,140] \to [0,0,140,0,0,0]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
By (1) combined with (2) we can get some number that is greater than &amp;lt;math&amp;gt;n:=2010^{2010^{2010}}&amp;lt;/math&amp;gt; in the 4th spot&lt;br /&gt;
&lt;br /&gt;
And swapping 5 with 6 enough times, we can adjust this number to have the value n/4. Moving everything to the right will give us the desired result.&lt;br /&gt;
&lt;br /&gt;
=== Second solution ===&lt;br /&gt;
&amp;lt;math&amp;gt;[1,1,1,1,1,1] \to [0,3,1,1,1,1] \to [0,2,3,1,1,1] \to [0,2,1,5,1,1] \to [0,2,1,1,9,1]&amp;lt;/math&amp;gt;&amp;lt;math&amp;gt; \to [0,2,1,1,0,19] \to [0,2,1,0,19,0] \to [0,2,0,19,0,0] \to [0,1,19,0,0,0]&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
Make use of the compound move 4 here so that &amp;lt;math&amp;gt;\to [0,1,0,2^{2^{\cdots^2}},0,0] \to [0,0,2^{2^{\cdots^2}},0,0,0]=[0,0,N,0,0,0]&amp;lt;/math&amp;gt; where there are 19 2&#039;s. Note that &amp;lt;math&amp;gt;2^{2^{2^2}}=2^16&amp;gt;2010&amp;lt;/math&amp;gt;. Thus, &amp;lt;math&amp;gt;M=2^{2^{\cdots^2}}&amp;lt;/math&amp;gt; with 12 2&#039;s is bigger than &amp;lt;math&amp;gt;K=2010^{2010^{2010}}&amp;lt;/math&amp;gt;. Hence, we have &amp;lt;math&amp;gt;N&amp;gt;K&amp;gt;K/8&amp;lt;/math&amp;gt; so that &amp;lt;math&amp;gt;[0,0,N,0,0,0] \to [0,0,N-1,0,0,0] \to \cdots \to [0,0,K/8,0,0,0] \to [0,0,0,K/4,0,0] \to [0,0,0,0,K/2,0] \to [0,0,0,0,0,K]&amp;lt;/math&amp;gt;.&lt;/div&gt;</summary>
		<author><name>Olof</name></author>
	</entry>
</feed>