Polylog parameterizability
From Polymath1Wiki
In this post I will try to describe what I believe Deolilakar means by "polylog parameterizability", a key concept in his work. (A more accurate term here would be "polylog recursive factorizability".
Contents |
Definition
Let μ be a probability distribution on the cube {0,1}n. Then to describe this distribution fully, one needs 2n − 1 real parameters; one real number μ({x}) for each of the 2n elements of the cube, minus one for the constraint
.
Roughly speaking, a probability distribution is "polylog parameterizable" if it requires much fewer than 2n real numbers to describe, and more precisely it needs only exp(O(polylog(n))) many real numbers. More precisely still, we ask that the distribution function
admit a factorisation into potentials
where for each j, pa(j) is a set of O(polylog(n)) "parents" of j (as given by a directed acyclic graph), and
is a probability distribution for each assignment of the parent literals xpa(j). The point is that each factor pj only takes exp(O(polylog(n)) real numbers to describe, and so the entire distribution also only requires exp(O(polylog(n)) real numbers to describe.
The purpose of the FO(LFP) machinery is (I think) ultimately to obtain a result of the following form:
- Claim. If a solution space in {0,1}n can be "explored by a P algorithm" in a suitable sense, then the uniform distribution μ of that space is the _projection_ of a polylog-parameterisable measure μ' on a subset of a polynomially larger space
.
The following can be easily shown (see proof):
A probability distribution on {0,1}n that can be computed by a polynomial time run on a probabilistic Turing machine is a projection of a polylog parametrizable distribution.
However, a key point that does not seem to be sufficiently addressed in the paper is whether the projection of a polylog-parameterisable measure remains polylog-parameterisable, and which I believe could in fact be a serious problem with the proof. In fact, it is not the case, shown by counterexamples below.
Examples
Let me illustrate the above Claim with a simple example, that of a straight line program. Let N: = n100, and consider a boolean function
obtained by the formula
where
are obtained from previous literals by formulae of the form
-
(1)
for
and some logic gates
. Now let
be the graph of f, and let μ be the uniform distribution on S, thus μ assigns a probability 2 − n to each element of S:
(2)
As f is fairly complicated (indeed, it is not obviously distinguishable from a random function), it does not seem that μ itself is polylog parameterisable. However, μ can be lifted to a probability measure μ' on the larger space {0,1}N which _is_ polylog parameterisable. Namely, let
be the set of all tuples
for which (1) holds for all
. Clearly, the map
from {0,1}N to {0,1}n + 1 maps S' to S in a bijective manner, and so the uniform measure μ' on S' projects to the uniform measure μ on S.
On the other hand, μ' is polylog-parameterisable. Indeed, we have the explicit factorisation
-
.
Next, consider the problem of parameterising a certain distribution ν to be defined shortly on the solution space
of some function
, which we assume to be non-empty. We assume that the partial satisfiability problem is solvable here, thus for any initial partial assignment
of literals, one can quickly solve the problem of whether this partial assignment can be completed to a full solution in T. Indeed, let us suppose that there is a polynomial length straight line program
such that
if and only if there exists
for which
.
We now define ν to be the probability distribution on T defined by selecting random variables
consecutively in the following manner. Assume that
have already been selected and are extendible (i.e.
). We then compute
and
; at least one of these must equal 1. If only the former equals 1, we set xi: = 0; if the latter, we set xi: = 1; and otherwise, we set xi equal to either 0 or 1 with a 1/2 probability of each.
One can then factor ν as
where
is the delta distribution δ0 if a0 = 1,a1 = 0, the delta distribution δ1 if a0 = 0,a1 = 1, the uniform distribution
if a0 = a1 = 1, and some arbitrary distribution (e.g. δ0 for concreteness) in the (unattainable) case a0 = a1 = 0.
This is not yet of a polylog parameterizable form (each i has i-1 ancestors rather than polylog ancestors).
However, if one expands all the straight line programs fi and adds all the literals required for computing such programs, then we can lift ν to a polylog parameterizable measure ν' similarly to before. However ν itself is not obviously a polylog parameterizable measure, merely the projection of one that involves additional "hidden variables".
Another remark: since NP is in EXP, I think that the solution space to a k-SAT problem, if it is non-empty admits, the projection of a degree O(1) recursively factorizable measure on exponentially many variables. This already illustrates that the projection operation is powerful enough to convert even the simplest measure into an extremely general one, at least if one is allowed to discard an exponential number of variables.
Counterexamples
The following measures are not polylog parameterisable, though they are projections of polylog parameterisable measures in a higher dimensional space:
- uniform (or any positive) measure on
- uniform (or any positive) measure on
.
To see the non polylog parameterisability, we use
- Claim: Let
, let ΧS be the characteristic function of S, and let μ be a probability measure whose support is exactly S (i.e. μ(x) > 0 if and only if x is in S. Assume that μ is polylog parametriable. Then, there is some permutation (reordering) of [n] such that Boolean function
can be expressed/computed as follows:
-
, where each g is a Boolean function of xi and polylog-many xj’s with j < i.
We can see the claim by focusing on measure being zero or nonzero in the definition of polylog paramateizability. Now consider #1. With respect to any ordering (permutation) of [n], the membership in EVEN cannot be expressed by such an AND as in the claim because the last factor g(xn;xj's) only depend on *fixed* polylog many xj's. By the same reason, #2 is not polylog parametrizable. (To express #1 and #2 in such a form, the last g(xn; xj's) have to depend on all n variables.)
On the other hand EVEN can be polylog parametrized if we add a number of auxilary variables, placeholders for partial sums, so that we never add more than polylog elements (we can always add a fixed number), and similarly for #2.
pp vs ppp
For the purposes of discussion, the following abbreviations have been adopted:
- A probability distribution on {0,1}n is pp (polylog parameterizable) if it is polylog parameterizable in the above sense, WITHOUT the addition of any additional variables beyond the existing n literals
.
- A probability distribution on {0,1}n is ppp (projected polylog parameterizable) if it is the projection of a pp distribution on {0,1}N for some polynomial size N = nO(1) under a map that keeps n of the N literals and discards the rest.
Thus, for instance, the above discussion shows that the graph of a straight line program of polynomial length is in ppp, and if P=NP then the solution space of every NP problem has a ppp distribution supported on it. The sets EVEN and
can also be easily seen to support ppp distributions (e.g. the uniform distribution on either is ppp), but neither of these sets supports pp distributions.
As another example, consider the space of collections of k-clauses on n-variables for which the corresponding SAT problem is solvable, where k is fixed (e.g. k=10) and n is large. (This is a subset of {0,1}^C, where C is the space of k-clauses on n variables.) [Note that this space is NOT the same as the solution space of a single k-SAT instance; the former is the "base" of the k-SAT relation, while the latter is a "fiber".] On the one hand, if P!=NP, we know that this space is not in P. On the other hand, it supports a ppp distribution, because one can randomly select a solution
first, and then select a random family of the clauses that are obeyed by this solution. Thus we see that ppp supports can actually be quite "hard" in nature.
As a further example of a set supporting a ppp distribution, consider the image of {0,1}^n via a hash function or some other computable "one-way function".
The pp vs ppp distinction appears to be a crucial issue in the Deolalikar argument. As far as can be discerned from the manuscript, the paper establishes the following facts:
- If P=NP, then the solution space of any k-SAT problem with at least one solution, supports a ppp distribution.
- The hard phase of k-SAT does not support any pp distribution.
However this does not allow us to conclude that P!=NP, because of the substantial gap between pp and ppp.
Furthermore, it is in fact true that the solution space of any k-SAT problem with at least one solution supports a ppp distribution regardless of whether P=NP, though the ppp distribution is not uniform in the sense that it depends on knowing the specific solution. Specifically, one randomly selects an element of {0,1}^n, and tests whether it solves the k-SAT problem. If it does, output that element; otherwise output the specific solution. It is easy to see that this distribution is ppp and is supported exactly on the solution space of k-SAT.
