Deolalikar P vs NP paper: Difference between revisions

Revision as of 10:00, 10 August 2010

Note: This is currently an UNOFFICIAL page on Deolalikar's P!=NP paper, and is not yet affiliated with a Polymath project.

This is a clearinghouse wiki page for the analysis of Vinay Deolalikar's recent preprint claiming to prove that P != NP, and to aggregate various pieces of news and information about this paper. Corrections and new contributions to this page are definitely welcome. Of course, any new material should be sourced whenever possible, and remain constructive and objectively neutral; in particular, personal subjective opinions or speculations are to be avoided. This page is derived from an earlier collaborative document created by Suresh Venkatasubramanian.

For the latest discussion on the technical points of the paper, see this thread of Dick Lipton and Ken Regan. For meta-discussion of this wiki (and other non-mathematical or meta-mathematical issues), see this thread of Suresh Venkatasubramanian.

The paper

These links are taken from Vinay Deolalikar's web page.

First draft, Aug 6, 2010
Second draft Aug 9, 2010.

Typos and minor errors

(Second draft, page 31, Definition 2.16): "Perfect man" should be "Perfect map". (via Blake Stacey)
(Second draft) Some (but not all) of the instances of the [math]\displaystyle{ O() }[/math] notation should probably be [math]\displaystyle{ \Theta() }[/math] or [math]\displaystyle{ \Omega() }[/math] instead, e.g. on pages 4, 9, 16, 28, 33, 57, 68, etc. (via András Salamon)
(Second draft, page 27) [math]\displaystyle{ n 2^n }[/math] independent parameters → [math]\displaystyle{ n 2^k }[/math] independent parameters

Proof strategy

(Excerpted from this comment of Ken Regan)

Deolalikar has constructed a vocabulary V which apparently obeys the following properties:

Satisfiability of a k-CNF formula can be expressed by NP-queries over V—in particular, by an NP-query Q over V that ties in to algorithmic properties.
All P-queries over V can be expressed by FO(LFP) formulas over V.
NP = P implies Q is expressible by an FO(LFP) formula over V.
If Q is expressible by an LFP formula over V, then by the algorithmic tie-in, we get a certain kind of polynomial-time LFP-based algorithm.
Such an algorithm, however, contradicts known statistical properties of randomized k-SAT when k >= 9.

Possible issues

Issues with LFP

There appear to be three issues related to the use of the characterization of P in terms of first order logic, an ordering and a least fixed point operator. All of these are discussed in the Lipton/Regan post, with contributions from David Barrington, Paul Christiano, Lance Fortnow, James Gate, and Arthur Milchior.

Is the lack of ordering in the logical structures used to define the LFP structure a problem ? On the surface, it appears to be, since it is not known whether FO(LFP) can be used to characterize P without ordering. (No, it is known that parity can not be expressed without an ordering even with LFP, hence P is not captured without order [AVV1997, page 35]. But in chapter 7 this issue seems to disappear since he introduces a successor relation over the variables [math]\displaystyle{ x_1\lt \dots\lt x_n\lt \neg x_1\lt \dots\lt \neg x_n }[/math] )
The paper requires that a certain predicate in the FO(LFP) formula be unary, and forces this by expanding neighborhoods and constructing k-tuples of parameters to act as single parameters. It is not clear how this affects the arguments about the propagation of local neighborhoods.
Does the logical vocabulary created to express the LFP operation suffice to capture all P-time operations ?

Issues with random k-SAT

Whether the "condensation" stage is significant: the latest ideas from physics suggest that random [math]\displaystyle{ k }[/math]-SAT and similar CSPs don’t become hard at the clustering transition, but rather at the condensation transition where a subexponential number of clusters dominate the space of solutions. Graph coloring provides some evidence of this. Moreover, random k-XORSAT has a clustering transition, frozen variables, etc., but is of course in P. (Cris Moore, Alif Wahid, and Lenka Zdeborova)
Whether a complex solution space amounts to true problem hardness: The author tries to use the fact that for certain distributions of random k-SAT, the solution space has a "hard structure". There are two "meta" objections to this. They don't actually point to a place where the proof is wrong. But they do appear to give an obstacle to the general proof method. Any objections to these objections are welcome!
1. Polytime solvable problems (such as perfect matching on random graphs) can also have complicated solution distributions (Ryan Williams, on twitter). In fact it is not hard to design 2-SAT formulas (in this case not random, but specifically designed ones) so that they have exponentially many clusters of solutions, each cluster being "far" from the others. That is, the fact that random k-SAT has a "hard" distribution of solutions does not seem to be relevant for showing that k-SAT is hard. That is, it is not sufficient to use a problem with a hard distribution of solutions, if you're separating P from NP.
2. Moreover, a hard distribution of solutions is not necessary for NP-hardness, either. The "hard" case of 3-SAT is the case where there is *at most one* satisfying assignment. There is a randomized reduction from 3-SAT to 3-SAT with at most ONE satisfying assignment (Valiant-Vazirani). This reduction increases the number of clauses and the number of variables, but that doesn't really matter. The point is that you can always reduce 3-SAT with a "complex" solution space to one with an "easy" solution space, so how can a proof separating P from NP rely on the former? Suppose Valiant-Vazirani can be derandomized to run in deterministic polynomial time (which is true if plausible circuit lower bounds hold up). For every LFP formula F that is to solve k-SAT, replace it with an LFP formula F' that has computationally equivalent behavior to the following algorithm: first "Valiant-Vazirani-ize" your input formula (reduce it to having at most one solution in polynomial time) then evaluate F on the result. These new formulas only have at most "one" solution to deal with. The intuition here is that either Valiant-Vazirani can't be derandomized (very unlikely) or the proof must break (Ryan Williams, on twitter). But again, this is just intuition.

Barriers

Any P vs NP proof must deal with the three known barriers described below. The concerns around this paper have not yet reached this stage yet.

Relativization

Natural proofs

Algebraization

Terminology

Finite model theory
Immerman-Vardi theorem
Least fixed point (LFP) in general, and in a descriptive complexity setting
Random k-SAT
The complexity class NP
The complexity class P
SAT is reffering to the Boolean satisfiability problem. [1]

Online reactions

Theoretical computer science blogs

P ≠ NP, Greg Baker, Greg and Kat’s blog, August 7 2010
A proof that P is not equal to NP?, Richard Lipton, Godel's lost letter and P=NP, August 8 2010
On the Deolalikar proof: Crowdsourcing the discussion ?, Suresh Venkatasubramanian, The Geomblog, August 9 2010
Putting my money where my mouth isn’t, Scott Aaronson, Shtetl-Optimized, August 9 2010
That P ne NP proof- whats up with that?, Bill Gasarch, Computational Complexity, August 9 2010
Issues In The Proof That P≠NP, Richard Lipton and Ken Regan, Godel's lost letter and P=NP, August 9 2010
Deolalikar's manuscript, András Salamon, Constraints, August 9 2010
A relatively serious proof that P != NP ?, Antonio E. Porreca, August 9 2010 (aggregates all the comments)
A 'polymath' home for analysis of the Deolalikar proof, Suresh Venkatasubramanian, The Geomblog, August 10 2010

Media and aggregators

P ≠ NP, Hacker News, August 8 2010
Claimed Proof That P != NP, Slashdot, August 8 2010
P != NP möglicherweise bewiesen, heise online, August 8 2010
P=NP=WTF?: A Short Guide to Understanding Vinay Deolalikar's Mathematical Breakthrough, Dana Chivvis, AolNews, August 9 2010
HP Researcher Claims to Crack Compsci Complexity Conundrum, Joab Jackson, IDG News, August 9 2010

Real-time searches

Other

Twitter, Lance Fortnow, August 8 2010
P<>NP?, Dave Bacon, The Quantum Pontiff, August 8 2010
How to get everyone talking about your research, Daniel Lemire, August 9 2010
Twitter, Ryan Williams, August 9 2010
Google Buzz, Terence Tao, August 9 2010
P ≠ NP?, Bruce Schneier, Schneier on Security, August 9 2010.
Vinay Deolalikar says P ≠ NP, Philip Gibbs, vixra log, August 9 2010.

Additions to the above list of links are of course very welcome.

Timeline

August 6: Vinay Deolalikar sends out his manuscript to several experts in the field.
August 7: Greg Baker posts about the manuscript on his blog.
August 8: The paper is noted on Hacker News and Slashdot, and discussed on many theoretical computer science blogs.
August 9: A second draft of the manuscript is posted.
August 9: Suresh Venkatasubramanian collects several technical comments on the paper into a collaborative document.
August 9: In a post of Dick Lipton and Ken Regan, several technical issues and concerns raised by various experts are discussed.
August 10: Venkatasubramanian's document is migrated over to a wiki page.

Bibliography

[AVV1997] S. Abiteboul, M. Y. Yardi, V. Vianu, "Fixpoint logics, relational machines, and computational complexity", Journal of the ACM (JACM) Volume 44, Issue 1 (January 1997), 30-56.
[AM2003] D. Achlioptas, C. Moore, "Almost all graphs with average degree 4 are 3-colorable", Journal of Computer and System Sciences 67, Issue 2, September 2003, 441-471.
[I1986] N. Immerman, "Relational queries computable in polynomial time", Information and Control 68 (1986), 86-104.
[VV1986] L. G. Valiant, V. V. Vazirani, "NP is as easy as detecting unique solutions", Theoretical Computer Science (North-Holland) 47: 85–93 (1986). doi:10.1016/0304-3975(86)90135-0.
[V1982] M. Vardi, Complexity of Relational Query Languages, 14th Symposium on Theory of Computation (1982), 137-146.

@@ Line 44: / Line 44: @@
 # '''Whether the "condensation" stage is significant''': the latest ideas from physics suggest that random  <math>k</math>-SAT and similar CSPs don’t become hard at the clustering transition,  but rather at the condensation transition where a subexponential number  of clusters dominate the space of solutions.  Graph coloring [http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6WJ0-49M05RK-3&_user=10&_coverDate=09%2F30%2F2003&_rdoc=1&_fmt=high&_orig=search&_sort=d&_docanchor=&view=c&_acct=C000050221&_version=1&_urlVersion=0&_userid=10&md5=13eae49445b87797b1f90aa42e54b5a5 provides some evidence of this]. Moreover, random k-XORSAT has a clustering transition, frozen  variables, etc., but is of course in P. ([http://rjlipton.wordpress.com/2010/08/08/a-proof-that-p-is-not-equal-to-np/#comment-4505 Cris Moore], [http://rjlipton.wordpress.com/2010/08/08/a-proof-that-p-is-not-equal-to-np#comment-4518 Alif Wahid], and [http://rjlipton.wordpress.com/2010/08/08/a-proof-that-p-is-not-equal-to-np/#comment-4633 Lenka Zdeborova])
-# '''Whether a complex solution space amounts to true problem hardness''': The author tries to use the fact that for certain distributions of random k-SAT, the solution space has a "hard structure". There are two "meta" objections to this. They don't actually point to a place where the proof is wrong. But they do appear to give an obstacle to the general proof method.
+# '''Whether a complex solution space amounts to true problem hardness''': The author tries to use the fact that for certain distributions of random k-SAT, the solution space has a "hard structure". There are two "meta" objections to this. They don't actually point to a place where the proof is wrong. But they do appear to give an obstacle to the general proof method. ''Any objections to these objections are welcome!''
 ## Polytime solvable problems (such as perfect matching on random graphs) can also have complicated solution distributions (Ryan Williams, on [http://twitter.com/rrwilliams/status/20741046788 twitter]). In fact it is not hard to design 2-SAT formulas (in this case not random, but specifically designed ones) so that they have exponentially many clusters of solutions, each cluster being "far" from the others. That is, the fact that random k-SAT has a "hard" distribution of solutions does not seem to be relevant for showing that k-SAT is hard. That is, it is not sufficient to use a problem with a hard distribution of solutions, if you're separating P from NP.
 ## Moreover, a hard distribution of solutions is not necessary for NP-hardness, either. The "hard" case of 3-SAT is the case where there is *at most one* satisfying assignment. There is a randomized reduction from 3-SAT to 3-SAT with at most ONE satisfying assignment ([http://en.wikipedia.org/wiki/Valiant%E2%80%93Vazirani_theorem Valiant-Vazirani]). This reduction increases the number of clauses and the number of variables, but that doesn't really matter. The point is that you can always reduce 3-SAT with a "complex" solution space to one with an "easy" solution space, so how can a proof separating P from NP rely on the former? Suppose Valiant-Vazirani can be derandomized to run in deterministic polynomial time (which is true if plausible circuit lower bounds hold up). For every LFP formula F that is to solve k-SAT, replace it with an LFP formula F' that has computationally equivalent behavior to the following algorithm: first "Valiant-Vazirani-ize" your input formula (reduce it to having at most one solution in polynomial time) then evaluate F on the result. These new formulas only have at most "one" solution to deal with. The intuition here is that either Valiant-Vazirani can't be derandomized (very unlikely) or the proof must break (Ryan Williams, on [http://twitter.com/rrwilliams/status/20741046788 twitter]). But again, this is just intuition.