how to not prove
play

How to not prove A Brief Introduction to Natural Proofs & - PowerPoint PPT Presentation

How to not prove A Brief Introduction to Natural Proofs & Data Complexity Shubhang Kulkarni and Ryan Davis Part I : Introduction to Natural Proofs Shubhang Kulkarni The obstacle is, roughly, that a large class of


  1. How to not prove 𝑄 β‰  𝑂𝑄 A Brief Introduction to Natural Proofs & Data Complexity Shubhang Kulkarni and Ryan Davis

  2. Part I : Introduction to Natural Proofs Shubhang Kulkarni β€œThe obstacle is, roughly, that a large class of approaches to circuit lower bounds must prove more” - R. Lipton

  3. Why This Talk is Important Lowerbounds ≑ Computational Intractability of a problem Modern day e-commerce heavily relies on certain lower bounds being true. Its natural to ask what makes lower bound questions so difficult β€’ 𝑄 β‰  𝑂𝑄 ? β€’ 𝑄 β‰  𝑄𝑇𝑄𝐡𝐷𝐹 ? β€’ 𝑄 β‰  𝑂𝐷 ?

  4. Algorithms vs Complexity Theory Algorithms Theory βˆƒ β€’ When can we solve problems quickly? β€’ What’s an efficient way to solve the problem? Complexity Theory β€’ When can problems not be solved efficiently? βˆ€ β€’ How can we prove that a problem is not easy? The algorithm designers and the complexity theorists have opposing goals.

  5. Complexity Barriers It turns out we have some formal understanding of why lower bounds are so tough to prove (at the moment) Any lower bound proof must overcome complexity barriers These barriers are β€œmeta-theorems” about proofs β€’ Relativization β€’ Natural Proofs Baker, Gill, Solovay β€’ Algebrization βˆƒ Oracles A, B such that 𝑄 - = 𝑂𝑄 - but 𝑄 / β‰  𝑂𝑄 /

  6. Simple Notation β€’ 0,1 3 ∢ n-bit binary strings (input x) Also denoted by 𝐢 3 3 : function 𝑔: 0,1 3 β†’ {0,1} β€’ 𝑔 β€’ 𝐺 3 ∢ set of all functions 𝑔 3 β€’ 𝐷 3 : combinatorial property of 𝑔 3

  7. Combinatorial Properties I 𝐷 3 can be thought of as a subset of 𝐺 3 of the functions possessing the property β€’ 𝑔 3 β€œsatisfies” 𝐷 3 ↔ 𝑔 3 ∈ 𝐷 3 β€’ Also denoted as: 𝐷 3 𝑔 3 = 1 if 𝑔 3 β€œsatisfies” 𝐷 3 , = 0 otherwise

  8. Combinatorial Properties II Actually for any subset of 𝐷 3 𝐷 3 is Natural if it satisfies (1) Constructivity There is a polynomial algorithm to determine whether 𝑔 3 ∈ 𝐷 3 (2) Largeness A random 𝑕 3 has a β€œnon-negligible” chance of satisfying 𝐷 3 Formally, |𝐷 3 | β‰₯ 2 BC 3 β‹… |𝐺 3 | Terminology A combinatorial property is useful against 𝑄 /FGHI if the circuit sizes of all functions satisfying 𝐷 3 are super-polynomial.

  9. Natural Proof β€œA proof that some function does not have a polynomial size circuit is natural against 𝑄 /FGHI if the proof contains the definition of natural combinatorial properties useful against 𝑄 /FGHI .” Want to prove { 𝑕 3 } has no polynomial circuit β€’ Identify property 𝐷 3 such that β€’ The proof shows that βˆ€π‘” 3 ∈ 𝐷 3 , 𝑔 3 is β€œhard” for circuits to compute β€’ 𝑕 3 ∈ 𝐷 3

  10. The NaΓ―ve Approach to P β‰  NP β€’ Define mathematically the notion of β€œdiscrepancy” or β€œscatter” of boolean function values i.e. Define a 𝐷 3 s.t. 𝐷 3 is true for functions of β€œhigh discrepancy” β€’ Show (inductively) that poly circuits can only compute low discrepancy functions i.e. 𝐷 3 is useful as 𝑔 3 ∈ 𝐷 3 cannot be computed by poly circuit β€’ Show that SAT has high discrepancy i.e. SAT ∈ 𝐷 3 β€’ 𝑄 βŠ‚ 𝑄 /FGHI implies 𝑄 β‰  𝑂𝑄

  11. The Breakthrough Results β€’ β€œOur main theorem … … gives evidence that no proof strategy along these lines can ever succeed” – Razborov, Rudich ’ 96 β€’ β€œAny Large and Constructive 𝐷 3 useful against 𝑄 /FGHI provides a statistical test that can be used to break any polytime psuedo-random generator.” β€’ Violates a widely held belief that psuedo-random generators of hardness 2 K (πœ— > 0) exist.

  12. Psuedo-Random Generators f(𝑦 3 ) 𝚾(𝒀 𝒍 ) Ξ¦(𝑦 R ) 𝟏, 𝟐 𝒐 {0,1} 𝟏, 𝟐 R π‘Œ R π‘Œ 3 β€’ 𝑄 𝑔 = 1 : Probability that 𝑔(x \ ) = 1 β€’ 𝑄 𝑔 ] = 1 : Probability that 𝑔 Ξ¦ x ^ = 1 𝑄 𝑔 = 1 | 𝑦 3 ∈ Ξ¦(π‘Œ R ) Ξ¦ is a M-hard psuedo-random generator if 𝑄 𝑔 β‰ˆ 𝑄(𝑔 ] ) ≀ 𝑁 Bc i.e. 𝑄 𝑔 βˆ’ 𝑄 𝑔 ]

  13. The Entire Picture (so far…) Integer One Way Discrete Log Empirical Evidence Factorization Functions Exist Hardness Hardness Psuedo-Random No Natural Proofs Generators Exist 𝑸 β‰  𝑢𝑸

  14. Implications of no Natural Proofs Recall the definition of Natural Proofs Any property used by a non-natural lower bound proof must fall into one of: β€’ Violates Largeness β€’ Probability that a random function has the property is small β€’ Property is shared by very few β€’ Violates Constructability β€’ Very complicated property

  15. Takeaways Decoding the literature: Notation Description Size of Distribution Type 0,1 3 2 3 Input Set Set 2 f g Functions on 0,1 3 Set of Sets 𝐺 3 2 f hg 𝐷 3 Properties of 𝐺 Set of Sets of Sets 3 A random efficiently computable function is very hard to distinguish from a random function β€’ Main Proof Idea [RR97] β€’ The rubik's cube β€’ 3 bit scramblers’ composition

  16. End of Part I Shubhang Kulkarni β€œThe general problem of mathematically proving computational lower bounds is a mystery” - Ryan Williams, Thinking Algorithmically About Impossibility

  17. Part II : Introduction to Data Complexity Ryan Davis β€œ[We] will have to develop new methods to make a serious dent in major lower bound problems.” - Ryan Williams, Thinking Algorithmically About Impossibility

  18. Why use Data Complexity? Closely related to program verification (testing) Carefully chosen input/output pairs to determine correctness When does it suffice to use a small number of test cases? What if we know something about the program, such as its size?

  19. Defining Data Complexity Decision Problem Potential Solution Assume a known function 𝑔: {0,1} βˆ— β†’ {0,1} Given a circuit 𝐷 of size 𝑑 , we wish to determine if 𝐷 computes 𝑔 Data complexity (w.r.t. 𝑑 ) – minimum number of input/output examples to determine if 𝐷 computes 𝑔 β€œGray-box” testing where 𝑑 is side information

  20. Overarching Question The data complexity for size 𝑑 circuits is trivially 2 C(k) (Include all input/output examples up to length 𝑑 ) We are interested to know: For what functions 𝑔 can the data complexity be much smaller?

  21. Data Complexity and Circuit Complexity The data complexity of testing 𝑔 is β€œsmall” If and only if The circuit complexity of 𝑔 is β€œlarge” β€œThe theory of circuits becomes interesting when we restrict the complexities of the circuits; The theory of test suites becomes similarly interesting when restricting the amount of necessary data.” - Chapman, B., Williams, R.

  22. The Circuit-Input Game β€’ Circuit player has a set 𝐷 of all circuits of size 𝑑 β€’ Size of 𝐷 is 𝐷 = 2 C(k lmn k) 2 C(k lmn k) β€’ Input player has a set 𝐽 of all inputs of length π‘œ β€’ Size of 𝐽 is |𝐽| = 2 3 β€’ Payoff matrix 𝑁 with 2 3 rows and 2 C(k lmn k) columns 2 3 𝑁 β€’ 𝑁 𝑑, 𝑦 = 0 if 𝑑 𝑦 = 𝑔(𝑦) β€’ 𝑁 𝑑, 𝑦 = 1 if 𝑑 𝑦 β‰  𝑔(𝑦) Payoff goes to the input player

  23. Approximate Optimal Strategies Theorem 2.1 (roughly) Circuit player has a good strategy! 𝑙 β‰₯ 𝑑 u π‘œ 1. There exists a 𝑙 -size distribution π‘ž (strategy) on 𝐷 such that for all 𝑦 ∈ 𝐽 , the circuit player has a good chance 𝑑 ∈ π‘ž will satisfy 𝑑 𝑦 = 𝑔 𝑦 β„“ β‰₯ 𝑑 u 𝑑 log 𝑑 2. There exists a β„“ -size distribution π‘ž (strategy) on 𝐽 such that for all 𝑑 ∈ 𝐷 , the input player has a good chance 𝑦 ∈ π‘ž will satisfy 𝑑 𝑦 β‰  𝑔 𝑦 Input player has a good strategy!

  24. Data Complexity Consequence Theorem 2.2 (roughly) Let π‘ž + π‘Ÿ ≀ 1 βˆ’ πœ— 𝑙 β‰₯ 𝑑 u π‘œ 1. There exists a 𝑙 -size set of circuits 𝑍 βŠ† 𝐷 such that for all 𝑦 ∈ 𝐽 , 𝑑 𝑦 = 𝑔 𝑦 for more than a π‘ž -fraction of circuits 𝑑 ∈ 𝑍 . β„“ β‰₯ 𝑑 u 𝑑 log 𝑑 2. There exists a β„“ -size set of inputs π‘Œ βŠ† 𝐽 such that for all 𝑑 ∈ 𝐷 , 𝑑 𝑦 β‰  𝑔 𝑦 for more than a π‘Ÿ -fraction of inputs 𝑦 ∈ π‘Œ .

  25. Data Complexity and Circuit Complexity The data complexity of testing 𝑔 is β€œsmall” If and only if The circuit complexity of 𝑔 is β€œlarge”

  26. Data Complexity and Circuit Complexity Theorem 1.2: Let function 𝑔: {0,1} βˆ— β†’ {0,1} and 𝑇(π‘œ) β‰₯ 2π‘œ for all π‘œ Hard to test! 1. If 𝑔 is in SIZE ( 𝑇(π‘œ) ), the data complexity of testing size- 𝑑 circuits for 𝑔 is at least 2 }(~ ‒€ k ) Easy to test! 2. If 𝑔 is not in SIZE ( π‘œ u 𝑇(π‘œ) ), the data complexity of testing size- 𝑑 circuits for 𝑔 is at most 𝑃(2 ~ ‒€ k + 𝑇 Bc 𝑑 u 𝑑 f log 𝑑)

  27. Proof Intuition Replace data complexity with time complexity When 𝑔 has large circuit complexity, we can quickly test circuits for 𝑔 Suppose 𝑇(π‘œ) is a lower bound on circuit complexity of 𝑔 3 Given a size- 𝑑 circuit, we may have to try all 2 3 < 2 ~ ‒€ (k) inputs As circuit complexity 𝑇(π‘œ) increases, time complexity 2 ~ ‒€ (k) decreases Time complexity with respect to circuit size 𝑑

  28. Proof Idea Pt. 1 If f is in SIZE ( 𝑇(π‘œ) ), the data complexity of testing size- 𝑑 circuits for 𝑔 is at least 2 }(~ ‒€ k ) 1. Suppose a circuit 𝑑 of size 𝑇(π‘œ) that computes 𝑔 2. Construct a circuit 𝑑 Ζ’ of size 𝑇’(π‘œ) = 𝑇(π‘œ) + π‘œ that agrees with 𝑑 on all inputs except 𝑦 3. Thus, for any π‘œ -input circuit of size 𝑇’ π‘œ we must include all inputs of length π‘œ β‰₯ 𝑇 Bc 𝑑 in a test set for 𝑔

Recommend


More recommend