Simplicity and Complexity of Belief-Propagation Elchanan Mossel 1 1 - PowerPoint PPT Presentation

Simplicity and Complexity of Belief-Propagation Elchanan Mossel 1 1 MIT July 2020 Elchanan Mossel Simplicity & Complexity of BP

A Double phase transition for large q Theorem (Count Reconstruction, Robust Reconstruction (Mossel-Peres, Janson-Peres)) For all q and d-ary tree, d θ 2 = 1 is the threshold for: census and robust reconstruction. Theorem (Reconstruction for large q (Mossel 00)) If d θ > 1 then for q > q θ can distinguish the root better than random: h →∞ Var [ E [ X 0 | X L h ]] > 0 lim = ⇒ Non-linear estimators are superior. Pf: Shows fractal nature of information. Elchanan Mossel Simplicity & Complexity of BP

Proof sketch For q = ∞ , clearly threshold is d θ = 1. For finite q , d = 2, fix θ such that d θ > 1. Inference: Infer root color to be c if there is an ℓ -diluted binary subtree T ′ ⊂ T with root at 0 and where all leaves have color c . Exercise 1: There exists an ℓ, ε > 0 such that if the root is c , the probability that such a tree exists is at least ε . Exercise 2: For all ε > 0, if q is sufficiently large, and if the root is not c , the probability that there is an ℓ -diluted 2 ℓ − 1 tree with all the leaves of color � = c is at least 1 − ε/ 10. Exercise 3: Prove that if d λ ≤ 1, then the root and leaves are asymptotically independent. Elchanan Mossel Simplicity & Complexity of BP

More detailed Picture Sly 11: Defined magnetization m n = E [ M n ] such that if m n is small then: m n +1 = d θ 2 m n + (1 + o (1)) d ( d − 1) q ( q − 4) q − 1 θ 4 m 2 n . 2 = ⇒ if q ≥ 5, the KS bound is not tight. Also proved that if q = 3 and d ≥ d min is large then KS bound is tight. M-01: For general Markov chains, can have λ 2 ( M ) = 0, yet root and leaves are not independent. Exercise: Prove this for following chain on F 2 2 . M ( x , y ) = ( r , r ⊕ x ) or ( r , r ⊕ y ) with probability 1 / 2 each. More sophisticated examples in Mossel-Peres. Elchanan Mossel Simplicity & Complexity of BP

Two conjectures about inference Consider a model where different edges have different θ ’s. Let q so that for θ ∈ ( θ R , θ KS ), Var [ E [ X 0 | X h ]] → α > 0. Conj 1: There is no estimator f such that f ( X h ) and X 0 have no negligible correlation for all models with θ ( e ) ∈ ( θ R , θ KS ) for all edges. Conj 2: It is “impossible” to recover phylogenetic trees using O ( h ) samples under the conditions above. Strong version of impossible would mean information theoretically. Weak version would mean computationally. Elchanan Mossel Simplicity & Complexity of BP

Part 3 : Complexity of BP Part 3: Complexity of BP Elchanan Mossel Simplicity & Complexity of BP

Complexity of BP What is the complexity of BP? Low: Runs in linear time. But: Uses real numbers - it this necessary? But: Uses depth - is this necessary? Fractal picture suggests maybe depth is needed. Elchanan Mossel Simplicity & Complexity of BP

Understanding the Omnipresence What is everywhere and understand everything? “Omnipresence”. A: The deep-net on your smartphone that understands you. Elchanan Mossel Simplicity & Complexity of BP

Deep Inference? Mathematically, it is natural to ask if there are data generative process satisfying 3 natural criteria: 1. Realism: Reasonable data models. ∨ 2. Reconstruction: Provable efficient algorithms to reverse engineer the generative process. ∨ (phylogenetic reconstruction). 3. Depth: Proof that depth is needed. ??? 4. Also: why does BP use real numbers, when the generating process is discrete? Elchanan Mossel Simplicity & Complexity of BP

Precision in BP Q: What are the memory requirements for BP? Conjecture (EKPS-00): For q = 2, any recursive algorithm on the tree which uses at most B bits of memory per node can only distinguish the root value better then random if θ < θ ( B ) where d θ ( B ) 2 > 1. Thm:(Jain-Koehler-Liu-M-19): Conjecture is true: θ ( B ) − θ = B − O (1) . Elchanan Mossel Simplicity & Complexity of BP

Problem Setup generation tree X 1 (broadcast model) X 2 X 3 X 4 X 5 X 6 X 7 . . . . . . . . . . . . Y 4 Y 5 Y 6 Y 7 Y 2 Y 3 reconstruction (message passing) Y 1 Elchanan Mossel Simplicity & Complexity of BP

Problem Setup (cont.) X 1 Broadcast process on X 2 X 3 d -regular tree of height h . X 4 X 5 X 6 X 7 . . . . . . Each reconstruction Y i = f i ( Y 2 i , Y 2 i +1 ) is an arbitrary log L -bit . . . . . . Y 4 Y 5 Y 6 Y 7 string (memory constraint) . Y 2 Y 3 Y 1 Elchanan Mossel Simplicity & Complexity of BP

AC 0 AC 0 := class of bounded depth circuits with AND/OR (unbounded fan) and NOT gates. Thm: Moitra-M-Sandon-20: AC 0 ( X h ) cannot classify X 0 better than random. Is this trivial? Maybe not: Thm MMS-20: AC 0 generates leaf distributions. Elchanan Mossel Simplicity & Complexity of BP

TC 0 TC 0 := like AC 0 but with Majority gates. “Bounded depth deep nets”. Thm (MMS-20): When q = 2 and 0 . 9999 < θ < 1, there exists an algorithm A in TC 0 such that lim h P [ A ( X h ) = X 0 ] = lim h P [ BP ( X h ) = X 0 ]. Conj: This is true for all θ when q = 2. So maybe we can classify optimally in TC 0 ? Maybe bounded depth nets suffice? Elchanan Mossel Simplicity & Complexity of BP

NC 1 NC 1 := class of O (log n ) depth circuits with AND/OR (fan 2) and NOT gates. Known that TC 0 ⊂ NC 1 . Open if they are the same. Thm (MMS-20): One can classify as well as BP in NC 1 . Thm (MMS-20): There is a broadcast process for which classifying better than random is NC 1 -complete. So, unless TC 0 = NC 1 , log n depth is needed. Elchanan Mossel Simplicity & Complexity of BP

The KS bound and Circuit Complexity The threshold 2 θ 2 = 1 is called the Kesten-Stigum threshold. Above this threshold it is known that one neuron can classify the root better than random (Kesten-Stigum-66). Below this threshold, one neuron cannot (M-Peres-04). Below this threshold, with enough i.i.d. noise on the leaves, BP becomes trivial (Janson-M-05). Related to “Replica Symmetry Breaking” in statistical physics models (Mezard-Montanari-06). Conjecture (MMS-20): For any broadcast process, below the KS bound and where BP classifies better than random, classification is NC 1 -complete. Elchanan Mossel Simplicity & Complexity of BP

Conclusion BP is simple: Runs in linear time. Above KS bound behaves like a Linear Algorithm. BP is complex: Below KS bound, tend to be fractal. Statistical/computation gaps. Requires depth / precision. Elchanan Mossel Simplicity & Complexity of BP

Simplicity and Complexity of Belief-Propagation Elchanan Mossel 1 1 - PowerPoint PPT Presentation

Simplicity and Complexity of Belief-Propagation Elchanan Mossel 1 1 MIT July 2020 Elchanan Mossel Simplicity & Complexity of BP A Double phase transition for large q Theorem (Count Reconstruction, Robust Reconstruction (Mossel-Peres,

Simplicity and Complexity of Belief-Propagation Elchanan Mossel 1 1 MIT Dec 2020 Elchanan Mossel

PLANT PROPAGATION An Overview of Plant Propagation Methods Two Techniques of Stem Cutting

Increased Simplicity, Reduced Footprint Simplicity Size Performance Safety Intuitive

Overview Independence Belief Networks Conditional Independence Belief networks Chris

26:198:722 Expert Systems I Dempster-Shafer Belief Functions I Combining Belief Functions I Types

Welcome to Form Simplicity Training Basic use Log-in to Form Simplicity through the web address

Introduction: Belief vs Degrees of Belief Hannes Leitgeb LMU Munich October 2014 My three

Shuffled Belief Propagation Decoding Juntan Zhang and Marc Fossorier Department of Electrical

An empirical study of Gaussian belief propagation and application in the detection of F-formations

THE AMATEURS FRIEND OR Enemy A short course on Propagation Propagation What is it? What

1 How to deal with Radio Propagation How to deal with Radio Propagation Where are you from?

Physical of radio propagation Two types of propagation models

Complexity and Economics The lowest simplicity you can achieve is the complexity Jo ao Basilio

Complexity and Economics The lowest simplicity you can achieve is the complexity Jo ao Basilio

Data Mining with Weka Class 3 Lesson 1 Simplicity first! Ian H. Witten Department of Computer

Simplicity Matters Rich Hickey Simplicity is prerequisite for reliability Edsger W. Dijkstra

GRAPH REPRESENTATIONS, BACKPROPAGATION AND BIOLOGICAL PLAUSIBILITY Marco Gori SAILAB,

Various Faces of Data Centric Networking and Systems Eiko Yoneki University of Cambridge

Purpose Learn. Gain valuable feedback from membership Connect. Provide a space for NCDFIs

Native Financial Inclusion Policy Toolkit: Advocating for Stronger Policies in 2018 January 11,

DERP Forum Strengthening Relationships with our Regulatory Partners St. Louis, Missouri May 8-9,

CS 188: Artificial Intelligence Bayes Nets Instructors: Dan Klein and Pieter Abbeel ---

Bayes Networks 2 Robert Platt Northeastern University All slides in this file are adapted from

UNINETT OpenFlow testbed UNINETT OpenFlow testbed Terena Network Architecture Workshop,