Foundations of Distributed Computing in the 2020s Jukka Suomela

What are the theoretical foundations of the modern society? • Modern world ≈ large-scale communication networks • Physical side: • practice: computers, network equipment, laser, fiber optics, radio … • solid theoretical foundations: electromagnetism, quantum mechanics … • Logical side: • practice: communication protocols, networked applications … • solid theoretical foundations: ??? 2

Logical foundations of large communication networks • Computers: • theory of computation, computability, computational complexity … • Communication between computers: • information theory, communication complexity theory … • Computation in a network as a whole: • theory of distributed computing Our focus today 3

Logical foundations of computers vs. computer networks • Theory of computation: Which tasks can be solved efficiently with a computer? • Theory of distributed computing: Which tasks can be solved efficiently in a large computer network? 4

Logical foundations of computers vs. computer networks • Example: solving graph problems • Theory of computation: • “Here is a graph that is given as a string on a Turing machine tape” • How many steps does a Turing machine need to solve this problem? • Theory of distributed computing: • “I am a node in the middle of a very large graph” • How far do I need to see to pick my own part of the solution? • How much of the graph do I need to see? • How many communication rounds are needed to solve the problem? 5

Local: am I part of a triangle? O (1) distance 1 0 1 1 0 0 0 0 0 0 0 0 0 Global: how far am I from the nearest triangle? Θ( n ) distance 0 1 0 0 1 2 3 4 5 6 7 8 9 6

Logical foundations of computers vs. computer networks • Theory of computation: • e.g. hugely influential framework of NP-completeness (1970s) • Theory of distributed computing: • studied actively already since the 1980s • but we have only very recently started to really understand e.g. locality • solid theoretical foundations still largely missing • lots of progress in the 2010s, tons of work left for the 2020s 7

Distributed computing before the 2010s 8

Standard models 7 21 18 5 of computing 8 4 6 11 • LOCAL model 14 2 • input graph = computer network 10 • initially: each node has a unique ID + its own part of input • communication round: each node sends a message to each neighbor • finally: each nodes stops and outputs its own part of the solution • CONGEST model • bounded-size messages Number of rounds • Port-numbering model = time = distance • no unique IDs 9

Some important ideas 2 3 3 and concepts 1 2 1 1 2 1 3 2 • Solving vs. checking 3 • finding a solution vs. verifying a solution • cf. deterministic vs. nondeterministic Turing machines, P vs. NP • Problem family of “ locally checkable labelings ” ( LCL s) • O (1) input labels, O (1) output labels, max degree O (1) • verification: check each radius- O (1) neighborhood Example: • Naor & Stockmeyer (1993, 1995) vertex coloring • Proof labeling schemes with 3 colors • Korman, Kutten, Peleg (2005) 10

maximal maximal Four key independent set matching problems ( Δ +1)-vertex (2 Δ− 1)-edge coloring coloring • Key primitives for symmetry breaking • e.g. input is a symmetric cycle → output has to break symmetry • Trivial linear-time centralized algorithms • e.g. maximal matching: pick non-adjacent edges until stuck • Can we solve these efficiently in a distributed setting? 11

maximal maximal Four key independent set matching problems ( Δ +1)-vertex (2 Δ− 1)-edge coloring coloring • Pioneering work on upper bounds: • Cole & Vishkin (1986), Luby (1985, 1986), Alon, Babai, Itai (1986), Israeli & Itai (1986), Panconesi & Srinivasan (1996), Hanckowiak, Karonski, Panconesi (1998, 2001), Panconesi & Rizzi (2001) … • Pioneering work on lower bounds: • Linial (1987, 1992), Naor (1991), Kuhn, Moscibroda, Wattenhofer (2004) 12

maximal maximal Four key independent set matching problems ( Δ +1)-vertex (2 Δ− 1)-edge coloring coloring • Still wide gaps between upper and lower bounds • Role of randomness poorly understood 13

Early days: summary • Lots of work focused on specific problems • proving upper & lower bounds for problem X • connecting complexity of problem X through reductions to problem Y • Not so much effort in understanding the overall landscape of distributed computational complexity • what are the meaningful classes of problems? • what can we prove about entire classes of problems? • We were lacking general-purpose techniques for studying distributed computing 14

Some highlights of distributed computing in the 2010s 15

From the 2010s: Classification of LCLs 16

LCL problems • Examples of LCL problems (in graphs of max degree Δ = O (1)): • (Δ+1)- coloring , Δ -coloring , 3 -coloring … • maximal independent set , maximal matching … • sinkless orientation • orient all edges • all nodes of degree ≥ 3 have outdegree ≥ 1 Can we say • locally optimal cut something • label nodes black/white about all • at least half of the neighbors have opposite color • SAT (when interpreted as a graph problem) of these? • many other constraint satisfaction problems 17

Landscape of n LCL problems log n Randomized log log n time complexity log ∗ n Deterministic time complexity log log ∗ n 1 log log ∗ n log ∗ n 1 log log n log n n 18

Landscape of randomized n LCL problems log n log log n Θ (log n) deterministic log ∗ n Θ (log log n) log log ∗ n randomized 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 19

Landscape of randomized n LCL problems Trivial log n log log n log ∗ n log log ∗ n Trivial 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 20

Landscape of randomized n LCL problems log n log log n Maximal independent set log ∗ n Cole & Vishkin 1986 Linial 1987, 1992 log log ∗ n Naor 1991 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 21

randomized State of the n art 1992 log n log log n log ∗ n log log ∗ n 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 22

randomized State of the n art 2015 log n log log n log ∗ n ??? log log ∗ n 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 23

randomized State of the n Balliu et al. 2018b Chang & Pettie 2017 art 2019 Balliu et al. 2018a Ghaffari et al. 2018 log n Balliu et al. 2019 Chang & Pettie 2017 Fischer & Ghaffari 2017 Cole & Vishkin 1986 log log n Linial 1992 Rozhon & Ghaffari 2019 Naor 1991 Brandt et al. 2016 log ∗ n Chang et al. 2016 Balliu et al. 2018a Ghaffari & Su 2017 Chang et al. 2016 log log ∗ n Chang & Pettie 2017 Naor & Stockmeyer 1995 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 24

randomized State of the n art 2019 log n log log n log ∗ n log log ∗ n 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 25

Four classes of randomized n graph problems log n log log n log ∗ n log log ∗ n 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 26

Gaps randomized n log n log log n log ∗ n log log ∗ n 1 deterministic log log ∗ n log ∗ n 1 log log n log n n 27

Gaps have direct algorithmic implications If you can solve an LCL problem • in o (log n ) rounds with a deterministic algorithm or • in o (log log n ) rounds with a randomized algorithm then you can also solve it • in O (log* n ) rounds with a deterministic algorithms 28

Gaps have direct complexity-theoretic implications If you can show that there is no O (log* n ) -time deterministic algorithm then: • deterministic complexity is at least Ω(log n ) • randomized complexity is at least Ω(log log n ) 29

From the 2010s: Complexity of maximal independent set & maximal matching 30

2 of 4 key problems maximal maximal independent set matching well understood ( Δ +1)-vertex (2 Δ− 1)-edge coloring coloring • Maximal independent set & matching: • deterministic O (Δ + log* n ) • deterministic poly(log n ) • randomized O (log Δ) + poly(log log n ) • cannot improve any of these much • Upper bound: Rozhon & Ghaffari (2019) + many others • a new algorithm for deterministic network decomposition • Lower bound: Balliu et al. (2019) • based on the “round elimination” technique 31

From the 2010s: Round elimination technique 32

Round elimination technique • Given: • algorithm A 0 solves problem P 0 in T rounds • We construct: • algorithm A 1 solves problem P 1 in T − 1 rounds • algorithm A 2 solves problem P 2 in T − 2 rounds • algorithm A 3 solves problem P 3 in T − 3 rounds … • algorithm A T solves problem P T in 0 rounds • But P T is nontrivial, so A 0 cannot exist 33

Foundations of Distributed Computing in the 2020s Jukka Suomela - PowerPoint PPT Presentation

Foundations of Distributed Computing in the 2020s Jukka Suomela What are the theoretical foundations of the modern society? Modern world large-scale communication networks Physical side: practice: computers, network equipment,

Foundations of Distributed Computing in the 2020s Jukka Suomela Aalto University, Finland What

recap to this point foundations foundations foundations foundations genetics =

Boosting: Foundations and Algorithms Boosting: Foundations and Algorithms Boosting: Foundations

Anticipating the European Supercomputing Infrastructure of the Early 2020s Thomas C. Schulthess

The Fourth Carbon Budget Reducing emissions through the 2020s Committee on Climate Change,

Hierarchical Probabilistic Inference of Cosmic Shear Astronomy in the 2020s: Synergies with WFIRST

On safety in distributed computing Srivatsan Ravi On safety in distributed computing Safety in

Distributed Systems (ICE 601) Distributed Transactions Dongman Lee ICU Class Overview

Unleashing Talent in A Distributed Workforce C O R E N E T 2 0 2 0 HACKATHON: DISTRIBUTED W O R K

BUILDING THE FOUNDATIONS OF A WORLD BUILDING THE FOUNDATIONS OF A WORLD CLASS BUILDING THE

For personal use only BUILDING THE FOUNDATIONS OF A WORLD BUILDING THE FOUNDATIONS OF A WORLD

For personal use only BUILDING THE FOUNDATIONS OF A WORLD BUILDING THE FOUNDATIONS OF A WORLD

Outline Foundations of Data and Knowledge Systems EPCL Basic Training Camp 2012 3. Foundations

DISTRIBUTED SYSTEMS Department of Computing Science Umea University Distributed Systems - D N

Collaborative Human Computing Zack Zhu March 31, 2010 Seminar for Distributed Computing 1

Distributed Databases Distributed database management system A distributed database (DDB) is

Haldin Company Presentation Haldin, Introduction Haldin is an Indonesian based manufacturing

Ebook Free Library Portfolio Presentation For Fashion Designers Designer's portfolios are their

On Equitable Coloring q g for Strong Product of T Two cycles l Advisor : Yung-Ling Lai St d

The Hebrew-Christian Messiah; Or, the Presentation of the Messiah to the The Hebrew-Christian

Upcoming changes to DEAP methodology Update from: Fiona Smith (Marketing and Communications

Bu Bubb bble ba bath ths and and be better da data Getting a better understanding of

campus site plan completjon of 2008 bond buildout SE GRANT ST. SE 80TH AVE. MT. TABOR HALL SE

STEPS TO ACADEMIC SUCCESS STEPS TO ACADEMIC SUCCESS To create a To awarene ness a and nd und