  1. A FINE-GRAINED APPROACH TO ALGORITHMS AND COMPLEXITY Virginia Vassilevska Williams Stanford University

  2. THE CENTRAL QUESTION OF ALGORITHMS RESEARCH
  "How fast can we solve fundamental problems, in the worst case?" etc.

  3. ALGORITHMIC TECHNIQUES
  Divide and conquer, dynamic programming, greedy, backtracking, iterative approaches, kernelization, color-coding, inclusion-exclusion, iterative compression, sum of squares, linear programming, semidefinite programming, specialized data structures.

  4. HARD PROBLEMS
  For many problems, the known techniques get stuck:
  • Very important computational problems from diverse areas
  • They have simple, often brute-force, classical algorithms
  • No improvements in many decades!

  5. Example 1: A CANONICAL HARD PROBLEM: k-SAT
  Input: variables x_1, …, x_n and a formula F = C_1 ∧ C_2 ∧ … ∧ C_m so that each C_i is of the form (y_1 ∨ y_2 ∨ … ∨ y_k) and, for all i, y_i is either x_t or ¬x_t for some t.
  Output: a Boolean assignment to {x_1, …, x_n} that satisfies all the clauses, or NO if the formula is not satisfiable.
  Brute-force algorithm: try all 2^n assignments.
  Best known algorithm: O(2^(n - cn/k) n^d) time for constants c, d; this goes to 2^n as k grows.
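The brute-force 2^n algorithm above can be sketched as follows. This is a minimal illustration, not code from the talk; the function name and the clause encoding (DIMACS-style signed integers, where literal t means x_t and -t means ¬x_t) are my own assumptions.

```python
from itertools import product

def brute_force_sat(num_vars, clauses):
    """Try all 2^n assignments. Clauses are lists of signed 1-indexed
    literals: t means x_t is true, -t means x_t is false (assumed encoding)."""
    for assignment in product([False, True], repeat=num_vars):
        # A clause is satisfied if any of its literals is; the formula
        # is satisfied if every clause is.
        if all(any((lit > 0) == assignment[abs(lit) - 1] for lit in clause)
               for clause in clauses):
            return assignment  # satisfying assignment found
    return None  # formula is unsatisfiable

# (x1 ∨ x2) ∧ (¬x1 ∨ x2) ∧ (x1 ∨ ¬x2)
print(brute_force_sat(2, [[1, 2], [-1, 2], [1, -2]]))  # → (True, True)
```

The loop over `product([False, True], repeat=num_vars)` is exactly the 2^n enumeration the slide refers to; the improved O(2^(n - cn/k)) algorithms avoid examining all assignments explicitly.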

  6. Example 2: ANOTHER HARD PROBLEM: LONGEST COMMON SUBSEQUENCE (LCS)
  Given two strings on n letters, find a subsequence of both strings of maximum length, e.g.:
  ATCGGGTTCCTTAAGGG
  ATTGGTACCTTCAGG
  Algorithms: classical O(n^2) time; best known: O(n^2 / log^2 n) time [MP'80].
  Applications both in computational biology and in spellcheckers. Solved daily on huge strings! (Human genome: 3 × 10^9 base pairs.)
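The classical O(n^2) algorithm mentioned above is the standard dynamic program; a minimal sketch (function name mine, not from the talk):

```python
def lcs_length(a, b):
    """Classical O(n^2) dynamic program:
    dp[i][j] = length of the LCS of a[:i] and b[:j]."""
    n, m = len(a), len(b)
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if a[i - 1] == b[j - 1]:
                # Last characters match: extend the LCS of the prefixes.
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                # Otherwise drop the last character of one string.
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[n][m]

print(lcs_length("ABCBDAB", "BDCABA"))  # → 4 (e.g. "BCBA")
```

On the genome-scale strings from the slide this quadratic table is exactly what becomes painful, which motivates the fine-grained question of whether n^(2-ε) is possible.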

  7. THE REAL WORLD AND NP-HARD PROBLEMS LIKE K-SAT
  User: "I've got data. I want to solve this algorithmic problem but I'm stuck!"
  Theorist: "I'm sorry, this problem is NP-hard. A fast algorithm for it would resolve a hard problem in CS/math."
  User: "Ok, thanks, I feel better that none of my attempts worked. I'll use some heuristics."

  8. THE REAL WORLD AND EASIER PROBLEMS LIKE LCS
  User: "I've got data. I want to solve this algorithmic problem but I'm stuck!"
  Theorist: "Great news! Your problem is in P. Here's an O(n^2) time algorithm!"
  User: "But my data size n is huge! Don't you have a faster algorithm?"
  Theorist: "Uhm, I don't know… This is already theoretically fast… For some reason I can't come up with a faster algorithm for it right now…"
  User: "?!? … Should I wait? … Or should I be satisfied with heuristics?"

  9. IN THEORETICAL CS, POLYNOMIAL TIME = EFFICIENT/EASY.
  This is for a variety of reasons. E.g., composing two efficient algorithms results in an efficient algorithm. Also, model-independence.
  However, no one would consider an O(n^100) time algorithm efficient in practice. If n is huge, then O(n^2) can also be inefficient.

  10. WE ARE STUCK ON MANY PROBLEMS, EVEN JUST IN O(n^2) TIME
  No n^(2-ε) time algorithms known for:
  • Many string matching problems: edit distance, sequence local alignment, LCS, jumbled indexing, …
  General form: given two sequences of length n, how similar are they? All variants can be solved in O(n^2) time by dynamic programming.
  ATCGGGTTCCTTAAGGG
  ATTGGTACCTTCAGG
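Edit distance, the first variant listed, has the same O(n^2) dynamic-programming shape as LCS; a minimal sketch (function name mine):

```python
def edit_distance(a, b):
    """O(n^2) dynamic program for edit distance with unit-cost
    insertions, deletions, and substitutions."""
    n, m = len(a), len(b)
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dp[i][0] = i  # delete all of a[:i]
    for j in range(m + 1):
        dp[0][j] = j  # insert all of b[:j]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,          # delete a[i-1]
                           dp[i][j - 1] + 1,          # insert b[j-1]
                           dp[i - 1][j - 1] + cost)   # match/substitute
    return dp[n][m]

print(edit_distance("kitten", "sitting"))  # → 3
```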

  11. WE ARE STUCK ON MANY PROBLEMS, EVEN JUST IN O(n^2) TIME
  No n^(2-ε) time algorithms known for:
  • Many string matching problems
  • Many problems in computational geometry: e.g., given n points in the plane, are any three collinear? A very important primitive!
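The collinearity primitive can be solved in roughly quadratic time by grouping, for each anchor point, the other points by their direction from the anchor. This sketch (names and details mine, assuming distinct input points) illustrates the quadratic baseline that no one knows how to beat to n^(2-ε):

```python
from fractions import Fraction

def has_collinear_triple(points):
    """For each anchor point, hash the slope from the anchor to every other
    point; a repeated slope means two points lie on one line through the
    anchor, giving a collinear triple. Roughly O(n^2) with hashing.
    Assumes the points are distinct."""
    for i, (ax, ay) in enumerate(points):
        seen = set()
        for j, (bx, by) in enumerate(points):
            if i == j:
                continue
            dx, dy = bx - ax, by - ay
            if dx == 0:
                key = ('vertical',)  # vertical line through the anchor
            else:
                # Fraction normalizes signs, so points on opposite sides
                # of the anchor along the same line get the same key.
                key = ('slope', Fraction(dy, dx))
            if key in seen:
                return True
            seen.add(key)
    return False

print(has_collinear_triple([(0, 0), (1, 2), (2, 4)]))  # → True
```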

  12. WE ARE STUCK ON MANY PROBLEMS, EVEN JUST IN O(n^2) TIME
  No n^(2-ε) time algorithms known for:
  • Many string matching problems
  • Many problems in computational geometry
  • Many graph problems in sparse graphs: e.g., given an n-node, O(n)-edge graph, what is its diameter? Fundamental problem. Even approximation algorithms seem hard!
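For the diameter example: the obvious algorithm runs BFS from every node, which on a sparse graph with m = O(n) edges is O(n·m) = O(n^2) time, and that is the bound we are stuck at. A minimal sketch (representation and names mine, assuming a connected unweighted graph):

```python
from collections import deque

def diameter(adj):
    """Diameter of a connected, unweighted graph given as an adjacency
    list (dict: node -> list of neighbors). BFS from every source:
    O(n*m) time, i.e. O(n^2) when m = O(n)."""
    best = 0
    for src in adj:
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        # Eccentricity of src; the diameter is the maximum eccentricity.
        best = max(best, max(dist.values()))
    return best

# Path on 4 nodes: the endpoints are 3 hops apart.
print(diameter({0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}))  # → 3
```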

  13. WE ARE STUCK ON MANY PROBLEMS, EVEN JUST IN O(n^2) TIME
  No n^(2-ε) time algorithms known for:
  • Many string matching problems
  • Many problems in computational geometry
  • Many graph problems in sparse graphs
  • Many other problems …
  Why are we stuck? Are we stuck because of the same reason?

  14. PLAN
  • Traditional hardness in complexity
  • A fine-grained approach
  • New developments

  15. COMPUTATIONAL COMPLEXITY
  Class hierarchy (innermost to outermost): AC0, Logspace, P, NP, PSPACE.
  • PSPACE: QBF with k alternations is solvable in 2^(n-Ω(k)) time; QBF gets easier as the number of quantifiers increases...
  • NP: best SAT algorithm: 2^n.
  • P: even O(n^2) time is inefficient; P contains the "1000-clique" problem, with best known runtime Ω(n^790).
  This traditional complexity-class approach says little about runtime!

  16. WHY IS K-SAT HARD?
  NP-completeness addresses runtime, but it is too coarse-grained! It also does not apply to problems in P, unless P=NP. (Here N = size of input.)
  Theorem [Cook'71, Karp'72]: k-SAT is NP-complete for all k ≥ 3.
  That is, if there is an algorithm that solves k-SAT instances on n variables in poly(n) time, then all problems in NP have poly(N) time solutions, and so P=NP.
  k-SAT (and all other NP-complete problems) are considered hard because fast algorithms for them imply fast algorithms for many important problems.

  17. TIME HIERARCHY THEOREMS
  For most natural computational models one can prove: for any constant c, there exist problems solvable in O(n^c) time but not in O(n^(c-ε)) time for any ε > 0.
  It is completely unclear how to show that a particular problem solvable in O(n^c) time is not in O(n^(c-ε)) time for any ε > 0. Unconditional lower bounds seem hard.
  In fact, it is not even known whether SAT is in linear time!
  We instead develop a fine-grained theory of hardness that is conditional and mimics NP-completeness.

  18. PLAN
  • Traditional hardness in complexity
  • A fine-grained approach
  • Fine-grained reductions lead to new algorithms

  19. FINE-GRAINED HARDNESS
  Idea: mimic NP-completeness.
  1. Identify key hard problems
  2. Reduce these to all (?) other problems believed hard
  3. Hopefully form equivalence classes
  Goal: understand the landscape of algorithmic problems

  20. CNF SAT IS CONJECTURED TO BE REALLY HARD
  Two popular conjectures about SAT on n variables [IPZ01]:
  ETH: 3-SAT requires 2^(δn) time for some constant δ > 0.
  SETH: for every ε > 0, there is a k such that k-SAT on n variables and m clauses cannot be solved in 2^((1-ε)n) poly(m) time.
  So we can use k-SAT as our hard problem and ETH or SETH as the conjecture we base hardness on.

  21. More key problems to blame: Orthogonal Vectors, 3SUM, APSP. Recent research [CGIMPS'16] suggests these problems are not equivalent!
  Orthogonal Vectors (OV): given a set S of n vectors in {0,1}^d, for d = ω(log n), are there u, v ∈ S with u · v = 0? Easy O(n^2 d) time algorithm. Best known [AWY'15]: n^(2 - Θ(1/log(d/log n))) time. Conjecture: OV requires n^(2-o(1)) time. [W'04]: SETH implies this conjecture!
  3SUM: given a set S of n integers, are there a, b, c ∈ S with a + b + c = 0? Easy O(n^2) time algorithm. [BDP'05]: ~n^2/log^2 n time for integers; [GP'14]: ~n^2/log n time for reals. Conjecture: 3SUM requires n^(2-o(1)) time.
  All-Pairs Shortest Paths (APSP): given an n-node weighted graph, find the distance between every two nodes. Classical algorithms: O(n^3) time. [W'14]: n^3/exp(√log n) time. Conjecture: APSP requires n^(3-o(1)) time.
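The "easy" baselines for OV and 3SUM on this slide can be sketched directly; these are my own minimal illustrations of the O(n^2 d) and O(n^2) algorithms, with names of my choosing:

```python
def has_orthogonal_pair(vectors):
    """Easy O(n^2 d) baseline for Orthogonal Vectors: test every pair
    of 0/1 vectors for a zero inner product."""
    n = len(vectors)
    for i in range(n):
        for j in range(i + 1, n):
            if all(x * y == 0 for x, y in zip(vectors[i], vectors[j])):
                return True
    return False

def has_3sum(nums):
    """Classical O(n^2) baseline for 3SUM: sort, then for each element
    sweep two pointers looking for a pair completing a zero-sum triple."""
    s = sorted(nums)
    n = len(s)
    for i in range(n - 2):
        lo, hi = i + 1, n - 1
        while lo < hi:
            total = s[i] + s[lo] + s[hi]
            if total == 0:
                return True
            if total < 0:
                lo += 1  # need a larger value
            else:
                hi -= 1  # need a smaller value
    return False

print(has_orthogonal_pair([[1, 0, 1], [0, 1, 0]]))  # → True
print(has_3sum([-5, 2, 3, 10]))                     # → True
```

The conjectures on the slide assert that, up to the shaved log factors cited there, neither quadratic loop structure can be fundamentally beaten.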

  22. WORK ON APSP
  Classical problem, long history:
  Author(s)         Year    Runtime
  Floyd, Warshall   1962    n^3
  Fredman           1976    n^3
  Takaoka           1992    n^3
  Dobosiewicz       1992    n^3
  Han               2004    n^3
  Takaoka           2004    n^3
  Zwick             2004    n^3
  Chan              2005    n^3
  Han               2006    n^3
  Chan              2007    n^3
  Han, Takaoka      2012    n^3
  Williams          2014    n^3
  (Each entry improves the previous by lower-order factors only; every runtime remains n^(3-o(1)).)
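The 1962 starting point of this table is the classical Floyd–Warshall triple loop; a minimal sketch (matrix representation mine):

```python
def floyd_warshall(dist):
    """Classical O(n^3) all-pairs shortest paths (Floyd-Warshall, 1962).
    dist is an n x n matrix of edge weights, with float('inf') where
    there is no edge and 0 on the diagonal; it is updated in place to
    the matrix of shortest-path distances."""
    n = len(dist)
    for k in range(n):  # allow k as an intermediate vertex
        for i in range(n):
            for j in range(n):
                if dist[i][k] + dist[k][j] < dist[i][j]:
                    dist[i][j] = dist[i][k] + dist[k][j]
    return dist

INF = float('inf')
d = floyd_warshall([
    [0, 3, INF],
    [INF, 0, 1],
    [7, INF, 0],
])
print(d[0][2])  # → 4 (the 0 -> 1 -> 2 path: 3 + 1)
```

Everything after the first row of the table is about shaving lower-order factors off this cubic triple loop; whether n^(3-ε) is possible is exactly the APSP conjecture.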

  23. FINE-GRAINED HARDNESS
  Idea: mimic NP-completeness.
  1. Identify key hard problems
  2. Reduce these to all (?) other hard problems
  3. Hopefully form equivalence classes
  Goal: understand the landscape of algorithmic problems

  24. FINE-GRAINED REDUCTIONS
  Intuition: a(n) and b(n) are the naive runtimes for A and B. "A reducible to B" means that beating the naive runtime for B implies also beating the naive runtime for A.
  • A is (a,b)-reducible to B if for every ε > 0 there is a δ > 0 and an O(a(n)^(1-δ)) time algorithm that adaptively transforms any A-instance of size n into B-instances of sizes n_1, …, n_k so that Σ_i b(n_i)^(1-ε) < a(n)^(1-δ).
  • If B is in O(b(n)^(1-ε)) time, then A is in O(a(n)^(1-δ)) time.
  • Focus on exponents.
  • We can build equivalences.
  Next: an example.

  25. AN EXAMPLE FINE-GRAINED EQUIVALENCE
  Theorem [VW'10]: Boolean matrix multiplication (BMM) is equivalent to Triangle detection under subcubic fine-grained reductions.
  BMM: given two n × n Boolean matrices X and Y, return the n × n matrix Z where for all i and j, Z[i,j] = OR_k (X[i,k] AND Y[k,j]).
  Triangle detection: given an n-node graph G, does it contain three vertices a, b, c such that (a,b), (b,c), (c,a) are all edges?
  We will show that (1) an O(n^(3-ε)) time algorithm for BMM can give an O(n^(3-ε)) time triangle algorithm, and (2) an O(n^(3-ε)) time algorithm for triangle can give an O(n^(3-ε/3)) time BMM algorithm.
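Direction (1) of the theorem can be sketched concretely: square the adjacency matrix with one BMM call and check whether any edge closes a two-step path. This is my own illustration using a naive cubic BMM as a placeholder; the point of the reduction is that any faster BMM routine could be dropped in for `bmm`.

```python
def bmm(X, Y):
    """Naive O(n^3) Boolean matrix multiplication:
    Z[i][j] = OR over k of (X[i][k] AND Y[k][j])."""
    n = len(X)
    return [[any(X[i][k] and Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def has_triangle(adj):
    """Triangle detection via one BMM call on the adjacency matrix:
    (A^2)[i][j] is true iff some k has edges (i,k) and (k,j), so a
    triangle exists iff some edge (i,j) also has (A^2)[i][j] true."""
    sq = bmm(adj, adj)
    n = len(adj)
    return any(adj[i][j] and sq[i][j] for i in range(n) for j in range(n))

print(has_triangle([[0, 1, 1], [1, 0, 1], [1, 1, 0]]))  # → True (K3)
```

The final scan over pairs costs only O(n^2), so an O(n^(3-ε)) BMM yields an O(n^(3-ε)) triangle algorithm, exactly as claim (1) states; direction (2) is the nontrivial part of [VW'10].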
