Approximating Orthogonal Matrices with Effective Givens - PowerPoint PPT Presentation

Approximating Orthogonal Matrices with Effective Givens Factorization Thomas Frerix Technical University of Munich joint work with Joan Bruna (NYU) Poster #164

Givens Factorization of Orthogonal Matrices   1 ··· 0 0 ··· 0 ··· . . . . . ... . . . . . . .    0 ··· cos( α ) ··· − sin( α ) ··· 0    . . . . G T ( i , j , α ) = ...  . . . .  . . . .   0 ··· sin( α ) ··· cos( α ) ··· 0     . . . ... . . . . .   . . . . 0 ··· 0 ··· 0 ··· 1

Givens Factorization of Orthogonal Matrices   1 ··· 0 0 ··· 0 ··· . . . . . ... . . . . . . .    0 ··· cos( α ) ··· − sin( α ) ··· 0    . . . . G T ( i , j , α ) = ...  . . . .  . . . .   0 ··· sin( α ) ··· cos( α ) ··· 0     . . . ... . . . . .   . . . . 0 ··· 0 ··· 0 ··· 1 Exact Givens Factorization N = d ( d − 1) U = G 1 . . . G N 2

Approximate Givens Factorization Approximate Givens Factorization N ≪ d ( d − 1) U ≈ G 1 . . . G N 2 computationally hard problem

Approximate Givens Factorization Approximate Givens Factorization N ≪ d ( d − 1) U ≈ G 1 . . . G N 2 computationally hard problem Our Questions in this Context 1. Which orthogonal matrices can be effectively approximated? (not all of them)

Approximate Givens Factorization Approximate Givens Factorization N ≪ d ( d − 1) U ≈ G 1 . . . G N 2 computationally hard problem Our Questions in this Context 1. Which orthogonal matrices can be effectively approximated? (not all of them) 2. Which principles are behind effective approximation algorithms? (sparsity-inducing algorithms)

Motivation: Unitary Basis Transform / FFT Advantageous Setting Once computed, applied many times

Motivation: Unitary Basis Transform / FFT Advantageous Setting Once computed, applied many times Unitary Basis Transform d 2 � � � � FFT: O → O d log( d )

Motivation: Unitary Basis Transform / FFT Advantageous Setting Once computed, applied many times Unitary Basis Transform d 2 � � � � FFT: O → O d log( d ) Application: Graph Fourier Transform

Which Matrices can be Effectively Approximated? Theorem � d 2 / log( d ) � Let ǫ > 0 . If N = o , then as d → ∞ ,  � � � � �  → 0 , µ U ∈ U ( d ) inf � U − G n � 2 ≤ ǫ  � G 1 ... G N � n where µ is the Haar measure over U ( d ) .

Which Matrices can be Effectively Approximated? Theorem � d 2 / log( d ) � Let ǫ > 0 . If N = o , then as d → ∞ ,  � � � � �  → 0 , µ U ∈ U ( d ) inf � U − G n � 2 ≤ ǫ  � G 1 ... G N � n where µ is the Haar measure over U ( d ) . • proof is based on an ǫ -covering argument • suggests computational-to-statistical gap together with experimental results (details at poster)

K -planted Distribution over SO ( d ) Sample U = G 1 . . . G K • choose subspace ( i k , j k ) uniformly with replacement • choose rotation angle α k ∈ [0 , 2 π ) uniformly

K -planted Distribution over SO ( d ) Sample U = G 1 . . . G K • choose subspace ( i k , j k ) uniformly with replacement • choose rotation angle α k ∈ [0 , 2 π ) uniformly 1 0 . 8 0 . 6 || U || 0 / d 2 K -planted matrices 0 . 4 quickly become dense 0 . 2 0 0 0 . 2 0 . 4 0 . 6 0 . 8 1 K / d log 2 ( d ) 256 512 1024

Minimizing Sparsity-Inducing Norms over O ( d ) ˆ G T N . . . G T N U ≈ I U = G 1 . . . G N

Minimizing Sparsity-Inducing Norms over O ( d ) ˆ G T N . . . G T N U ≈ I U = G 1 . . . G N Approximation criterion � � � � � U − ˆ � U − ˆ U F , sym := min UP � � � � � � P ∈P d F

Minimizing Sparsity-Inducing Norms over O ( d ) ˆ G T N . . . G T N U ≈ I U = G 1 . . . G N Approximation criterion � � � � � U − ˆ � U − ˆ U F , sym := min UP � � � � � � P ∈P d F Better functions to be minimized greedily? d f ( U ) := d − 1 � U � 1 = d − 1 � � � � U ij � i , j =1

Minimizing Sparsity-Inducing Norms over O ( d ) ˆ G T N . . . G T N U ≈ I U = G 1 . . . G N Approximation criterion � � � � � U − ˆ � U − ˆ U F , sym := min UP � � � � � � P ∈P d F Better functions to be minimized greedily? d f ( U ) := d − 1 � U � 1 = d − 1 � � � � U ij � i , j =1 • Non-convex greedy step • global optimum in O ( d 2 ) amortized time complexity

Thank you Poster #164 https://github.com/tfrerix/givens-factorization

Approximating Orthogonal Matrices with Effective Givens - PowerPoint PPT Presentation

Approximating Orthogonal Matrices with Effective Givens Factorization Thomas Frerix Technical University of Munich joint work with Joan Bruna (NYU) Poster #164 Givens Factorization of Orthogonal Matrices 1 0 0 0

Results for different matrices and comparisons Dense Matrices Rectangular Matrices

Orthogonal Complements and Orthonormal Matrices Orthogonal Complements Defn. For a set W , the

MATHEMATICS 1 CONTENTS Matrices Special matrices Operations with matrices Matrix

Orthogonal range searching Orthogonal range searching Problem: Given a set of n points Orthogonal

Random Orthogonal Polynomials: From matrices to point processes Diane Holcomb, KTH Integrability

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices & quadratic forms)

Latin Squares and Orthogonal Arrays Lucia Moura School of Electrical Engineering and Computer

Cheap Orthogonal Constraints in Neural Networks: A Simple Parametrization of the Orthogonal and

Classification of self-orthogonal F q + u F q -codes Classification of self-orthogonal F q + u F q

Designs of Orthogonal Filter Banks and Orthogonal Cosine-Modulated Filter Banks Jie Yan

Codes from orbit matrices of weakly q -self-orthogonal 1-designs 2 / 19 On some self-orthogonal

Planar orthogonal polynomials and related determinantal processes: random normal matrices and

Asymptotic Analysis of Random Matrices and Orthogonal Polynomials Arno Kuijlaars University of

Matrices with Application to Page Rank Markov Matrices Pagerank Anil Maheshwari

Transformations and Matrices Transformations I Transformations are functions Matrices

Structural Matrices in MDOF Systems Evaluation of Structural Matrices Choice of Property

for BlueGene/P Franz Franchetti 1 , Yevgen Voronenko 2 , Gheorghe Almasi 3 1 Carnegie Mellon

3rd Grade Shapes and Perimeter 2015-11-10 www.njctl.org Slide 3 / 102 Slide 4 / 102 Table of

Synchronising C/C++ and POWER Susmit Sarkar 1 Kayvan Memarian 1 Scott Owens 1 Mark Batty 1 Peter

Warm up Sketch the graph of f ( x ) = ( x 3)( x 2)( x 1) = x 3 6 x 2 + 11 x 6

Exascale-ability Today N=4096 3 12.3 10 12 Flops 1.1 TB of Data 3D FFT Exascale-ability

The tangent FFT D. J. Bernstein University of Illinois at Chicago See online version of paper,

Automatic physical inference with information maximising neural networks Physical Review D 97 ,

Model dependences, uncertain1es, and combined analysis Intro

Approximating Orthogonal Matrices with Effective Givens - PowerPoint PPT Presentation

Approximating Orthogonal Matrices with Effective Givens Factorization Thomas Frerix Technical University of Munich joint work with Joan Bruna (NYU) Poster #164 Givens Factorization of Orthogonal Matrices 1 0 0 0

Results for different matrices and comparisons Dense Matrices Rectangular Matrices

Orthogonal Complements and Orthonormal Matrices Orthogonal Complements Defn. For a set W , the

MATHEMATICS 1 CONTENTS Matrices Special matrices Operations with matrices Matrix

Orthogonal range searching Orthogonal range searching Problem: Given a set of n points Orthogonal

Random Orthogonal Polynomials: From matrices to point processes Diane Holcomb, KTH Integrability

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices &amp; quadratic forms)

Latin Squares and Orthogonal Arrays Lucia Moura School of Electrical Engineering and Computer

Cheap Orthogonal Constraints in Neural Networks: A Simple Parametrization of the Orthogonal and

Classification of self-orthogonal F q + u F q -codes Classification of self-orthogonal F q + u F q

Designs of Orthogonal Filter Banks and Orthogonal Cosine-Modulated Filter Banks Jie Yan

Codes from orbit matrices of weakly q -self-orthogonal 1-designs 2 / 19 On some self-orthogonal

Planar orthogonal polynomials and related determinantal processes: random normal matrices and

Asymptotic Analysis of Random Matrices and Orthogonal Polynomials Arno Kuijlaars University of

Matrices with Application to Page Rank Markov Matrices Pagerank Anil Maheshwari

Transformations and Matrices Transformations I Transformations are functions Matrices

Structural Matrices in MDOF Systems Evaluation of Structural Matrices Choice of Property

for BlueGene/P Franz Franchetti 1 , Yevgen Voronenko 2 , Gheorghe Almasi 3 1 Carnegie Mellon

3rd Grade Shapes and Perimeter 2015-11-10 www.njctl.org Slide 3 / 102 Slide 4 / 102 Table of

Synchronising C/C++ and POWER Susmit Sarkar 1 Kayvan Memarian 1 Scott Owens 1 Mark Batty 1 Peter

Warm up Sketch the graph of f ( x ) = ( x 3)( x 2)( x 1) = x 3 6 x 2 + 11 x 6

Exascale-ability Today N=4096 3 12.3 10 12 Flops 1.1 TB of Data 3D FFT Exascale-ability

The tangent FFT D. J. Bernstein University of Illinois at Chicago See online version of paper,

Automatic physical inference with information maximising neural networks Physical Review D 97 ,

Model dependences, uncertain1es, and combined analysis Intro

JUST THE MATHS SLIDES NUMBER 9.10 MATRICES 10 (Symmetric matrices & quadratic forms)