A Model For Mixed Linear-Tropical Matrix Factorization James Hook, - PowerPoint PPT Presentation

A Model For Mixed Linear-Tropical Matrix Factorization James Hook, Sanjar Karaev, Pauli Miettinen University of Birmingham: 18th June 2018

Low-Rank Approximate Factorization Given a matrix A ∈ R n × m , an approximate factorization of rank k is a pair B ∈ R n × k and C ∈ R k × m , such that A ≈ BC . Such approximate factorizations are used throughout applied mathematics in... Compression Visualization/interpretation Matrix completion/prediction Huge number of variations Constrains on factor matrices e.g. orthogonal, triangular, non-negative... Measure of closeness e.g. Frobenius norm, KL divergence... What about the matrix-matrix product itself?

Tropical Semirings Tropical algebra concerns any semiring whose ‘addition’ operation is max or min. E.g. the min-plus semiring R min + = [ R ∪ {∞} , ⊕ , ⊗ ], where a ⊕ b = min { a , b } , a ⊗ b = a + b , ∀ a , b ∈ R min + . Min-plus matrix multiplication is defined in analogy to the classical case. For A ∈ R n × m min + and B ∈ R m × d min + we have A ⊗ B ∈ R n × d min + , with m m � ( A ⊗ B ) ij = a ik ⊗ b kj = k =1 ( a ik + b kj ) . min k =1 For example       0 2 3 0 2 3 0 2 2  ⊗  =  . ∞ 0 0 ∞ 0 0 0 0 0    0 1 0 0 1 0 0 1 0

Paths through graphs viewpoint       0 2 3 0 2 3 0 2 2  =  ⊗ 0 0 0 0 0 0 0 ∞ ∞     0 1 0 0 1 0 0 1 0 0 v (1) 3 2 0 0 v (2) v (3) 0 0 1 For A ∈ R n × n min + , precedence graph Γ( A ). Proposition � A ⊗ ℓ � ij = the weight of the minimally weighted path of length ℓ , through Γ( A ) , from v ( i ) to v ( j ) .

Paths through graphs viewpoint � 1     1 1 2 1 1 0 1 �  ⊗ 0 1 = · 0 1    1 1 0 1 0 · · 0 u (1) u (2) v (1) v (2) v (3) For A ∈ R n × d min + , precedence bipartite graph B ( A ). Proposition � A ⊗ A T � ij = the weight of the minimally weighted path (of length 2) through B ( A ) from v ( i ) to v ( j ) .

Min-Plus Low-Rank Matrix Approximation Min-plus low-rank matrix approximation For M ∈ R n × m min + and 0 < k ≤ min { n , m } , we seek � M − A ⊗ B � 2 min F . A ∈ R n × k min + , B ∈ R k × m min + Network interpretation Given a network with shortest path distances M build a new network with k ‘transport hub’ vertices whose shortest path distances approximate M . Geometrical interpretation Given m points m 1 , . . . , m m ∈ R n max find a k -dimensional min-plus linear space C to minimize m � dist( m i − C ) 2 . i =1

Min-Plus Low-Rank Matrix Approximation J. Hook. Min-plus algebraic low rank matrix approximation: a new method for revealing structure in networks . arXiv:1708.06552. J. Hook. Linear regression over the max-plus semiring: algorithms and applications. Figure: Original image taken from arXiv:1712.03499. Network Rail

Column space geometry viewpoint � 0 . 5     0 0 0 0 0 0 � 4 4 . 5 ∞  ≈  ⊗ 0 4 5 8 0 . 5 8 . 5   0 0 − 0 . 25 − 0 . 67 0 3 2 1 − 1 . 5 2 . 5   0 0 − 0 . 25 − 0 . 67 = 1 4 . 5 5 7 . 83   − 1 2 . 5 2 . 25 1 . 83 x 3   0 8 . 5   2 . 5 x 2   0 0 . 5   − 1 . 5

Max-Times Semiring The max-times semiring R max × = [ R + , ⊞ , ⊠ ], where a ⊞ b = max { a , b } , a ⊠ b = a × b , ∀ a , b ∈ R max × . Max-times matrix multiplication is defined in analogy to the classical case. For A ∈ R n × m max × and B ∈ R m × d max × we have A ⊠ B ∈ R n × d max × , with m m ⊞ ( A ⊠ B ) ij = a ik ⊠ b kj = max k =1 ( a ik b kj ) . k =1 For example       0 100 100 0 100 100 100 1000 100  =  ⊠  . 0 1 1 0 1 1 1 10 1    1 10 1 1 10 1 1 100 100

Max-Times Low-Rank Approximation Max-Times Low-Rank Approximation Given an input matrix A ∈ R max × and an integer k > 0, find B ∈ R max × R n × k , C ∈ R max × R k × m , such that + + � A − B ⊠ C � F is minimized. S. Karaev and P. Miettinen. Capricorn: An Algorithm for Subtropical Matrix Factorization . SIAM International Conference on Data Mining 2016. S. Karaev and P. Miettinen. Cancer: Another Algorithm for Subtropical Matrix Factorization. ECML PKDD 2016.

Factorization Models Figure: Image taken from blog2.sigopt.com 1 SVD: Sum of parts of different signs. Optimal with ‘classical’ product. 2 NMF: Sum of non-negative parts. Interpretable factors ‘parts of a whole’. 3 Max-times: Maximum of non-negative parts. Interpretable factors ‘winner takes all’ 4 Mixed Tropical-Linear Model: Some entries determined by NMF some entries determined by Max-times.

The Mixed Tropical-Linear Model Given an input matrix A ∈ R n × m , we seek factor matrices + B ∈ R n × k and C ∈ R k × m and parameters α ∈ R n × m , such that + + A ij ≈ α ij ( B ⊠ C ) + (1 − α ij )( BC ) ij . α ij ≈ 1 ⇔ A ij determined by tropical product α ij ≈ 0 ⇔ A ij determined by linear product We enforce α ij = σ ( θ i + φ j ) , where θ ∈ R n and φ ∈ R m are vectors to be determined and σ is the logistic sigmoid 1 σ ( x ) = 1 + exp( − x ) .

The Mixed Tropical-Linear Model , θ ∈ R n and φ ∈ R m define the mixed For B ∈ R n × k , C ∈ R k × m + + tropical-linear product ( B ⊠ θ,φ C ) ij = α ij ( B ⊠ C ) + (1 − α ij )( BC ) ij , where α ij = σ ( θ i + φ j ). Mixed Tropical-Linear Low-Rank Approximation Given an input matrix A ∈ R n × m and an integer k > 0, find + , θ ∈ R n and φ ∈ R m such that B ∈ R n × k , C ∈ R k × m + + � A − B ⊠ θ,φ C � F is minimized.

Our Algorithm

Examples Table: Reconstruction error for real-world datasets. Climate NPAS Face 4NEWS HPI k = 10 10 40 20 15 Latitude 0.023 0.207 0.157 0.536 0.016 SVD 0.025 0.209 0.140 0.533 0.015 NMF 0.080 0.223 0.302 0.541 0.124 Cancer 0.066 0.237 0.205 0.554 0.026

Examples

Conclusion ’Classical’ low-rank approximate factorizations used throughout applied maths. Tropical low-rank approximate factorizations including min-plus and max-times provide a completely different model but with analogous algebraic structure. We introduced a novel model that interpolates between NNMF and max-times. Able to outperform SVD on some real life data sets. What is the structure being detected? S. Karaev, J. Hook and P. Miettinen. Latitude: A Model for Mixed Linear-Tropical Matrix Factorization . SIAM International Conference on Data Mining 2018.

A Model For Mixed Linear-Tropical Matrix Factorization James Hook, - PowerPoint PPT Presentation

A Model For Mixed Linear-Tropical Matrix Factorization James Hook, Sanjar Karaev, Pauli Miettinen University of Birmingham: 18th June 2018 Low-Rank Approximate Factorization Given a matrix A R n m , an approximate factorization of rank k

Tropical Cyclones Tropical Cyclones Part 10 Part 10 W hat is a Tropical Cyclone? A tropical

L101: Matrix Factorization In a nutshell Matrix factorization/completion you know? In NLP?

Online-Updating Regularized Kernel Matrix Factorization Models for Large-Scale Recommender

Tensor Factorization via Matrix Factorization Volodymyr Kuleshov Arun Tejasvi Chaganty Percy

The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone The MJO in Tropical Total Ozone

Overview of this module Course 02429 Design Matrix 1 Analysis of correlated data: Mixed Linear

Outline Statistical inference for linear mixed models general form of linear mixed models

Singular Value Decomposition (matrix factorization) Singular Value Decomposition The SVD is a

Mixing it up with random effects Joshua Loftus Mixed models Intro to mixed models What is a

Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in Selective Mixed Oxides in

Mixed Precision Training PAI Overview What is mixed-precision

Tropical Connected Sets problem Eugene Vagin 07.04.2020 Eugene Vagin Tropical Connected Sets

Gov 2000: 10. Multiple Regression in Matrix Form Matthew Blackwell Fall 2016 1 / 64 1. Matrix

Lifting Tropical Curves and Linear Systems on Graphs Eric Katz (University of Waterloo) September

Matrix Factorization with Binary Components Uniqueness in a randomized model Felix Krahmer,

Matrix Factorization and Factorization Machines for Recommender Systems Chih-Jen Lin Department

Applying Learning Technologies to Foreign Language Teaching Attilio Galimberti Local Education

The road not taken Robert Frost Two roads diverged in a yellow wood, And sorry I could not

DMIP DMIP team DMIP DMIP team team team Data Mining and Inductive Data Mining and Inductive

HIGH SPEED UK ..connecting the nation Colin Elliff BSc CEng MICE Civil Engineering

A practical example: The adoption of a Travel Plan Strategy at a Local Major Employer. Andrew

A14 Cambridge to Huntingdon improvement scheme Members Briefing Monday, 18 July 2016 Agenda

The East of England Transport Economic Evidence Study (TEES) A presentation by The East of

Scotland Route Study Working Group 9a 4-Sep-17 / 1 Scotland Route Study Agenda 1. Welcome