Data Mining and Matrices
02 – Linear Algebra Refresher
Rainer Gemulla, Pauli Miettinen
April 18, 2013
Vectors

A vector is
◮ a 1D array of numbers
◮ a geometric entity with magnitude and direction
◮ a matrix with exactly one row or column ⇒ row and column vectors

A transpose $a^T$ transposes a row vector into a column vector and vice versa.

The norm of a vector defines its magnitude
◮ Euclidean or $L_2$: $\|a\| = \|a\|_2 = \left(\sum_{i=1}^n a_i^2\right)^{1/2}$
◮ General $L_p$ ($1 \le p \le \infty$): $\|a\|_p = \left(\sum_{i=1}^n |a_i|^p\right)^{1/p}$

The dot product of two vectors of the same dimension is $a \cdot b = \sum_{i=1}^n a_i b_i$
◮ Also known as scalar product or inner product
◮ Alternative notations: $\langle a, b \rangle$, $a^T b$ (for column vectors), $a b^T$ (for row vectors)

In Euclidean space we can define $a \cdot b = \|a\|\,\|b\| \cos\theta$
◮ $\theta$ is the angle between $a$ and $b$
◮ $a \cdot b = 0$ iff $\theta = \frac{1}{2}\pi + k\pi$ (the vectors are orthogonal)
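A minimal NumPy sketch of these definitions (illustrative, not part of the slides; the example vectors are made up):

```python
import numpy as np

a = np.array([3.0, 4.0])
b = np.array([-4.0, 3.0])

# Euclidean (L2) norm: ||a|| = (sum_i a_i^2)^(1/2)
print(np.linalg.norm(a))            # 5.0

# General Lp norm, here p = 1
print(np.linalg.norm(a, ord=1))     # 7.0

# Dot product: a . b = sum_i a_i * b_i
print(a @ b)                        # 0.0 -> a and b are orthogonal

# Recover the angle via a . b = ||a|| ||b|| cos(theta)
cos_theta = (a @ b) / (np.linalg.norm(a) * np.linalg.norm(b))
print(np.degrees(np.arccos(cos_theta)))  # 90.0
```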
Matrix algebra

Matrices in $\mathbb{R}^{n \times n}$ form a ring
◮ Addition, subtraction, and multiplication
◮ Addition and subtraction are element-wise
◮ Multiplication doesn't always have an inverse (division)
◮ Multiplication isn't commutative ($AB \ne BA$ in general)
◮ The identity for multiplication is the identity matrix $I$ with 1s on the main diagonal and 0s elsewhere
  ⋆ $I_{ij} = 1$ iff $i = j$; $I_{ij} = 0$ iff $i \ne j$

If $A \in \mathbb{R}^{m \times k}$ and $B \in \mathbb{R}^{k \times n}$, then $AB \in \mathbb{R}^{m \times n}$ with $(AB)_{ij} = \sum_{\ell=1}^k a_{i\ell} b_{\ell j}$
◮ The inner dimension ($k$) of $A$ and $B$ must agree
◮ The dimensions of the product are the outer dimensions of $A$ and $B$
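A short NumPy illustration of the dimension rules and non-commutativity (example matrices are made up):

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])          # 3x2
B = np.array([[1.0, 0.0, 2.0],
              [0.0, 1.0, 3.0]])     # 2x3

C = A @ B                           # inner dimensions (2) agree; result is 3x3
print(C.shape)                      # (3, 3)

# (AB)_ij is the sum over the inner dimension k
i, j = 0, 2
print(C[i, j] == sum(A[i, l] * B[l, j] for l in range(A.shape[1])))  # True

# Not commutative: B @ A has different dimensions (2x2) than A @ B (3x3)
print((B @ A).shape)                # (2, 2)

# The identity matrix is neutral for multiplication
I = np.eye(3)
print(np.allclose(I @ C, C) and np.allclose(C @ I, C))  # True
```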
Intuition for matrix multiplication

Element $(AB)_{ij}$ is the inner product of row $i$ of $A$ and column $j$ of $B$.

Row $i$ of $AB$ is the linear combination of the rows of $B$ with the coefficients coming from row $i$ of $A$
◮ Similarly, column $j$ of $AB$ is a linear combination of the columns of $A$ with the coefficients coming from column $j$ of $B$

Matrix $AB$ is a sum of $k$ matrices $a_\ell b_\ell^T$ obtained by multiplying the $\ell$-th column of $A$ with the $\ell$-th row of $B$
◮ This is known as a vector outer product

[Figure: $AB$ depicted as a sum of $k$ outer products of the columns of $A$ with the rows of $B$]
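The three views of the product can be checked numerically; a sketch with made-up random matrices:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.random((4, 3))   # m x k
B = rng.random((3, 5))   # k x n
C = A @ B

# View 1: element (i, j) is the inner product of row i of A and column j of B
print(np.allclose(C[1, 2], A[1, :] @ B[:, 2]))

# View 2: row i of C is a linear combination of the rows of B,
# with coefficients from row i of A
print(np.allclose(C[1, :], sum(A[1, l] * B[l, :] for l in range(3))))

# View 3: C is the sum of k outer products (column l of A times row l of B)
outer_sum = sum(np.outer(A[:, l], B[l, :]) for l in range(3))
print(np.allclose(C, outer_sum))
```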
Matrices as linear mappings

A matrix $M \in \mathbb{R}^{m \times n}$ is a linear mapping from $\mathbb{R}^n$ to $\mathbb{R}^m$
◮ If $x \in \mathbb{R}^n$, then $y = Mx \in \mathbb{R}^m$ is the image of $x$
◮ $y_i = \sum_{j=1}^n M_{ij} x_j$

If $A \in \mathbb{R}^{m \times k}$ and $B \in \mathbb{R}^{k \times n}$, then $AB$ is a mapping from $\mathbb{R}^n$ to $\mathbb{R}^m$
◮ The composition of the mappings $A$ and $B$

A square matrix $A \in \mathbb{R}^{n \times n}$ is invertible if there is a matrix $B \in \mathbb{R}^{n \times n}$ such that $AB = I$
◮ Matrix $B$ is the inverse of $A$, denoted $A^{-1}$
◮ If $A$ is invertible, then $AA^{-1} = A^{-1}A = I$
  ⋆ $AA^{-1}x = A^{-1}Ax = x$
◮ Non-square matrices don't have (two-sided) inverses but can have left or right inverses: $AR = I$ or $LA = I$

The transpose of $M \in \mathbb{R}^{m \times n}$ is a linear mapping $M^T: \mathbb{R}^m \to \mathbb{R}^n$
◮ $(M^T)_{ij} = M_{ji}$
◮ In general, the transpose is not the inverse ($AA^T \ne I$)
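A small sketch of inverses and a left inverse (the matrices are made-up examples; for the tall matrix the Moore–Penrose pseudoinverse happens to serve as a left inverse because its columns are linearly independent):

```python
import numpy as np

# A square, invertible matrix as a mapping R^2 -> R^2
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
x = np.array([1.0, -1.0])

y = A @ x                                  # image of x under A
A_inv = np.linalg.inv(A)
print(np.allclose(A_inv @ y, x))           # A^{-1} A x = x -> True
print(np.allclose(A @ A_inv, np.eye(2)))   # A A^{-1} = I  -> True

# A tall 3x2 matrix has no two-sided inverse, but can have a left inverse L
M = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
L = np.linalg.pinv(M)                      # pseudoinverse: (M^T M)^{-1} M^T
print(np.allclose(L @ M, np.eye(2)))       # L M = I_2 -> True
```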
Matrix rank and linear independence

A vector $u \in \mathbb{R}^n$ is linearly dependent on a set of vectors $V = \{v_i\} \subset \mathbb{R}^n$ if $u$ can be expressed as a linear combination of vectors in $V$
◮ $u = \sum_i a_i v_i$ for some $a_i \in \mathbb{R}$
◮ Set $V$ is linearly dependent if some $v_i \in V$ is linearly dependent on $V \setminus \{v_i\}$
◮ If $V$ is not linearly dependent, it is linearly independent

The column rank of a matrix $M$ is the number of linearly independent columns of $M$.
The row rank of $M$ is the number of linearly independent rows of $M$.
The Schein rank of $M$ is the least integer $k$ such that $M = AB$ for some $A \in \mathbb{R}^{m \times k}$ and $B \in \mathbb{R}^{k \times n}$
◮ Equivalently, the least $k$ such that $M$ is a sum of $k$ vector outer products

All these ranks are equal!
◮ A matrix has rank 1 iff it is an outer product of two (non-zero) vectors
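An illustrative sketch (made-up vectors; the Schein-rank view is demonstrated here via the SVD, one convenient way to obtain a sum of rank-1 outer products, though the slides do not prescribe it):

```python
import numpy as np

u = np.array([1.0, 2.0, 3.0])
v = 2 * u                           # v is linearly dependent on {u}

# A rank-1 matrix is an outer product of two non-zero vectors
R1 = np.outer(u, np.array([1.0, 0.0, -1.0]))
print(np.linalg.matrix_rank(R1))    # 1

# Rows u, v, plus one independent row: rank is 2
M = np.vstack([u, v, [1.0, 0.0, 0.0]])
print(np.linalg.matrix_rank(M))     # 2

# Schein-rank view: M is a sum of rank-many (here 2) outer products
U, s, Vt = np.linalg.svd(M)
recon = sum(s[l] * np.outer(U[:, l], Vt[l, :]) for l in range(2))
print(np.allclose(M, recon))        # True
```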
Matrix norms

Matrix norms measure the magnitude of a matrix
◮ Magnitude of the values
◮ Magnitude of the image

Operator norms measure how large the image of a unit vector can be
◮ For $p \ge 1$, $\|M\|_p = \max\{\|Mx\|_p : \|x\|_p = 1\}$

The Frobenius norm is the vector $L_2$ norm applied to matrices
◮ $\|M\|_F = \left(\sum_{i=1}^m \sum_{j=1}^n M_{ij}^2\right)^{1/2}$
◮ N.B. $\|M\|_F \ne \|M\|_2$ (but the Frobenius norm is sometimes referred to as the $L_2$ norm)
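A quick check that the two norms differ, with a made-up example matrix (the operator 2-norm equals the largest singular value, which is how NumPy computes it):

```python
import numpy as np

M = np.array([[3.0, 0.0],
              [4.0, 5.0]])

# Frobenius norm: vector-L2 norm of all entries
fro = np.linalg.norm(M, ord='fro')
print(fro, np.sqrt((M ** 2).sum()))        # both sqrt(50) ~ 7.071

# Operator 2-norm: maximal stretch of a unit vector
op2 = np.linalg.norm(M, ord=2)
print(op2, np.linalg.svd(M, compute_uv=False)[0])  # both sqrt(45) ~ 6.708

print(fro != op2)  # True: ||M||_F is not ||M||_2
```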
Matrices as systems of linear equations

A matrix can hold the coefficients of a system of linear equations
◮ The original use of matrices (Chinese The Nine Chapters on the Mathematical Art)

$$
\begin{matrix}
a_{1,1}x_1 + a_{1,2}x_2 + \cdots + a_{1,m}x_m = b_1 \\
a_{2,1}x_1 + a_{2,2}x_2 + \cdots + a_{2,m}x_m = b_2 \\
\vdots \\
a_{n,1}x_1 + a_{n,2}x_2 + \cdots + a_{n,m}x_m = b_n
\end{matrix}
\quad\Leftrightarrow\quad
\begin{pmatrix}
a_{1,1} & a_{1,2} & \cdots & a_{1,m} \\
a_{2,1} & a_{2,2} & \cdots & a_{2,m} \\
\vdots & \vdots & \ddots & \vdots \\
a_{n,1} & a_{n,2} & \cdots & a_{n,m}
\end{pmatrix}
\begin{pmatrix} x_1 \\ x_2 \\ \vdots \\ x_m \end{pmatrix}
=
\begin{pmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{pmatrix}
$$

If the coefficient matrix $A$ is square and invertible, the system has the exact solution $x = A^{-1}b$.
If there are fewer equations than unknowns ($n < m$), the system is underdetermined and can have infinitely many solutions.
If there are more equations than unknowns ($n > m$), the system is overdetermined and (usually) does not have an exact solution.
The least-squares solution is the vector $x$ that minimizes $\|Ax - b\|_2^2$
◮ Linear regression
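A sketch of both cases with made-up systems, using NumPy's exact and least-squares solvers:

```python
import numpy as np

# Square, invertible system: exact solution x = A^{-1} b
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([3.0, 5.0])
x = np.linalg.solve(A, b)
print(np.allclose(A @ x, b))      # True: exact solution found

# Overdetermined system (more equations than unknowns):
# the least-squares solution minimizes ||Ax - b||_2^2
A_tall = np.array([[1.0, 1.0],
                   [1.0, 2.0],
                   [1.0, 3.0]])
b_tall = np.array([1.0, 2.0, 2.0])
x_ls, residual, rank, sv = np.linalg.lstsq(A_tall, b_tall, rcond=None)
print(x_ls)   # intercept and slope of the least-squares fit (linear regression)
```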
Special types of matrices

The diagonals of a matrix $M$ go from top-left to bottom-right
◮ The main diagonal contains the elements $M_{i,i}$
◮ The $k$-th upper diagonal contains the elements $M_{i,(i+k)}$
◮ The $k$-th lower diagonal contains the elements $M_{(i+k),i}$
◮ The anti-diagonals go from top-right to bottom-left

A matrix is diagonal if all its non-zero values are on a single diagonal (typically the main diagonal)
◮ Bi-diagonal matrices have values on two diagonals, etc.

Matrix $M$ is upper (right) triangular if all of its non-zeros are on or above the main diagonal
◮ Lower (left) triangular matrices have all non-zeros on or below the main diagonal
◮ Upper-left and lower-right triangular matrices replace the diagonal with the anti-diagonal

A square matrix $P$ is a permutation matrix if each row and each column of $P$ has exactly one 1 and the rest are 0s
◮ If $P$ is a permutation matrix, $PM$ is like $M$ but with the rows permuted
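NumPy has direct support for most of these shapes; a brief sketch (the example matrix and permutation are made up):

```python
import numpy as np

M = np.arange(16, dtype=float).reshape(4, 4)

# Diagonals: main (k=0), k-th upper (k>0), k-th lower (k<0)
print(np.diag(M), np.diag(M, k=1), np.diag(M, k=-1))

# Upper and lower triangular parts
print(np.triu(M))   # zeros strictly below the main diagonal
print(np.tril(M))   # zeros strictly above the main diagonal

# A permutation matrix: exactly one 1 per row and per column
P = np.eye(4)[[2, 0, 3, 1]]   # rows of I in a new order
print(P @ M)                  # rows of M reordered to 2, 0, 3, 1
```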
Orthogonal matrices

A set $V = \{v_i\} \subset \mathbb{R}^n$ is orthogonal if all vectors in $V$ are mutually orthogonal
◮ $v \cdot u = 0$ for all $v, u \in V$ with $v \ne u$
◮ If all vectors in $V$ also have unit norm ($\|v\|_2 = 1$), $V$ is orthonormal

A square matrix $M$ is orthogonal if its columns form a set of orthonormal vectors
◮ Then its rows are orthonormal as well
◮ If $M \in \mathbb{R}^{n \times m}$ and $n > m$, $M$ can be column-orthogonal, but its rows cannot be orthonormal

If $M$ is orthogonal, $M^T = M^{-1}$ (i.e. $MM^T = M^T M = I_n$)
◮ If $M$ is only column-orthogonal ($n > m$), $M^T$ is a left inverse ($M^T M = I_m$)
◮ If $M$ is row-orthogonal ($n < m$), $M^T$ is a right inverse ($MM^T = I_n$)
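One convenient way to produce orthogonal and column-orthogonal matrices for experimentation is the QR decomposition; a sketch with made-up random input:

```python
import numpy as np

rng = np.random.default_rng(1)

# QR decomposition of a (almost surely full-rank) square matrix: Q is orthogonal
Q, R = np.linalg.qr(rng.random((3, 3)))
print(np.allclose(Q.T @ Q, np.eye(3)))      # columns orthonormal
print(np.allclose(Q @ Q.T, np.eye(3)))      # rows orthonormal as well
print(np.allclose(Q.T, np.linalg.inv(Q)))   # Q^T = Q^{-1}

# A tall (4x2) matrix can only be column-orthogonal: Q^T is a left inverse
Q_tall, _ = np.linalg.qr(rng.random((4, 2)))
print(np.allclose(Q_tall.T @ Q_tall, np.eye(2)))  # Q^T Q = I_2 -> True
print(np.allclose(Q_tall @ Q_tall.T, np.eye(4)))  # False: no right inverse
```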
Suggested reading

Any (elementary) linear algebra textbook
◮ For example: Carl Meyer, Matrix Analysis and Applied Linear Algebra, Society for Industrial and Applied Mathematics, 2000. http://www.matrixanalysis.com
Wolfram MathWorld articles
Wikipedia articles