  1. Data Sciences – CentraleSupelec, Advanced Machine Learning
  Course VII – Inference on Graphical Models
  Emilie Chouzenoux, Center for Visual Computing, CentraleSupelec
  emilie.chouzenoux@centralesupelec.fr

  2. Graphical models
  ∗ A graph $G$ consists of a pair $(V, E)$, with $V$ the set of vertices and $E$ the set of edges.
  ∗ In graphical models, each vertex represents a random variable, and the graph gives a visual way of understanding the joint distribution $P$ of a set of random variables $X = (X^{(1)}, \dots, X^{(p)}) \sim P$.

  3. Graphical models
  ∗ A graph $G$ consists of a pair $(V, E)$, with $V$ the set of vertices and $E$ the set of edges.
  ∗ In graphical models, each vertex represents a random variable, and the graph gives a visual way of understanding the joint distribution $P$ of a set of random variables $X = (X^{(1)}, \dots, X^{(p)}) \sim P$.
  ∗ In an undirected graph, the edges have no directional arrows. We say that the pairwise Markov property holds if, for every $(j, k) \in V^2$, the absence of an edge between $X^{(j)}$ and $X^{(k)}$ is equivalent to the conditional independence of the corresponding random variables, given the other variables: $X^{(j)} \perp X^{(k)} \mid X^{(V \setminus \{j, k\})}$.
  ∗ Undirected graph + pairwise Markov property = conditional independence graph model.

  4. Gaussian graphical model
  ∗ A Gaussian graphical model (GGM) is a conditional independence graph with a multivariate Gaussian distribution: $X = (X^{(1)}, \dots, X^{(p)}) \sim \mathcal{N}(0, \Sigma)$, with positive definite covariance matrix $\Sigma \in \mathbb{R}^{p \times p}$.

  5. Gaussian graphical model
  ∗ A Gaussian graphical model (GGM) is a conditional independence graph with a multivariate Gaussian distribution: $X = (X^{(1)}, \dots, X^{(p)}) \sim \mathcal{N}(0, \Sigma)$, with positive definite covariance matrix $\Sigma \in \mathbb{R}^{p \times p}$.
  ∗ The partial correlation between $X^{(j)}$ and $X^{(k)}$ given $X^{(V \setminus \{j, k\})}$ equals
  $$\rho_{jk \mid V \setminus \{j, k\}} = -\frac{K_{jk}}{\sqrt{K_{jj} K_{kk}}}, \qquad \text{with } K = \Sigma^{-1}.$$

  6. Gaussian graphical model
  ∗ A Gaussian graphical model (GGM) is a conditional independence graph with a multivariate Gaussian distribution: $X = (X^{(1)}, \dots, X^{(p)}) \sim \mathcal{N}(0, \Sigma)$, with positive definite covariance matrix $\Sigma \in \mathbb{R}^{p \times p}$.
  ∗ The partial correlation between $X^{(j)}$ and $X^{(k)}$ given $X^{(V \setminus \{j, k\})}$ equals
  $$\rho_{jk \mid V \setminus \{j, k\}} = -\frac{K_{jk}}{\sqrt{K_{jj} K_{kk}}}, \qquad \text{with } K = \Sigma^{-1}.$$
  ∗ Consider the linear regression
  $$X^{(j)} = \beta_k^{(j)} X^{(k)} + \sum_{r \in V \setminus \{j, k\}} \beta_r^{(j)} X^{(r)} + \epsilon^{(j)},$$
  with $\epsilon^{(j)}$ zero-mean and independent of $X^{(r)}$, $r \in V \setminus \{j\}$. Then $\beta_k^{(j)} = -K_{jk}/K_{jj}$ and $\beta_j^{(k)} = -K_{jk}/K_{kk}$.

  7. Gaussian graphical model
  ∗ A Gaussian graphical model (GGM) is a conditional independence graph with a multivariate Gaussian distribution: $X = (X^{(1)}, \dots, X^{(p)}) \sim \mathcal{N}(0, \Sigma)$, with positive definite covariance matrix $\Sigma \in \mathbb{R}^{p \times p}$.
  ∗ The partial correlation between $X^{(j)}$ and $X^{(k)}$ given $X^{(V \setminus \{j, k\})}$ equals
  $$\rho_{jk \mid V \setminus \{j, k\}} = -\frac{K_{jk}}{\sqrt{K_{jj} K_{kk}}}, \qquad \text{with } K = \Sigma^{-1}.$$
  ∗ Consider the linear regression
  $$X^{(j)} = \beta_k^{(j)} X^{(k)} + \sum_{r \in V \setminus \{j, k\}} \beta_r^{(j)} X^{(r)} + \epsilon^{(j)},$$
  with $\epsilon^{(j)}$ zero-mean and independent of $X^{(r)}$, $r \in V \setminus \{j\}$. Then $\beta_k^{(j)} = -K_{jk}/K_{jj}$ and $\beta_j^{(k)} = -K_{jk}/K_{kk}$.
  ∗ The edges in a GGM are then related to $\Sigma$, $K$ and $\beta$ through
  $$(j, k) \text{ and } (k, j) \in E \;\Longleftrightarrow\; \Sigma^{-1}_{jk} \neq 0 \;\Longleftrightarrow\; \rho_{jk \mid V \setminus \{j, k\}} \neq 0 \;\Longleftrightarrow\; \beta_k^{(j)} \neq 0 \text{ and } \beta_j^{(k)} \neq 0.$$
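These relations can be checked numerically. Below is a small NumPy sketch (illustrative only, not part of the slides): it starts from an arbitrary, hypothetical precision matrix $K = \Sigma^{-1}$, computes the partial correlation $-K_{jk}/\sqrt{K_{jj} K_{kk}}$, and verifies that the population regression coefficient of $X^{(j)}$ on the remaining variables satisfies $\beta_k^{(j)} = -K_{jk}/K_{jj}$.

```python
import numpy as np

# Hypothetical 4x4 precision matrix K = Sigma^{-1} (symmetric positive definite);
# the zeros K[0, 3] = K[3, 0] = 0 encode a missing edge between nodes 1 and 4.
K = np.array([[2.0, 0.6, 0.3, 0.0],
              [0.6, 2.0, 0.5, 0.4],
              [0.3, 0.5, 2.0, 0.7],
              [0.0, 0.4, 0.7, 2.0]])
Sigma = np.linalg.inv(K)

j, k = 0, 1  # look at the pair (X^(1), X^(2)) in 0-based indexing

# Partial correlation rho_{jk | V \ {j, k}} = -K_jk / sqrt(K_jj * K_kk)
rho_jk = -K[j, k] / np.sqrt(K[j, j] * K[k, k])

# Population regression of X^(j) on all other variables:
# beta = Sigma_{-j,-j}^{-1} Sigma_{-j,j}; its entry for X^(k) should equal -K_jk / K_jj.
others = [r for r in range(K.shape[0]) if r != j]
beta = np.linalg.solve(Sigma[np.ix_(others, others)], Sigma[others, j])
beta_k = beta[others.index(k)]

print("partial correlation:", rho_jk)
print("beta_k^(j):", beta_k, "   -K_jk/K_jj:", -K[j, k] / K[j, j])
```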

  8. Nodewise regression
  ∗ We aim at inferring the presence of edges in a GGM. Nodewise regression consists in performing many regressions [Meinshausen et al., 2006], relying on the fact that
  $$X^{(j)} = \sum_{r \neq j} \bar{\beta}_r^{(j)} X^{(r)} + \epsilon^{(j)}, \qquad j = 1, \dots, p.$$
  1) For $j = 1, \dots, p$, apply a variable selection method providing an estimate $\hat{S}^{(j)}$ of $S^{(j)} = \{ r \mid \bar{\beta}_r^{(j)} \neq 0,\; r = 1, \dots, p,\; r \neq j \}$. Lasso regression of $X^{(j)}$ versus $X^{(r)}$, $r \neq j$, yields $\hat{\beta}^{(j)}$, which then yields the support estimate $\hat{S}^{(j)} = \{ r \mid \hat{\beta}_r^{(j)} \neq 0 \}$.
  2) Build an estimate of the graph structure using the AND/OR rule: an edge is present between nodes $j$ and $k$ $\Longleftrightarrow$ $k \in \hat{S}^{(j)}$ AND/OR $j \in \hat{S}^{(k)}$.
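As an illustration of nodewise regression, here is a minimal sketch assuming scikit-learn is available; the data matrix X, the fixed penalty level alpha, and the threshold used to read off the support are arbitrary choices, not prescribed by the course.

```python
import numpy as np
from sklearn.linear_model import Lasso

def nodewise_graph(X, alpha=0.1, rule="AND"):
    """Estimate a GGM edge set by Lasso-regressing each variable on all the others."""
    n, p = X.shape
    supports = []
    for j in range(p):
        others = np.delete(np.arange(p), j)
        # Lasso regression of X^(j) on X^(r), r != j, gives a sparse beta^(j)
        beta = Lasso(alpha=alpha).fit(X[:, others], X[:, j]).coef_
        supports.append(set(others[np.abs(beta) > 1e-10]))
    edges = set()
    for j in range(p):
        for k in range(j + 1, p):
            in_j, in_k = k in supports[j], j in supports[k]
            # AND rule: both regressions must select the pair; OR rule: at least one
            keep = (in_j and in_k) if rule == "AND" else (in_j or in_k)
            if keep:
                edges.add((j, k))
    return edges

# Illustrative usage on synthetic Gaussian data (arbitrary covariance)
rng = np.random.default_rng(0)
X = rng.multivariate_normal(np.zeros(3), [[1, .8, 0], [.8, 1, 0], [0, 0, 1]], size=500)
print(nodewise_graph(X, alpha=0.05))
```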

  9. Graphical LASSO
  ∗ We aim at inferring the GGM parameters $(\mu, \Sigma)$ from $n$ i.i.d. realizations $X_1, \dots, X_n$ of $\mathcal{N}(\mu, \Sigma)$, with $\mu \in \mathbb{R}^p$ and $\Sigma \in \mathbb{R}^{p \times p}$ symmetric positive definite. We introduce the sample mean and the empirical covariance matrix:
  $$\hat{\mu} = n^{-1} \sum_{i=1}^{n} X_i, \qquad S = n^{-1} \sum_{i=1}^{n} (X_i - \hat{\mu})(X_i - \hat{\mu})^\top.$$
  Then, the negative Gaussian log-likelihood reads
  $$-n^{-1} \, \ell(\Sigma^{-1} \mid X_1, \dots, X_n) = -\log\det \Sigma^{-1} + \mathrm{trace}(S \Sigma^{-1}) + \text{constant}.$$
  ∗ GLASSO is an estimator of $\Sigma^{-1}$ based on the use of an $\ell_1$ penalty:
  $$\widehat{\Sigma^{-1}} = \operatorname*{argmin}_{\Sigma^{-1} \succ 0} \; -\log\det \Sigma^{-1} + \mathrm{trace}(S \Sigma^{-1}) + \lambda \|\Sigma^{-1}\|_1,$$
  with $\|\Sigma^{-1}\|_1 = \sum_{j < k} |\Sigma^{-1}_{jk}|$, and $\lambda > 0$ a regularization parameter.
  ∗ Convex optimization problem. Several solvers are available. Example: the ADMM algorithm, sketched below.
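For illustration, a minimal NumPy sketch of an ADMM solver for the GLASSO problem is given below, splitting $\Theta = Z$ and soft-thresholding the off-diagonal entries of $Z$; this penalizes $\sum_{j \neq k} |\Theta_{jk}|$, i.e. the slide's penalty up to a factor of 2 in $\lambda$. The penalty lam, the ADMM parameter rho, and the iteration count are arbitrary choices. Off-the-shelf solvers such as sklearn.covariance.GraphicalLasso could be used instead.

```python
import numpy as np

def glasso_admm(S, lam, rho=1.0, n_iter=200):
    """ADMM sketch for: min over Theta > 0 of
    -logdet(Theta) + trace(S @ Theta) + lam * sum_{j != k} |Theta_jk|."""
    p = S.shape[0]
    Z = np.eye(p)           # auxiliary copy of Theta (consensus variable)
    U = np.zeros((p, p))    # scaled dual variable
    off_diag = ~np.eye(p, dtype=bool)
    for _ in range(n_iter):
        # Theta-update: solve rho*Theta - Theta^{-1} = rho*(Z - U) - S by eigendecomposition
        eigval, eigvec = np.linalg.eigh(rho * (Z - U) - S)
        theta_eig = (eigval + np.sqrt(eigval ** 2 + 4.0 * rho)) / (2.0 * rho)
        Theta = eigvec @ np.diag(theta_eig) @ eigvec.T
        # Z-update: soft-threshold the off-diagonal entries of Theta + U at lam/rho
        A = Theta + U
        Z = A.copy()
        Z[off_diag] = np.sign(A[off_diag]) * np.maximum(np.abs(A[off_diag]) - lam / rho, 0.0)
        # Dual ascent step
        U = U + Theta - Z
    return Z  # sparse estimate of Sigma^{-1}

# Illustrative usage on the empirical covariance of synthetic data
rng = np.random.default_rng(1)
X = rng.standard_normal((200, 5))
S = np.cov(X, rowvar=False)
print(np.round(glasso_admm(S, lam=0.2), 2))
```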

  10. Example
  ∗ Four different GLASSO solutions for the flow-cytometry data, with $p = 11$ proteins measured on $n = 7466$ cells [Sachs et al., 2003].

  11. Example
  ∗ Six different GLASSO solutions for the genomic dataset on riboflavin production with Bacillus subtilis, $p = 160$ and $n = 115$ [Meinshausen et al., 2010].

  12. Whiteboard

  13. Whiteboard

  14. Whiteboard
