Markov Networks Alan Ri2er Markov Networks Undirected - PowerPoint PPT Presentation

Markov ¡Networks ¡ Alan ¡Ri2er ¡

Markov Networks • Undirected graphical models Smoking ¡ Cancer ¡ Asthma ¡ Cough ¡ l Poten;al ¡func;ons ¡defined ¡over ¡cliques ¡ Smoking Cancer Ф (S,C) 1 P ( x ) c x ( ) ∏ Φ = c False False 4.5 Z c False True 4.5 Z c x ( ) ∑∏ Φ True False 2.7 = c x c True True 4.5

Undirected ¡Graphical ¡Models: ¡ Mo;va;on ¡ • Terminology: ¡ – Directed ¡graphical ¡models ¡= ¡Bayesian ¡Networks ¡ – Undirected ¡graphical ¡models ¡= ¡Markov ¡Networks ¡ • We ¡just ¡learned ¡about ¡DGMs ¡(Bayes ¡Nets) ¡ • For ¡some ¡domains ¡being ¡forced ¡to ¡choose ¡a ¡ direc;on ¡of ¡edges ¡is ¡awkward. ¡ • Example: ¡consider ¡modeling ¡an ¡image ¡ – Assump;on: ¡neighboring ¡pixels ¡are ¡correlated ¡ – We ¡could ¡create ¡a ¡DAG ¡model ¡w/ ¡2D ¡topology ¡

2D ¡Bayesian ¡Network ¡ X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 X 9 X 10 X 11 X 12 X 13 X 14 X 15 X 16 X 17 X 18 X 19 X 20

Markov ¡Random ¡Field ¡ ¡ (Markov ¡Network) ¡ X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 X 9 X 10 X 11 X 12 X 13 X 14 X 15 X 16 X 17 X 18 X 19 X 20

UGMs ¡(Bayes ¡Nets) ¡vs ¡ ¡ DGMs ¡(Markov ¡Nets) ¡ • Advantages ¡ 1. Symmetric ¡ More ¡natural ¡for ¡certain ¡domains ¡(e.g. ¡spa;al ¡or ¡ • rela;onal ¡data) ¡ 2. Discrimina;ve ¡UGMs ¡(A.K.A ¡Condi;onal ¡Random ¡ Fields) ¡work ¡be2er ¡than ¡discrimina;ve ¡UGMs ¡ • Disadvantages ¡ 1. Parameters ¡are ¡less ¡interpretable ¡and ¡modular ¡ 2. Parameter ¡es;ma;on ¡is ¡computa;onally ¡more ¡ expensive ¡

Condi;onal ¡Independence ¡Proper;es ¡ • Much ¡Simpler ¡than ¡Bayesian ¡Networks ¡ – No ¡d-‑sepera;on, ¡v-‑structures, ¡etc… ¡ • UGMs ¡define ¡CI ¡via ¡simple ¡graph ¡separa;on ¡ X A ⊥ G X B | X E ⇐ ⇒ E separates A from B in G • E.g. ¡if ¡we ¡remove ¡all ¡the ¡evidence ¡nodes ¡from ¡ the ¡graph, ¡are ¡there ¡any ¡paths ¡connec;ng ¡A ¡ and ¡B? ¡

Markov ¡Blanket ¡ • Also ¡Simple ¡ – Markov ¡blanket ¡of ¡a ¡node ¡is ¡just ¡the ¡set ¡of ¡it’s ¡ immediate ¡neighbors ¡ – Don’t ¡need ¡to ¡worry ¡about ¡co-‑parents ¡

Independence ¡Proper;es ¡ 2 4 G L P 1 5 7 3 6 p ( x ) > 0 1 ⊥ 7 | rest Pairwise: ¡ 1 ⊥ rest | 2 , 3 Local: ¡ 1 , 2 ⊥ 6 , 7 | 3 , 4 , 5 Global: ¡

Conver;ng ¡a ¡Bayesian ¡Network ¡to ¡a ¡ Markov ¡Network ¡ • Temp;ng: ¡ – Just ¡drop ¡direc;onality ¡of ¡the ¡edges ¡ – But ¡this ¡is ¡clearly ¡incorrect ¡(v-‑structure) ¡ – Introduces ¡incorrect ¡CI ¡statements ¡ • Solu;on: ¡ – Add ¡edges ¡between ¡“unmarried” ¡parents ¡ – This ¡process ¡is ¡called ¡ moraliza2on ¡

Example: ¡moraliza;on ¡ 2 4 1 5 7 3 6 2 4 1 5 7 3 6 • Unfortunately, ¡this ¡looses ¡some ¡CI ¡informa;on ¡ – Example: ¡ 4 ⊥ 5 | 2

Directed ¡vs. ¡Undirected ¡GMs ¡ • Q: ¡which ¡has ¡ ¡more ¡“expressive ¡power”? ¡ • Recall: ¡ – G ¡is ¡an ¡I-‑map ¡of ¡P ¡if: ¡ ¡ I ( G ) ⊆ I ( P ) • Now ¡define: ¡ – G ¡is ¡a ¡ perfect ¡I-‑map ¡ of ¡P ¡if: ¡ I ( G ) = I ( P ) • Graph ¡can ¡represent ¡all ¡(and ¡only) ¡CIs ¡in ¡P ¡ ¡ Bayesian ¡Networks ¡and ¡Markov ¡Networks ¡are ¡ perfect ¡maps ¡for ¡different ¡sets ¡of ¡distribu;ons ¡

Probabilistic Models Graphical Models Directed Chordal Undirected

Parameteriza;on ¡ • No ¡topological ¡ordering ¡on ¡undirected ¡graph ¡ • Can’t ¡use ¡the ¡chain ¡rule ¡of ¡probability ¡to ¡ represent ¡P(y) ¡ • Instead ¡we ¡will ¡use ¡ poten2al ¡func2ons: ¡ – associate ¡poten;al ¡func;ons ¡with ¡each ¡maximal ¡ clique ¡in ¡the ¡graph ¡ ψ c ( y c | θ c ) – A ¡poten;al ¡can ¡be ¡any ¡non-‑nega;ve ¡func;on ¡ • Joint ¡distribu2on ¡is ¡defined ¡to ¡be ¡ propor2onal ¡to ¡product ¡of ¡clique ¡poten2als ¡

Parameteriza;on ¡(con’t) ¡ • Joint ¡distribu2on ¡is ¡defined ¡to ¡be ¡ propor2onal ¡to ¡product ¡of ¡clique ¡poten2als ¡ • Any ¡posi2ve ¡distribu2on ¡whose ¡CI ¡proper2es ¡ can ¡be ¡represented ¡by ¡an ¡UGM ¡can ¡be ¡ represented ¡this ¡way. ¡

Hammersly-‑Clifford ¡Theorem ¡ • A ¡posi;ve ¡distribu;on ¡P(Y) ¡> ¡0 ¡sa;sfies ¡the ¡CI ¡ proper;es ¡of ¡an ¡undirected ¡graph ¡G ¡iff ¡P ¡can ¡ be ¡represented ¡as ¡a ¡product ¡of ¡factors, ¡one ¡ per ¡maximal ¡clique ¡ 1 Y P ( y | θ ) = ψ c ( y c | θ c ) Z ( θ ) c ∈ C X Y Z ¡is ¡the ¡par;;on ¡ Z ( θ ) = ψ c ( y c | θ c ) func;on ¡ y c ∈ C

1 Example ¡ 2 3 • If ¡P ¡sa;sfies ¡the ¡condi;onal ¡ independence ¡assump;ons ¡ 4 5 of ¡this ¡graph, ¡we ¡can ¡write ¡ 1 P ( y | θ ) = Z ( θ ) ψ 123 ( y 1 , y 2 , y 3 ) ψ 234 ( y 2 , y 3 , y 4 ) ψ 35 ( y 3 , y 5 ) X Z ( θ ) = ψ 123 ( y 1 , y 2 , y 3 ) ψ 234 ( y 2 , y 3 , y 4 ) ψ 35 ( y 3 , y 5 ) y

Pairwise ¡MRF ¡ • Poten;als ¡don’t ¡need ¡to ¡ correspond ¡to ¡maximal ¡ 1 cliques ¡ • We ¡can ¡also ¡restrict ¡ 2 3 parameteriza;on ¡to ¡edges ¡ (or ¡any ¡other ¡cliques) ¡ 4 5 • Pairwise ¡MRF: ¡ P ( y | θ ) = ψ 12 ( y 1 , y 2) ψ 13 ( y 1 , y 3 ) ψ 23 ( y 2 , y 3 ) ψ 24 ( y 2 , y 4 ) ψ 34 ( y 3 , y 4 ) ψ 35 ( y 3 , y 5 )

Represen;ng ¡Poten;al ¡Func;ons ¡ • Can ¡represent ¡as ¡CPTs ¡like ¡we ¡did ¡for ¡Bayesian ¡ Networks ¡(DGMs) ¡ – But, ¡poten;als ¡are ¡ not ¡probabili;es ¡ – Represent ¡rela;ve ¡“compa;bility” ¡between ¡ various ¡assignments ¡

Represen;ng ¡Poten;al ¡Func;ons ¡ • More ¡general ¡approach: ¡ – Represent ¡the ¡log ¡poten;als ¡as ¡a ¡linear ¡func;on ¡of ¡ the ¡parameters ¡ – Log-‑linear ¡(maximum ¡entropy) ¡models ¡ ψ c ( y c ) T θ c − log Z ( θ ) X log P ( y | θ ) = c

Log-Linear Models Smoking ¡ Cancer ¡ Asthma ¡ Cough ¡ l Log-‑linear ¡model: ¡ 1 ⎛ ⎞ P ( x ) exp w f ( x ) ∑ = ⎜ ⎟ i i Z ⎝ ⎠ i Weight of Feature i Feature i 1 if Smoking Cancer ⎧ ¬ ∨ f ( Smoking, Cancer ) = ⎨ 1 0 otherwise ⎩ w 0 . 51 1 =

Log-‑Linear ¡models ¡can ¡ ¡ represent ¡Table ¡CPTs ¡ • Consider ¡pairwise ¡MRF ¡where ¡each ¡edge ¡has ¡ an ¡associated ¡poten;al ¡w/ ¡K^2 ¡features: ¡ ¡ φ ( y s , y t ) = [ . . . , I ( y s = j, y t = k ) , . . . ] • Then ¡we ¡can ¡convert ¡into ¡a ¡poten;al ¡func;on ¡ using ¡the ¡weight ¡for ¡each ¡feature: ¡ ψ ( y s , y t ) = exp ([ θ T st φ st ] jk ]) = exp ( θ st ( j, k )) • But, ¡log-‑linear ¡model ¡is ¡more ¡general ¡ – Feature ¡vectors ¡can ¡be ¡arbitrarily ¡designed ¡

Markov Networks Alan Ri2er Markov Networks Undirected - PowerPoint PPT Presentation

Markov Networks Alan Ri2er Markov Networks Undirected graphical models Smoking Cancer Asthma Cough l Poten;al func;ons defined over cliques Smoking Cancer (S,C) 1 P ( x

Markov Chains Markov Processes Discrete-time Markov Chains Continuous-time Markov Chains Dr

Hidden Markov Models Discrete Markov Processes 1 Hidden Markov Models Hidden Markov Models 2

Markov chains and Hidden Markov Models 9000 Markov chains and HMMs We will discuss: Markov

CSCE 471/871 Lecture 3: Markov Chains Markov Chains and and Hidden Markov Models Hidden

Stochastic Processes Markov Processes Hamid R. Rabiee 1 Overview o Markov Property o Markov

Outline Markov networks (a.k.a. Markov random fields) Markov Networks Reading: Michael

Markov Logic Networks Matt Richardson and Pedro Domingos (2006), Markov Logic Networks, Machine

Markov Logic Networks Matt Richardson and Pedro Domingos (2006), Markov Logic Networks, Machine

Model Repair for Markov Decision Model Repair for Markov Decision Model Repair for Markov

Imprecise Markov chains From basic theory to applications II prof. Jasper De Bock Imprecise

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University Markov Chains and

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University 2 Markov Chains

Discrete Time Markov Chains Discrete-Time Markov Chains Books - Introduction to Stochastic

Hidden Markov Models Steven J Zeil Old Dominion Univ. Fall 2010 1 Discrete Markov Processes

Overview Motivation Verifying Continuous-Time Markov Chains 1 Lecture 1+2: Discrete-Time Markov

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov

Probabilistic Graphical Models Part II: Undirected Graphical Models Selim Aksoy Department of

Directed Graphical Models + Undirected Graphical Models Matt Gormley Lecture 7 Sep. 18, 2019

Undirected Graphical Models Michael Gutmann Probabilistic Modelling and Reasoning (INFR11134)

Properties of the QGP with hard probes Oliver Busch for the ALICE collaboration 1 Oliver Busch

Undirected graphical models Graph G : arbitrary undirected graph Useful when variables interact

Overview MAXENT-Modeling: A framework for Discrete MAXENT-Models and RMs IRT-Modeling?

Outline Vienna, Austria - introduction to the giRaph package The giRaph package for graph

International Association for Cryptologic Research Christian Cachin President, IACR Crypto 2018