Bayesian Network Parameter Learning from Incomplete Data Guy Van - - PowerPoint PPT Presentation

bayesian network parameter learning
SMART_READER_LITE
LIVE PREVIEW

Bayesian Network Parameter Learning from Incomplete Data Guy Van - - PowerPoint PPT Presentation

Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data Guy Van den Broeck, Karthika Mohan, Arthur Choi, Adnan Darwiche, and Judea Pearl UCLA UAI 2015 Learning from Incomplete Data Input: data and BN structure


slide-1
SLIDE 1

Efficient Algorithms for Bayesian Network Parameter Learning from Incomplete Data

Guy Van den Broeck, Karthika Mohan, Arthur Choi, Adnan Darwiche, and Judea Pearl

UCLA

UAI 2015

slide-2
SLIDE 2

Learning from Incomplete Data

  • Input: data and BN structure

E.g., Gender wage gap study

  • Output: BN parameters

E.g., θGender ,θ Experience|Gender ,θ Qualification|Gender, , etc.

X1 (Gender) X2 (Experience) X3 (Qualification) X4 (Income)

1 1 1 1 ? 1 1 1 1 ? 1 1 ? ? ? ? 1 1

( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

slide-3
SLIDE 3

Current Approaches: Properties

Likelihood Optimization Inference-Free ✘ Consistent for MCAR ✔ Consistent for MAR ✔ Consistent for MNAR ✘ Maximum Likelihood ✔

slide-4
SLIDE 4

Current Approaches: Properties

Likelihood Optimization Expectation Maximization Inference-Free ✘ ✘ Consistent for MCAR ✔ ✔/✘ Consistent for MAR ✔ ✔/✘ Consistent for MNAR ✘ ✘ Maximum Likelihood ✔ ✔/✘ Closed Form n/a ✘ Passes over the data n/a ?

slide-5
SLIDE 5

Problem Statement

Likelihood Optimization Expectation Maximization Inference-Free ✘ ✘ Consistent for MCAR ✔ ✔/✘ Consistent for MAR ✔ ✔/✘ Consistent for MNAR ✘ ✘ Maximum Likelihood ✔ ✔/✘ Closed Form n/a ✘ Passes over the data n/a ?

Conventional wisdom: this is inevitable!

slide-6
SLIDE 6

Contribution

Likelihood Optimization Expectation Maximization Deletion

[this paper]

Inference-Free ✘ ✘ ✔ Consistent for MCAR ✔ ✔/✘ ✔ Consistent for MAR ✔ ✔/✘ ✔ Consistent for MNAR ✘ ✘ ✔/✘ Maximum Likelihood ✔ ✔/✘ ✘ Closed Form n/a ✘ ✔ Passes over the data n/a ? 1

slide-7
SLIDE 7

Missingness Graphs

X1 RX1 X1 * X1 * = X1 if RX1 = ob m if RX1 = unob

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

X2 * ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

+

Fully observed variables Xo = {X1} Partially observed variables Xm = {X2, X3, X4}

slide-8
SLIDE 8

Missingness Dataset

  • Encoding of the data

– Fully observed vars Xo – Causal mechanisms R – Proxies for Xm

  • Fully observed
  • Data distribution PrD(.)

X1 * = X1 if RX1 = ob m if RX1 = unob

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 m unob

  • b

0.100 m 1 unob

  • b

0.020 … … … … … …

slide-9
SLIDE 9

Algorithms

  • Missingness categories (classes of graphs)

– Missing Completely At Random (MCAR) – Missing At Random (MAR) – Missing Not At Random (MNAR)

  • Deletion techniques

– Direct Deletion – Factored Deletion – Informed Deletion

slide-10
SLIDE 10

Missing Completely at Random (MCAR)

(Xm Xo ) R

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Experience Income

slide-11
SLIDE 11

Missing Completely at Random (MCAR)

(Xm Xo ) R (X1 X2 X3 X4 ) (RX2 RX3 RX4 )

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Experience Income

slide-12
SLIDE 12

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2
slide-13
SLIDE 13

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 = 𝑄𝑠 𝑌1𝑌2 𝑆𝑌2 = 𝑝𝑐 Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2
slide-14
SLIDE 14

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 = 𝑄𝑠 𝑌1𝑌2 𝑆𝑌2 = 𝑝𝑐 = 𝑄𝑠 𝑌1𝑌2

∗ 𝑆𝑌2 = 𝑝𝑐

Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2
slide-15
SLIDE 15

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 = 𝑄𝑠 𝑌1𝑌2 𝑆𝑌2 = 𝑝𝑐 = 𝑄𝑠 𝑌1𝑌2

∗ 𝑆𝑌2 = 𝑝𝑐

= 𝑄𝑠𝐸(𝑌1𝑌2

∗|𝑆𝑌2 = 𝑝𝑐)

Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2
slide-16
SLIDE 16

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 = 𝑄𝑠 𝑌1𝑌2 𝑆𝑌2 = 𝑝𝑐 = 𝑄𝑠 𝑌1𝑌2

∗ 𝑆𝑌2 = 𝑝𝑐

= 𝑄𝑠𝐸(𝑌1𝑌2

∗|𝑆𝑌2 = 𝑝𝑐)

Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 … … … … … … 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 m unob

  • b

0.100 m 1 unob

  • b

0.020 … … … … … …

slide-17
SLIDE 17

Direct Deletion (MCAR)

(Xm Xo) R

Estimand: 𝑄𝑠 𝑌1, 𝑌2 = 𝑄𝑠 𝑌1𝑌2 𝑆𝑌2 = 𝑝𝑐 = 𝑄𝑠 𝑌1𝑌2

∗ 𝑆𝑌2 = 𝑝𝑐

= 𝑄𝑠𝐸(𝑌1𝑌2

∗|𝑆𝑌2 = 𝑝𝑐)

Independencies:

  • (X1 X2) ⫫ R
  • (X1 X2) ⫫ RX2
  • Cf. listwise and pairwise deletion in statistics

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 … … … … … … 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 m unob

  • b

0.100 m 1 unob

  • b

0.020 … … … … … …

slide-18
SLIDE 18

Factored Deletion (MCAR)

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Many ways of factorizing the estimand!

slide-19
SLIDE 19

Factored Deletion (MCAR)

𝑄(𝑌1) 1 X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Many ways of factorizing the estimand!

slide-20
SLIDE 20

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌2) 𝑄 𝑌2 = 𝑄(𝑌2|𝑆𝑌2 = 𝑝𝑐) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-21
SLIDE 21

𝑄 𝑌3 = 𝑄(𝑌3|𝑆𝑌3 = 𝑝𝑐)

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌3) 𝑄(𝑌2) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-22
SLIDE 22

𝑄 𝑌1, 𝑌2 = 𝑄 𝑌2 𝑌1, 𝑆𝑌2 = 𝑝𝑐 𝑄(𝑌1) 𝑄(𝑌2|𝑌1)

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌3) 𝑄(𝑌2) 𝑄(𝑌1, 𝑌2 ) 𝑄 𝑌1, 𝑌2 = 𝑄(𝑌1|𝑌2, 𝑆𝑌2 = 𝑝𝑐) 𝑄(𝑌2|𝑆𝑌2 = 𝑝𝑐) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-23
SLIDE 23

𝑄(𝑌2|𝑌1)

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌3) 𝑄(𝑌2) 𝑄(𝑌1, 𝑌2 ) 𝑄(𝑌1, 𝑌3 ) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-24
SLIDE 24

𝑄(𝑌2|𝑌3) 𝑄(𝑌2|𝑌1)

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌3) 𝑄(𝑌2) 𝑄(𝑌1, 𝑌2 ) 𝑄(𝑌2, 𝑌3 ) 𝑄(𝑌1, 𝑌3 ) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-25
SLIDE 25

𝑄(𝑌2|𝑌3) 𝑄(𝑌2|𝑌1)

Factored Deletion (MCAR)

𝑄(𝑌1) 1 𝑄(𝑌3) 𝑄(𝑌2) 𝑄(𝑌1, 𝑌2 ) 𝑄(𝑌2, 𝑌3 ) 𝑄(𝑌1, 𝑌3 ) 𝑄(𝑌1, 𝑌2 , 𝑌3) X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020 𝑄(𝑌2)

Many ways of factorizing the estimand!

slide-26
SLIDE 26

Factored Deletion (MCAR)

  • Aggregate all factorizations in lattice
  • Simple algorithm
  • Data used for Pr(Y)?
  • Direct deletion: data where all Y observed
  • Factored deletion: data where some Y observed
slide-27
SLIDE 27

MCAR Experiments (data size)

(Alarm network) Small loss of statistical power

slide-28
SLIDE 28

MCAR Experiments (time)

(Alarm network) Huge gain in computational power

slide-29
SLIDE 29

Missing At Random (MAR)

Xm R | Xo

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

slide-30
SLIDE 30

Missing At Random (MAR)

Xm R | Xo (X2 X3 X4 ) (RX2 RX3 RX4 ) | X1

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

slide-31
SLIDE 31

Missing At Random (MAR)

Xm R | Xo (X2 X3 X4 ) (RX2 RX3 RX4 ) | X1

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

Most-general class where maximum-likelihood is consistent!

slide-32
SLIDE 32

Direct Deletion (MAR)

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Independencies: (X2 X3 ) R | X1

Xm R | Xo

Estimand:

slide-33
SLIDE 33

Direct Deletion (MAR)

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Independencies: (X2 X3 ) R | X1

𝑄 𝑌2, 𝑌3 = 𝑄 𝑌2𝑌3 𝑌1 𝑄(𝑌1)

𝑌1

= 𝑄 𝑌2𝑌3 𝑌1, 𝑆 = 𝑝𝑐 𝑄(𝑌1)

𝑌1

Xm R | Xo

Estimand:

slide-34
SLIDE 34

Direct Deletion (MAR)

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Independencies: (X2 X3 ) R | X1

𝑄 𝑌2, 𝑌3 = 𝑄 𝑌2𝑌3 𝑌1 𝑄(𝑌1)

𝑌1

= 𝑄 𝑌2𝑌3 𝑌1, 𝑆 = 𝑝𝑐 𝑄(𝑌1)

𝑌1

Xm R | Xo

Estimand:

slide-35
SLIDE 35

Direct Deletion (MAR)

X1 X*2 X*3 RX2 RX3 P*

  • b
  • b

0.200 1

  • b
  • b

0.100 1

  • b
  • b

0.050 1 1

  • b
  • b

0.050 1

  • b
  • b

0.060 1 1

  • b
  • b

0.040 1 1

  • b
  • b

0.070 1 1 1

  • b
  • b

0.030 m

  • b

unob 0.100 1 m

  • b

unob 0.020 1 m

  • b

unob 0.080 1 1 m

  • b

unob 0.180 1 m m unob unob 0.020

Independencies: (X2 X3 ) R | X1

𝑄 𝑌2, 𝑌3 = 𝑄 𝑌2𝑌3 𝑌1 𝑄(𝑌1)

𝑌1

= 𝑄 𝑌2𝑌3 𝑌1, 𝑆 = 𝑝𝑐 𝑄(𝑌1)

𝑌1

Xm R | Xo

Estimand:

slide-36
SLIDE 36

Factored Deletion (MAR)

slide-37
SLIDE 37

MAR Experiments (Fire Alarm)

INCONSISTENT

slide-38
SLIDE 38

MAR Experiments (Alarm)

slide-39
SLIDE 39

MAR Experiments (Intractable)

Log-likelihoods of large intractable networks

slide-40
SLIDE 40

Informed Deletion

Xm R | Xo

Direct Deletion 𝑄 𝑌1, 𝑌2 = 𝑄 𝑌2|𝑌1, 𝑌3, 𝑆𝑌2 = 𝑝𝑐 𝑄(𝑌1, 𝑌3)

𝑌3

RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 General m-graph depicting MAR

slide-41
SLIDE 41

Informed Deletion

Xm R | Xo

Direct Deletion 𝑄 𝑌1, 𝑌2 = 𝑄 𝑌2|𝑌1, 𝑌3, 𝑆𝑌2 = 𝑝𝑐 𝑄(𝑌1, 𝑌3)

𝑌3

RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 General m-graph depicting MAR Problem specific m-graph

slide-42
SLIDE 42

Informed Deletion

Xm R | Xo

Direct Deletion 𝑄 𝑌1, 𝑌2 = 𝑄 𝑌2|𝑌1, 𝑌3, 𝑆𝑌2 = 𝑝𝑐 𝑄(𝑌1, 𝑌3)

𝑌3

RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 General m-graph depicting MAR Problem specific m-graph

𝑺𝒀𝟑 | 𝒀𝟒 𝒀𝟐 𝑺𝒀𝟑 | 𝒀𝟐 𝒀𝟒

slide-43
SLIDE 43

Informed Deletion

Xm R | Xo

Direct Deletion 𝑄 𝑌1, 𝑌2 = 𝑄 𝑌2|𝑌1, 𝑌3, 𝑆𝑌2 = 𝑝𝑐 𝑄(𝑌1, 𝑌3)

𝑌3

Informed Deletion 𝑄 𝑌1, 𝑌2 = 𝑸 𝒀𝟑|𝒀𝟐, 𝑺𝒀𝟑 = 𝒑𝒄 𝑄(𝑌1) RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 RX2 ( X1 ) ( X3 ) ( X2 ) ( X4 ) RX4 General m-graph depicting MAR Problem specific m-graph

𝑺𝒀𝟑 | 𝒀𝟒 𝒀𝟐 𝑺𝒀𝟑 | 𝒀𝟐 𝒀𝟒

slide-44
SLIDE 44

Missing Not At Random (MNAR)

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

slide-45
SLIDE 45

Missing Not At Random (MNAR)

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

slide-46
SLIDE 46

Missing Not At Random (MNAR)

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

RX2 RX4 RX3 ( X1 ) ( X3 ) ( X2 ) ( X4 )

Gender Qualification Experience Income

X Y 𝑆𝑌 𝑆𝑍

𝑄 𝑌, 𝑍 = 𝑄(𝑆𝑌 = 𝑝𝑐, 𝑆𝑍 = 𝑝𝑐, 𝑌, 𝑍) 𝑄 𝑆𝑌 = 𝑝𝑐 𝑍, 𝑆𝑍 = 𝑝𝑐 𝑄(𝑆𝑍 = 𝑝𝑐|𝑌, 𝑆𝑌 = 𝑝𝑐)

slide-47
SLIDE 47

Conclusions

  • Everybody loves to hate EM (slow, stuck, etc.)
  • Deletion is solution to some EM problems
  • Opens doors

– Big incomplete data – Consistent learning of intractable networks – Efficient structure learning from incomplete data – Learning from MNAR data

  • Surprising (given BN textbooks)?
  • Code: http://reasoning.cs.ucla.edu/deletion
slide-48
SLIDE 48

Thanks!