Privately Learning Markov Random Fields

Huanyu Zhang, Cornell University · Gautam Kamath, University of Waterloo · Janardhan Kulkarni, Microsoft Research · Zhiwei Steven Wu, University of Minnesota
Table of contents
1. Problem formulation
2. Main results
3. Private structure learning
4. Private parameter learning
5. Generalization to other GMs
Problem formulation
Ising models

D(A) is a distribution on {±1}^p s.t.

Pr(Z = z) ∝ exp(Σ_{i<j} A_{i,j} z_i z_j + Σ_i A_{i,i} z_i),

where A ∈ R^{p×p} is a symmetric weight matrix. For example:

A = [ 0 1 1 0 0 0
      1 0 1 0 0 0
      1 1 0 0 0 0
      0 0 0 0 1 1
      0 0 0 1 0 1
      0 0 0 1 1 0 ]
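To make the definition concrete, here is a minimal sketch (assuming NumPy; enumeration is only feasible at toy dimensions) that computes the unnormalized log-weights and the exact distribution for the example matrix above:

```python
import itertools

import numpy as np


def log_weight(A, z):
    """Unnormalized log-probability: sum_{i<j} A_ij z_i z_j + sum_i A_ii z_i."""
    p = len(z)
    pairwise = sum(A[i, j] * z[i] * z[j] for i in range(p) for j in range(i + 1, p))
    external = sum(A[i, i] * z[i] for i in range(p))
    return pairwise + external


def exact_distribution(A):
    """Enumerate all 2^p states; only feasible for toy p."""
    p = A.shape[0]
    states = list(itertools.product([-1, 1], repeat=p))
    weights = np.array([np.exp(log_weight(A, z)) for z in states])
    return states, weights / weights.sum()


# Two disjoint triangles, matching the example matrix A on this slide.
A = np.zeros((6, 6))
for i, j in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5)]:
    A[i, j] = A[j, i] = 1.0
states, probs = exact_distribution(A)
# The highest-probability states align all spins within each triangle.
print(states[int(np.argmax(probs))], probs.max())
```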
Applications of Ising models

Ising models are heavily used in physics, social networks, etc.

Magnets:
• Each dimension represents a particular 'spin' in the material.
• −1 if the spin points down; +1 if the spin points up.

Social networks:
• Each dimension is a person in the network.
• −1 represents voting for Hillary; +1 represents voting for Trump.
Two alternative objectives

h: unknown Ising model. Input: i.i.d. samples X_1^n from h.

Structure learning: output Â ∈ {0, 1}^{p×p} s.t. w.h.p., ∀ i ≠ j, Â_{i,j} = 1(A_{i,j} ≠ 0).

Parameter learning: given accuracy α, output Â ∈ R^{p×p} s.t. w.h.p., ∀ i ≠ j, |Â_{i,j} − A_{i,j}| ≤ α.

Sample complexity: the least n needed to estimate h.
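The two objectives are related: when every true edge weight is bounded away from zero, accurate parameter estimates can be thresholded into a structure estimate. A minimal sketch, assuming NumPy and a hypothetical known lower bound eta on the magnitude of true nonzero edge weights:

```python
import numpy as np


def structure_from_parameters(A_hat, eta):
    """Threshold parameter estimates to recover the edge set.

    Assumes every true nonzero |A_ij| >= eta and that A_hat is accurate to
    alpha < eta/2 entrywise, so thresholding at eta/2 classifies every pair.
    """
    E = (np.abs(A_hat) > eta / 2).astype(int)
    np.fill_diagonal(E, 0)  # structure concerns off-diagonal entries only
    return E
```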
Privacy

Data may contain sensitive information.

Medical studies:
• Learn the behavior of genetic mutations.
• Data contains health records or disease history.

Navigation:
• Suggests routes based on the aggregate positions of individuals.
• Position information can reveal a user's residence.
Differential privacy (DP) [Dwork et al., 2006]

f̂ is (ε, δ)-DP if for any X_1^n and Y_1^n with d_ham(X_1^n, Y_1^n) ≤ 1, and for all measurable S,

Pr[f̂(X_1^n) ∈ S] ≤ e^ε · Pr[f̂(Y_1^n) ∈ S] + δ.
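For intuition, here is the canonical (ε, 0)-DP primitive (a standard textbook example, not an algorithm from this paper): release the mean of bounded data with Laplace noise calibrated to its sensitivity. A minimal sketch assuming NumPy:

```python
import numpy as np


def private_mean(x, eps, lo=-1.0, hi=1.0):
    """(eps, 0)-DP release of the mean of values clipped to [lo, hi].

    Changing one record moves the clipped mean by at most (hi - lo) / n,
    so Laplace noise of scale sensitivity / eps satisfies the definition above.
    """
    x = np.clip(np.asarray(x, dtype=float), lo, hi)
    sensitivity = (hi - lo) / len(x)
    return x.mean() + np.random.laplace(scale=sensitivity / eps)
```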
Privately learning Ising models

Given i.i.d. samples from an unknown Ising model h, the goals are:
• Accuracy: achieve structure learning or parameter learning.
• Privacy: the estimator must satisfy (ε, δ)-DP.
Main results
Main results

Assumption: the underlying graph has bounded degree.

              Structure Learning            Parameter Learning
Non-private   O(log p) [Wu et al., 2019]    O(log p) [Wu et al., 2019]
(ε, δ)-DP     Θ(log p)                      Θ(√p)
(ε, 0)-DP     Ω(p)                          Ω(p)
Only (ε, δ)-DP structure learning is tractable in high dimensions!
Private structure learning
Private structure learning - upper bound

Our (ε, δ)-DP upper bound comes from Propose-Test-Release.

Lemma 1 [Dwork and Lei, 2009]. Given the existence of an m-sample non-private structure learning algorithm, there exists an (ε, δ)-DP algorithm with sample complexity n = O(m log(1/δ) / ε).

We note that this method does not work when δ = 0.
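A schematic of the Propose-Test-Release pattern behind Lemma 1 (a sketch, not the paper's exact instantiation: the function names, the stability test, and the threshold below are all illustrative). Split the data into blocks, run the non-private learner on each, privately check that nearly all blocks output the same graph, and release only on success:

```python
from collections import Counter

import numpy as np


def ptr_structure_learning(samples, nonprivate_sl, eps, delta, k):
    """Schematic Propose-Test-Release wrapper around a non-private learner.

    One sample lands in one block, so the count of blocks disagreeing with
    the modal graph changes by O(1) when one sample changes.
    """
    blocks = np.array_split(samples, k)
    graphs = [tuple(map(tuple, nonprivate_sl(b))) for b in blocks]  # hashable
    mode, votes = Counter(graphs).most_common(1)[0]
    disagreements = k - votes
    # Test: privately verify that almost every block agrees with the mode.
    noisy = disagreements + np.random.laplace(scale=1.0 / eps)
    if noisy > 2.0 * np.log(1.0 / delta) / eps:  # illustrative threshold
        return None  # refuse to answer rather than risk a privacy leak
    return np.array(mode)
```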
Private structure learning - lower bound

Our (ε, 0)-DP lower bound comes from a reduction from product distribution learning. By a packing argument, we show n = Ω(p).
Private structure learning

              Structure Learning            Parameter Learning
Non-private   O(log p) [Wu et al., 2019]    O(log p) [Wu et al., 2019]
(ε, δ)-DP     Θ(log p)                      ?
(ε, 0)-DP     Ω(p)                          Ω(p)
Private parameter learning
Private parameter learning - upper bound

The following lemma is a nice property of Ising models.

Lemma 2. Let Z ∼ D(A). Then ∀ i ∈ [p] and ∀ x ∈ {±1}^{p−1},

Pr(Z_i = 1 | Z_{−i} = x) = σ(Σ_{j≠i} 2A_{i,j} x_j + 2A_{i,i}),

where σ is the sigmoid function.

[Figure: a vector of ±1 spins with one unknown coordinate, predicted from the remaining coordinates.]

Question: Can we utilize sparse logistic regression?
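Lemma 2 can be checked numerically: compute the conditional probability both via the sigmoid formula and by brute force from the definition of D(A). A minimal sketch assuming NumPy (the random 4-dimensional instance is only for illustration):

```python
import numpy as np


def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))


def conditional_via_lemma(A, i, x):
    """Pr(Z_i = 1 | Z_{-i} = x) via Lemma 2; x holds the other p-1 coordinates."""
    z = np.insert(np.asarray(x, dtype=float), i, 0.0)  # placeholder at slot i
    t = 2.0 * sum(A[i, j] * z[j] for j in range(len(z)) if j != i) + 2.0 * A[i, i]
    return sigmoid(t)


def conditional_brute_force(A, i, x):
    """The same quantity computed directly from the definition of D(A)."""
    def weight(z):
        p = len(z)
        return np.exp(sum(A[a, b] * z[a] * z[b] for a in range(p) for b in range(a + 1, p))
                      + sum(A[a, a] * z[a] for a in range(p)))
    plus = weight(np.insert(np.asarray(x, dtype=float), i, 1.0))
    minus = weight(np.insert(np.asarray(x, dtype=float), i, -1.0))
    return plus / (plus + minus)


rng = np.random.default_rng(0)
B = rng.normal(size=(4, 4))
A = (B + B.T) / 2                       # random symmetric weight matrix
x = rng.choice([-1.0, 1.0], size=3)     # values of the other three coordinates
assert np.isclose(conditional_via_lemma(A, 2, x), conditional_brute_force(A, 2, x))
```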
Private parameter learning - upper bound

Answer: Yes! And there are two advantages:
• O(log p) samples are enough without privacy [Wu et al., 2019].
• It can be solved efficiently and privately by the private Frank-Wolfe algorithm [Talwar et al., 2015] (see the sketch below).
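A skeleton of the private Frank-Wolfe idea for ℓ1-constrained logistic regression, offered as a sketch only: the per-step budget eps_step and the noise scale below are schematic (they assume feature entries in [−1, 1]); the exact calibration and the composition over iterations follow Talwar et al. [2015]:

```python
import numpy as np


def private_frank_wolfe(X, y, radius, T, eps_step):
    """Sketch: minimize average logistic loss over the l1 ball of given radius.

    Each step picks a vertex of the l1 ball (a signed, scaled basis vector)
    by report-noisy-max over gradient coordinates.
    """
    n, d = X.shape
    w = np.zeros(d)
    for t in range(T):
        margins = y * (X @ w)
        # Gradient of (1/n) sum_i log(1 + exp(-y_i <w, x_i>)).
        grad = -(X * (y / (1.0 + np.exp(margins)))[:, None]).mean(axis=0)
        # Report-noisy-max: each gradient coordinate has sensitivity O(1/n),
        # so Laplace noise of this (schematic) scale privatizes the argmax.
        scores = np.abs(grad) + np.random.laplace(scale=4.0 / (n * eps_step), size=d)
        j = int(np.argmax(scores))
        vertex = np.zeros(d)
        vertex[j] = -np.sign(grad[j]) * radius
        step = 2.0 / (t + 2.0)              # standard Frank-Wolfe step size
        w = (1.0 - step) * w + step * vertex
    return w
```

The design point is that Frank-Wolfe only needs an argmax over the 2d vertices of the ℓ1 ball at each step, which report-noisy-max answers privately at low cost; this is what makes the method tractable in high dimensions.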
Private parameter learning - lower bound

We use a reduction similar to the one for structure learning: our (ε, δ)-DP lower bound comes from a reduction from product distribution learning.
Private parameter learning

              Structure Learning            Parameter Learning
Non-private   O(log p) [Wu et al., 2019]    O(log p) [Wu et al., 2019]
(ε, δ)-DP     Θ(log p)                      Θ(√p)
(ε, 0)-DP     Ω(p)                          Ω(p)
Generalization to other GMs
Generalization to other GMs

Similar results hold for other graphical models:
• Binary t-wise Markov Random Fields: from pairwise to t-wise dependencies.
• Pairwise graphical models on a general alphabet: alphabet from {±1}^p to [k]^p.
The End

Paper ID: 112. Details in the full paper online: https://arxiv.org/pdf/2002.09463.pdf
References

Dwork, C. and Lei, J. (2009). Differential privacy and robust statistics. In Proceedings of the 41st Annual ACM Symposium on Theory of Computing, pages 371–380.

Dwork, C., McSherry, F., Nissim, K., and Smith, A. (2006). Calibrating noise to sensitivity in private data analysis. In Proceedings of the 3rd Conference on Theory of Cryptography, TCC '06, pages 265–284, Berlin, Heidelberg. Springer.

Talwar, K., Thakurta, A. G., and Zhang, L. (2015). Nearly optimal private LASSO. In Advances in Neural Information Processing Systems, pages 3025–3033.

Wu, S., Sanghavi, S., and Dimakis, A. G. (2019). Sparse logistic regression learns all discrete pairwise graphical models. In Advances in Neural Information Processing Systems, pages 8069–8079.