Violations by Sampling and Optimization Dana Benjamin Bichsel - PowerPoint PPT Presentation

DP-Finder: Finding Differential Privacy Violations by Sampling and Optimization Dana Benjamin Bichsel Timon Gehr PetarTsankov Martin Vechev Drachsler-Cohen

Differential Privacy – Basic Setting # disease 7 2

Differential Privacy – Basic Setting # disease 7.3 + noise What about my privacy? 3

Differential Privacy - Intuition ? # disease or 7.3 + noise Change my data # disease 7.6 + noise 4

Differential Privacy – More Abstractly 𝑦 Attacker check 𝐺 𝐺(𝑦) 𝐺(𝑦) ∈ Φ ? Neighboring 𝑦′ Attacker check 𝐺 𝐺(𝑦′) 𝐺(𝑦′) ∈ Φ ? 5

Differential Privacy - Definition 𝑦 𝜁 -DP: Attacker check 𝐺 Pr[ 𝐺 𝑦 ∈ Φ ] 𝐺(𝑦) 𝐺(𝑦) ∈ Φ ? Pr[ 𝐺(𝑦 ′ ) ∈ Φ ] ≤ exp 𝜁 ≈ 1 + 𝜁 Neighbouring Challenges induced by DP: 𝑦′ • Proving/checking 𝜁 -DP is hard Attacker check 𝐺 (buggy algorithms) 𝐺(𝑦′) 𝐺(𝑦′) ∈ Φ ? • Proof strategies not complete • Proofs only provide upper bounds 6

𝜁 -DP Counterexamples ( , , ) Φ 𝑦 𝑦′ that violate 𝜁 - DP: Pr[𝐺 𝑦 ∈ Φ] Pr[𝐺(𝑦 ′ ) ∈ Φ] > exp 𝜁 ⟺ log Pr[ 𝐺 𝑦 ∈ Φ ] Pr[ 𝐺(𝑦 ′ ) ∈ Φ ] > 𝜁 7

𝜁 -DP Counterexamples ( , , ) Φ 𝑦 𝑦′ that violate 𝜁 - DP: Pr[𝐺 𝑦 ∈ Φ] Pr[𝐺(𝑦 ′ ) ∈ Φ] > exp 𝜁 ⟺ log Pr[ 𝐺 𝑦 ∈ Φ ] Maximize Pr[ 𝐺(𝑦 ′ ) ∈ Φ ] > 𝜁 ε(𝑦, 𝑦 ′ , Φ) 8

Bounds on "true" 𝜁 Counterexample: Counterexample: Counterexample: 5% -DP 9.9% -DP 15% -DP Proven: 10% -DP ( 𝜁 = 10% = 0.1 ) Evaluation : We get precise and large ε , close to known upper bounds 9

Ƹ Ƹ 𝜁 -DP Counterexamples Goal : Maximize ε(𝑦, 𝑦 ′ , Φ) Challenge 2 : Search space is Challenge 1 : Expensive to sparse: Few 𝑦, 𝑦 ′ , Φ lead to compute ε precisely large ε(𝑦, 𝑦 ′ , Φ) 𝜁 𝑒 𝜁 𝜁 Estimate 𝜁 𝜁 Make Ƹ by sampling differentiable 10

Ƹ Step 1: Estimate 𝜁 𝜁 𝜁 Estimate 𝜁 by sampling 11

Estimating 𝜁 𝜁 x, x ′ , Φ ≔ log Pr[ 𝐺 𝑦 ∈ Φ ] Pr[ 𝐺(𝑦 ′ ) ∈ Φ ] 12

Estimating 𝜁 𝑜 Pr 𝐺(𝑦) ∈ Φ = 1 𝜁 x, x ′ , Φ ≔ log Pr[ 𝐺 𝑦 ∈ Φ ] 𝑗 ෢ check 𝐺,Φ (𝑦) 𝑜 ෍ Pr[ 𝐺(𝑦 ′ ) ∈ Φ ] 𝑗=1 𝑗 𝐺(𝑦) check 𝐺,Φ (𝑦) 𝑦 yes 𝐺 7.3 33% no 𝐺 7.6 67% yes 𝐺 6.8 13

How precise is our estimate? Counterexample: 9.9% ± 10% -DP vs Counterexample: 9.9% ± 2 ∙ 10 −3 -DP Precision of Pr[ 𝐺 𝑦 ∈ Φ ] Sampling Precision of 𝜁 effort 𝑜 and Pr[ 𝐺 𝑦′ ∈ Φ ] Exponential search 14

Estimating precisely is expensive Probabillstic guarantees Heuristic Efficient Heuristic 10 4 Estimating 𝜁 up to an error of 2 ∙ 10 −3 with confidence of 90% 15

Applying the M-CLT (Correlation) yes 𝐺 7.3 𝑜 no 1 𝑗 𝐺 𝑜 ෍ check 𝐺,Φ 𝑦 7.6 𝑗=1 yes 𝐺 6.8 Follows 2D Gaussian distribution yes 𝐺 7.3 𝑜 1 no 𝑗 𝐺 𝑜 ෍ check 𝐺,Φ 𝑦′ 7.6 𝑗=1 no 𝐺 8.2 16

Obtaining a Confidence Interval for 𝜁 Joint likelihood of Likelihood of Confidence Interval Pr[ 𝐺 𝑦 ∈ Φ ] ε(𝑦, x′, Φ) for ε(𝑦, x′, Φ) Pr[ 𝐺 𝑦′ ∈ Φ ] Distribution of Gauss Gauss (correlated): D. V. Hinkley. 1969. On the Ratio of Two Correlated Normal Random Variables. Biometrika 56, 3 (1969), 635 – 639. http://www.jstor.org/stable/2334671 17

How precise is our estimate? Counterexample: 9.9% ± 10% -DP vs Counterexample: 9.9% ± 2 ∙ 10 −3 -DP 18

Ƹ Ƹ Step 2: Finding Counterexamples 𝜁 𝑒 𝜁 𝜁 Make Ƹ differentiable 19

Ƹ How can we optimize our estimate? 1 𝑜 𝑗 𝑜 σ 𝑗=1 check 𝐺,Φ (𝑦) Not differentiable 𝜗 𝑦, 𝑦 ′ , Φ = log maximize 1 𝑜 𝑗 𝑜 σ 𝑗=1 check 𝐺,Φ (𝑦′) Goals • Make differentiable ¬𝐶 ↝ 1 − 𝐶 • Preserve semantics 𝐶 1 ∧ 𝐶 2 ↝ 𝐶 1 ∙ 𝐶 2 if 𝐶 ∶ 𝑦 = 𝐹 1 else ∶ 𝑦 = 𝐹 2 ↝ 𝑦 = 𝐶 ∙ 𝐹 1 + (1 − 𝐶) ∙ 𝐹 2 20

Ƹ How can we optimize our estimate? 1 𝑜 𝑗 𝑜 σ 𝑗=1 check 𝐺,Φ (𝑦) Not differentiable 𝜗 𝑦, 𝑦 ′ , Φ = log maximize 1 𝑜 𝑗 𝑜 σ 𝑗=1 check 𝐺,Φ (𝑦′) • Maximize using SLSQP (supports hard constraints for neighborhood) • Random starting point (+ restart) • What about division by zero? • What about very small denominators? 21

Main differences to Ding et al. Dimension Ding et al. This work Problem statement ε 𝑦, 𝑦 ′ , Φ > ε 0 ? Maximize ε(𝑦, 𝑦 ′ , Φ) Approach Statistical tests Estimate + confidence interval Search By patterns Gradient descent (incremental) 22

Evaluation Exact solver (PSI) • How precise is the differentiable estimate? for ground truth • How efficient is DP-Finder in finding violations compared to random search? 23

Ƹ Precision of Differentiable Estimate 𝜁 𝜁 𝑒 𝜁 Algorithms 24

Random vs Optimized Optimized Random start 25

Ƹ Ƹ Ƹ Conclusion 𝜁 -DP Counterexamples Differential Privacy ( , , ) Estimate 𝜁 Finding Counterexamples 𝜁 𝑒 𝜁 𝜁 𝜁 26

Violations by Sampling and Optimization Dana Benjamin Bichsel - PowerPoint PPT Presentation

DP-Finder: Finding Differential Privacy Violations by Sampling and Optimization Dana Benjamin Bichsel Timon Gehr PetarTsankov Martin Vechev Drachsler-Cohen Differential Privacy Basic Setting # disease 7 2 Differential Privacy Basic

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

Astronomical Tests Possible Violations of . . . Possible Violations of . . . of Relativity:

Sampling Sediment and Sampling Sediment and Sampling Sediment and Porewater Sampling Sediment

Sampling Overview R toy sampling Non-probability sampling Probability Methods (AKA random)

Sampling Methods CMSC 678 UMBC Outline Recap Monte Carlo methods Sampling Techniques Uniform

TOP TEN TOP TEN HAZARDOUS WASTE HAZARDOUS WASTE GENERATOR VIOLATIONS GENERATOR VIOLATIONS And

A National Perspective on Responding to Parole Violations Responses to Parole Violations

Avoiding Antitrust Violations In Avoiding Antitrust Violations In Employment Recruiting Leveraging

Newfound Water Quality Sampling: In Lake Sampling 8 Historic Sampling locations

Sampling Distributions Sampling Distribution of the Mean & Hypothesis Testing Sampling

Overview of Sampling Topics (Shannon) sampling theorem Impulse-train sampling

Optimization of a Sampling Plan using R Optimization of a Sampling Plan using R for Economic Data

15-780: Optimization J. Zico Kolter March 14-16, 2015 1 Outline Introduction to optimization

Scalable Differential Privacy with Certified Robustness in Adversarial Learning NhatHai Phan 1 ,

CSC2412: Definition of Di ff erential Privacy Sasho Nikolov 1 An Ideal Goal The study reveals

3.1 Classic Differential Geometry Hao Li http://cs599.hao-li.com 1 Spring 2014 CSCI 599:

LightDP: Towards Automating Differential Privacy Proofs Danfeng Zhang Daniel Kifer Penn

Gaussian Process Approximations of Stochastic Differential Equations

Linear Differential Equations With Constant Coefficients Alan H. Stein University of Connecticut

Another look at estimating parameters in systems of ordinary differential equations via

Metrics for Differential Privacy in Concurrent Systems Lili Xu 1 , 3 , 4 Konstantinos

Sambuz

Useful Links

Newsletter

Mail Us

Violations by Sampling and Optimization Dana Benjamin Bichsel - PowerPoint PPT Presentation

DP-Finder: Finding Differential Privacy Violations by Sampling and Optimization Dana Benjamin Bichsel Timon Gehr PetarTsankov Martin Vechev Drachsler-Cohen Differential Privacy Basic Setting # disease 7 2 Differential Privacy Basic

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

Astronomical Tests Possible Violations of . . . Possible Violations of . . . of Relativity:

Sampling Sediment and Sampling Sediment and Sampling Sediment and Porewater Sampling Sediment

Sampling Overview R toy sampling Non-probability sampling Probability Methods (AKA random)

Sampling Methods CMSC 678 UMBC Outline Recap Monte Carlo methods Sampling Techniques Uniform

TOP TEN TOP TEN HAZARDOUS WASTE HAZARDOUS WASTE GENERATOR VIOLATIONS GENERATOR VIOLATIONS And

A National Perspective on Responding to Parole Violations Responses to Parole Violations

Avoiding Antitrust Violations In Avoiding Antitrust Violations In Employment Recruiting Leveraging

Newfound Water Quality Sampling: In Lake Sampling 8 Historic Sampling locations

Sampling Distributions Sampling Distribution of the Mean &amp; Hypothesis Testing Sampling

Overview of Sampling Topics (Shannon) sampling theorem Impulse-train sampling

Optimization of a Sampling Plan using R Optimization of a Sampling Plan using R for Economic Data

15-780: Optimization J. Zico Kolter March 14-16, 2015 1 Outline Introduction to optimization

Scalable Differential Privacy with Certified Robustness in Adversarial Learning NhatHai Phan 1 ,

CSC2412: Definition of Di ff erential Privacy Sasho Nikolov 1 An Ideal Goal The study reveals

3.1 Classic Differential Geometry Hao Li http://cs599.hao-li.com 1 Spring 2014 CSCI 599:

LightDP: Towards Automating Differential Privacy Proofs Danfeng Zhang Daniel Kifer Penn

Gaussian Process Approximations of Stochastic Differential Equations

Linear Differential Equations With Constant Coefficients Alan H. Stein University of Connecticut

Another look at estimating parameters in systems of ordinary differential equations via

Metrics for Differential Privacy in Concurrent Systems Lili Xu 1 , 3 , 4 Konstantinos

Sambuz

Useful Links

Newsletter

Mail Us

Sampling Distributions Sampling Distribution of the Mean & Hypothesis Testing Sampling