private equilibrium computation for analyst privacy
play

Private Equilibrium Computation for Analyst Privacy ?? ?? ?? ?? - PowerPoint PPT Presentation

Private Equilibrium Computation for Analyst Privacy ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? Justin Hsu, Aaron Roth, 1 Jonathan Ullman 2 1 University of Pennsylvania 2 Harvard University June 2, 2013 A market survey scenario A market


  1. Private Equilibrium Computation for Analyst Privacy ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? Justin Hsu, Aaron Roth, 1 Jonathan Ullman 2 1 University of Pennsylvania 2 Harvard University June 2, 2013

  2. A market survey scenario

  3. A market survey scenario ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  4. A market survey scenario ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  5. A market survey scenario ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? Requirements • Data privacy: protect the consumer’s privacy

  6. A market survey scenario ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? Requirements • Data privacy: protect the consumer’s privacy

  7. A market survey scenario ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? Requirements • Data privacy: protect the consumer’s privacy • Analyst privacy [DNV’12]: protect the analyst’s privacy

  8. (Standard) Differential privacy [DMNS’06] [Dwork-McSherry-Nissim-Smith 06] D Bob Chris Xavier Donna Ernie Alice Algorithm ratio bounded Pr [r]

  9. More formally Definition (DMNS’06) Let M be a randomized mechanism from databases to range R , and let D , D ′ be databases differing in one record. M is ǫ -differentially private if for every r ∈ R , Pr[ M ( D ) = r ] ≤ e ǫ · Pr[ M ( D ′ ) = r ] . Useful properties • Very strong, worst-case privacy guarantee • Well-behaved under composition, post-processing

  10. Many-to-one-analyst privacy [DNV’12] Intuition • A single analyst can’t tell if other analysts change their queries

  11. Many-to-one-analyst privacy [DNV’12] Intuition • A single analyst can’t tell if other analysts change their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  12. Many-to-one-analyst privacy [DNV’12] Intuition • A single analyst can’t tell if other analysts change their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  13. Many-to-one-analyst privacy [DNV’12] Intuition • A single analyst can’t tell if other analysts change their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  14. Many-to-one-analyst privacy [DNV’12] Intuition • A single analyst can’t tell if other analysts change their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  15. One-query-to-many-analyst privacy (Today) Intuition • All but one analyst (possibly colluding) can’t tell if last analyst changes one of their queries

  16. One-query-to-many-analyst privacy (Today) Intuition • All but one analyst (possibly colluding) can’t tell if last analyst changes one of their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  17. One-query-to-many-analyst privacy (Today) Intuition • All but one analyst (possibly colluding) can’t tell if last analyst changes one of their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  18. One-query-to-many-analyst privacy (Today) Intuition • All but one analyst (possibly colluding) can’t tell if last analyst changes one of their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  19. One-query-to-many-analyst privacy (Today) Intuition • All but one analyst (possibly colluding) can’t tell if last analyst changes one of their queries ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ?? ??

  20. The query release problem Basic problem • Analysts want accurate answers to a large set Q of counting (linear) queries

  21. The query release problem Basic problem • Analysts want accurate answers to a large set Q of counting (linear) queries “What fraction of records satisfy P ?”

  22. The query release problem Basic problem • Analysts want accurate answers to a large set Q of counting (linear) queries “What fraction of records satisfy P ?” • Privately construct synthetic database to answer queries

  23. The query release problem Basic problem • Analysts want accurate answers to a large set Q of counting (linear) queries “What fraction of records satisfy P ?” • Privately construct synthetic database to answer queries Prior work • Long line of work [BLR’08, RR’09, HR’10,. . . ], data privacy

  24. The query release problem Basic problem • Analysts want accurate answers to a large set Q of counting (linear) queries “What fraction of records satisfy P ?” • Privately construct synthetic database to answer queries Prior work • Long line of work [BLR’08, RR’09, HR’10,. . . ], data privacy • Stateful mechanisms: not analyst private

  25. Accuracy Theorem Suppose the analysts ask queries Q , and let the database have n records from X . There exists an ǫ analyst and data private mechanism which achieves error α on all queries in Q , where � polylog ( |X| , |Q| ) � α = O ǫ √ n .

  26. Plan for rest of the talk Outline • Interpretation of query release as a game • Privately solving the query release game • Analyst private query release

  27. The query release game

  28. The query release game Record r

  29. The query release game Record r Query q

  30. The query release game Record r Query q Loss q ( r ) − q ( D ) ( D is true database)

  31. The query release game Record r Query q Loss q ( r ) − q ( D ) Loss − ( q ( r ) − q ( D )) ( D is true database)

  32. From strategies to query release Database as a distribution • Think of true database D as a distribution over records • ˆ D is data player’s distribution over records

  33. From strategies to query release Database as a distribution • Think of true database D as a distribution over records • ˆ D is data player’s distribution over records Mixed strategy

  34. From strategies to query release Database as a distribution • Think of true database D as a distribution over records • ˆ D is data player’s distribution over records Mixed strategy • Versus a counting query q , data player’s expected loss: D [ q ( r ) − q ( D )] = q ( ˆ D ) − q ( D ) E r ∼ ˆ

  35. From strategies to query release Database as a distribution • Think of true database D as a distribution over records • ˆ D is data player’s distribution over records Mixed strategy • Versus a counting query q , data player’s expected loss: D [ q ( r ) − q ( D )] = q ( ˆ D ) − q ( D ) E r ∼ ˆ • D is mixed strategy with zero loss Equilibrium strategy

  36. From strategies to query release What if small expected loss? • Suppose data player’s expected loss less than α for all queries

  37. From strategies to query release α -approximate equilibrium What if small expected loss? • Suppose data player’s expected loss less than α for all queries

  38. From strategies to query release α -approximate equilibrium What if small expected loss? • Suppose data player’s expected loss less than α for all queries • Data distribution answers all queries with error at most α

  39. From strategies to query release α -approximate equilibrium What if small expected loss? • Suppose data player’s expected loss less than α for all queries • Data distribution answers all queries with error at most α Query release!

  40. From strategies to query release α -approximate equilibrium What if small expected loss? • Suppose data player’s expected loss less than α for all queries • Data distribution answers all queries with error at most α Synthetic Query release! database

  41. From strategies to query release α -approximate equilibrium What if small expected loss? • Suppose data player’s expected loss less than α for all queries • Data distribution answers all queries with error at most α Synthetic Query release! database • But how to compute this?

  42. Computing the equilibrium privately Known approach: repeated game • Players maintain distributions over actions

  43. Computing the equilibrium privately Known approach: repeated game • Players maintain distributions over actions • Loop: • Sample and play action

  44. Computing the equilibrium privately Known approach: repeated game • Players maintain distributions over actions • Loop: • Sample and play action • Receive loss for all actions

  45. Computing the equilibrium privately Known approach: repeated game • Players maintain distributions over actions • Loop: • Sample and play action • Receive loss for all actions • Update distribution: increase probability of better actions

  46. Computing the equilibrium privately Known approach: repeated game • Players maintain distributions over actions • Loop: • Sample and play action • Receive loss for all actions • Update distribution: increase probability of better actions Multiplicative weights (MW)

  47. Computing equilibrium strategy privately Query q Record r Loss q ( r ) − q ( D ) Loss − ( q ( r ) − q ( D ))

  48. Computing equilibrium strategy privately MW MW Loss q ( r ) − q ( D ) Loss − ( q ( r ) − q ( D ))

  49. Computing equilibrium strategy privately Query q Record r MW MW

  50. Computing equilibrium strategy privately Idea: use distribution over plays [FS’96] • Both players use multiplicative weights • MW distributions converge to approximate equilibrium

  51. Computing equilibrium strategy privately Idea: use distribution over plays [FS’96] • Both players use multiplicative weights Not private • MW distributions converge to approximate equilibrium

Recommend


More recommend