SLIDE 1 Cryptography & Machine Learning: What Else?
SHAFI GOLDWASSER
SLIDE 2 Crypto '81
- Exciting
- Informal
- Art rather than a science
SLIDE 3 Simons Institute for the Theory of Computing
Integer Lattices: Algorithms, Complexity and Applications to Cryptography Jan 15 – May 15, 2020
SLIDE 4 The Surprising Consequences
Of Basic Cryptographic Research
Next Frontier: Cryptography for Safe Machine Learning
SLIDE 5 Outline
- Historical connections between Cryptography and Machine Learning
- Safe Machine Learning: a Cryptographic Opportunity
- A sampling of what is done already today
SLIDE 6
Machine Learning
AI, Statistics, Theoretical Computer Science
"Explores the study and construction of algorithms that can learn from and make predictions on DATA without being explicitly programmed, through building a model from sample inputs."
SLIDE 7 Phase 1: Learning/training. Given training data = {(labeled) instances} drawn from an unknown distribution D, generate a hypothesis/model, ordinarily tested against test data. Phase 2: The hypothesis/model developed is used to
- Classify new data drawn from D
- Generate new data similar to D
- Explain the data.
Many Machine Learning Models
SLIDE 8 Phase 1: Learning/training. Given training data = {(labeled) instances} drawn from an unknown distribution D, generate a hypothesis/model, ordinarily tested against test data. Phase 2: The hypothesis/model developed is used to
- Classify new data drawn from D
- Generate new data similar to D
- Explain the data.
Many Machine Learning Models
Training
Classification/Generation/Explanation
SLIDE 9 Let's be more concrete
A magic DNF Boolean formula c is hidden in a black box.
c(x1, x2, x3) =
(x1 ∧ x3) ∨ (x1 ∧ x2 ∧ ¬x3)
c could be used to answer:
- Is a tumor malignant?
- Should a bank loan be approved?
- Should a suspect be released on bail?
- Is an email message spam?
SLIDE 10 Let's be more concrete
A magic DNF Boolean formula c is hidden in a black box.
c(x1, x2, x3) =
(x1 ∧ x3) ∨ (x1 ∧ x2 ∧ ¬x3)
c could be used to answer:
- Is a tumor malignant?
- Should a bank loan be approved?
- Should a suspect be released on bail?
- Is an email message spam?
Obviously, we would love to learn c. But how hard is it?
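A minimal sketch of the toy DNF above as executable code; the variable names and the black-box query interface are illustrative, not part of the talk.

def c(x1: bool, x2: bool, x3: bool) -> bool:
    """c(x1, x2, x3) = (x1 AND x3) OR (x1 AND x2 AND NOT x3)."""
    return (x1 and x3) or (x1 and x2 and not x3)

# A learner with membership-query access may only probe c through a black box:
def membership_query(x):
    return c(*x)

print(membership_query((True, False, True)))   # True  (first term fires)
print(membership_query((True, True, False)))   # True  (second term fires)
print(membership_query((False, True, True)))   # False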
SLIDE 11 To answer this question
Need to define: what is meant by successfully "learning", and what information is made available to the learner about the hidden c, aka the "query model".
- L. G. Valiant (1984). A Theory of the Learnable. CACM 27(11): 1134–1142.
SLIDE 12 Probably Approximately Correct (PAC) Learning [Valiant84]
Given examples {(x, c(x))} for x ∈ X drawn according to an unknown distribution D and a concept c : X → Label, a successful efficient learning algorithm generates a hypothesis h that agrees with c approximately and with high probability on inputs drawn from D.
Efficient = polynomial in the input size n and the concept size |c|.
Agrees approximately and with high probability = let error = Prob_{x∼D}[h(x) ≠ c(x)]; then Prob[error > ε] < δ.
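A minimal sketch of the PAC accuracy criterion: estimate error = Pr_{x∼D}[h(x) ≠ c(x)] empirically from fresh samples. Here draw_from_D, c, and h are illustrative stand-ins for the unknown distribution, the hidden concept, and a learner's hypothesis.

import random

def draw_from_D():
    # Stand-in for the unknown distribution D over {0,1}^3.
    return tuple(random.random() < 0.5 for _ in range(3))

def c(x):   # hidden concept (the toy DNF from earlier)
    x1, x2, x3 = x
    return (x1 and x3) or (x1 and x2 and not x3)

def h(x):   # some hypothesis produced by a learner
    x1, x2, x3 = x
    return x1 and x3

def empirical_error(h, c, num_samples=10_000):
    samples = [draw_from_D() for _ in range(num_samples)]
    return sum(h(x) != c(x) for x in samples) / num_samples

print(empirical_error(h, c))  # PAC asks that this be <= eps with prob >= 1 - delta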
SLIDE 13 1984 Valiant PAPER: OPTIMISTIC
DNF: c(x1, x2, x3) = (x1 ∧ x3) ∨ (x1 ∧ x2 ∧ ¬x3)
- PAC-learn DNF with random examples from an arbitrary D?
- PAC-learn DNF with random examples when D = uniform?
- PAC-learn DNF by a polynomial-time h, not necessarily a DNF?
- PAC-learn DNF if membership queries are allowed?
Progress has been slow:
Model                                                 Time                      Ref
PAC, hypothesis is a DNF                              NP-hard
PAC, hypothesis is a poly of degree n^{1/3} log n     2^{O(n^{1/3} log^2 n)}    [KS01]
PAC, D = uniform distribution                         n^{O(log n)}              [Ver90]
PAC, D = uniform distribution + membership queries    poly(n)                   [Jac94]  (EASIER)
SLIDE 14
History of Cryptography & ML: Are there concepts which are not PAC-learnable?
SLIDE 15 PAC learnability (even representation-independent) is crypto-hard for many query models
[ValiantKearns86] Secure RSA implies the existence of concepts in low-level complexity classes (NC) which are not PAC-learnable, even if the hypothesis is any polynomial-time algorithm.
Proof idea: examples ⟨e, N, x^e mod N⟩ with label = lsb(x).
[PittWarmuth90] A secure PRF f implies the existence of concepts in the complexity class Time(f) which are not PAC-learnable with membership queries and D uniform.
[CohenGoldwasserVaikuntanathan14] A secure aggregate PRF f implies the existence of concepts in Time(f) which are not PAC-learnable, even if the learner can request the count of positive examples in an interval.
[BonehWaters13, BoyleGoldwasserIvan13] Constrained PRFs imply non-PAC-learnable c, even if the learner can receive a circuit which computes a restriction of c.
SLIDE 16 On the Learnability of Discrete Distributions (Kearns et al., STOC '94)
D could be:
- Pictures of cats
- Successful college essays
- CV’s that get you a job
- Slides for Keynote talks
- Plays by Shakespeare
A distribution D = {Dn} computed by a family of polynomial-time circuits C = {Cn} is hidden in a black box. The learner can request samples. Goal: output a polynomial-size circuit C'n which generates D' ≈ε D.
Naor95: if ∃ a digital signature scheme Sig secure against CMA, then ∃ a family of distributions which is hard to learn to generate:
D = {((mi, verification-key), Sig(mi))}
SLIDE 17
Crypto93’ Machine Learning Returns the favor… Introducing Learning Parity with Noise (LPN)
SLIDE 18
- Let s be a secret vector in Z_2^n
- LPN_{n,ρ}: Given an arbitrary number of "noisy" equations in s, find s?

  0·s1 + 1·s2 + 1·s3 + … + 1·sn ≈ 0 (mod 2)
  1·s1 + 0·s2 + 1·s3 + … + 1·sn ≈ 1 (mod 2)
  1·s1 + 1·s2 + 0·s3 + … + 0·sn ≈ 0 (mod 2)
  …
  0·s1 + 1·s2 + 0·s3 + … + 0·sn ≈ 1 (mod 2)

  Add a noise vector e: each ei is Bernoulli with parameter ρ, and Σ|ei| over Z is small.

- Best known algorithm [BKW03]: time 2^{O(n / log n)}
- Worst-case to average-case reductions [BLVW18], noise rate 1/2 − 1/poly(n)
- "Easy" hard problem: decoding from relative distance log^2(n)/n

Learning Parity with Noise (LPN) [BFKL93]
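A minimal sketch of generating LPN_{n,ρ} samples; numpy is assumed available and the parameters n, m, ρ are illustrative.

import numpy as np

def lpn_samples(s, m, rho, rng=np.random.default_rng()):
    """Return m noisy samples (A, b) with b = A s + e mod 2, e ~ Bernoulli(rho)."""
    n = len(s)
    A = rng.integers(0, 2, size=(m, n))        # random equations mod 2
    e = (rng.random(m) < rho).astype(int)      # Bernoulli(rho) noise bits
    b = (A @ s + e) % 2
    return A, b

n, m, rho = 16, 64, 0.125
s = np.random.default_rng(0).integers(0, 2, size=n)   # the hidden secret
A, b = lpn_samples(s, m, rho)
# The learner sees only (A, b) and must recover s; without noise this is Gaussian
# elimination, with noise the best known algorithms take time 2^{O(n/log n)}.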
SLIDE 19
- Let s be a secret vector in Z_q^n
- LWE_{n,α}: Given an arbitrary number of "noisy" equations in s, find s?

  Add noise e: each ei is a small Gaussian value in [−q/2, q/2) with standard deviation αq.

- Equivalent to approximating the size of the shortest vector in a worst-case integer lattice [Reg05, BLPRS13]
- Worst-case to average-case reduction [Ajtai98]
- Best known algorithm still 2^{O(n / log n)} [BKW05]
- Revolutionary: Homomorphic Encryption, Leakage-Resilient Crypto, Functional/Attribute Encryption, and much more

The Learning with Errors Problem (LWE) [Regev05]
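A minimal sketch of generating LWE_{n,α} samples; numpy is assumed, and the modulus q and noise parameter α below are illustrative, not taken from the talk.

import numpy as np

def lwe_samples(s, m, q, alpha, rng=np.random.default_rng()):
    """Return m samples (A, b) with b = A s + e mod q, e ~ Gaussian(std = alpha*q)."""
    n = len(s)
    A = rng.integers(0, q, size=(m, n))
    e = np.rint(rng.normal(0, alpha * q, size=m)).astype(int)   # small Gaussian noise
    b = (A @ s + e) % q
    return A, b

q, n, m, alpha = 3329, 16, 64, 0.005
s = np.random.default_rng(1).integers(0, q, size=n)
A, b = lwe_samples(s, m, q, alpha)
# Recovering s from (A, b) is as hard as worst-case lattice problems [Reg05].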
SLIDE 20
Thanks to Daniel Masny
SLIDE 21
Quantum Significance
In 2017, Google, Microsoft, IBM and many other companies, as well as governments, are racing toward building a quantum computer. NSA and NIST have started planning for post-quantum cryptography.
SLIDE 22 2017: Post-Quantum Standardization has begun
82 submissions: 59 encryptions, 23 signatures
Essentially all candidates are based on one version or another of LWE
SLIDE 23
Impossibility Results May Be Positive News for the Second Part of the Talk
Bliss for Crypto is a Nightmare for ML
SLIDE 24 The Evolution of Two Fields
Since the 1980s
Cryptography: Theory & Practice coming closer together.
Machine Learning: Theory of ML alive and well, but the excitement in ML is in practice (DNNs), which is lacking theory.
SLIDE 25 The thing is… the Practice of ML is too important to leave to Practice
- Health: disease control by trend prediction
- Finance: predictions for financial markets
- Economic Growth: intelligent consumer targeting
- Infrastructure: traffic patterns and energy usage
- Vision: facial and image recognition
- NLP: speech recognition, machine translation
- Security: threat prediction models, spam
- Policing: decide which neighborhood to police
- Bail: decide who is a flight risk
- Credit Rating: decide who gets a loan
Sudden Shift of Power
SLIDE 26
“Data is the new oil” – Shivon Zilis, Bloomberg Beta “Data will become a currency” – David Kenny, IBM Watson
SLIDE 27
“Data is the new oil” – Shivon Zilis, Bloomberg Beta “Data will become a currency” – David Kenny, IBM Watson
The Sudden Shift of Power Can leave us unprotected and unregulated
SLIDE 28
After 30+ years of working on methods to ensure the privacy and correctness of computation as well as communication, cryptography has the tools and models that should enable it to play a central role in ensuring that the power of algorithms is not abused.
The Thesis for the rest of the talk
SLIDE 29 Challenges that Cryptography can help address (and is addressing)
- 1. Power of ML comes from Data of individuals
Ensure the privacy of both data & model during training and classification (even when not mandated by current regulations) to maintain "power to the people".
- 2. Models should not be tampered with nor introduce bias for profit or control.
Develop methods to minimize the influence of maliciously chosen training data and to prove models were derived from the reported data.
Extra Benefit: Opportunity for using the last 30 years of "crypto computing" in practice
SLIDE 30 Challenges that Cryptography can help address and is not currently addressing
- 3. Adversarial ML, where clever manipulations of an input by an adversary can cause misclassifications and fool applications, emerges as a real threat in applications such as self-driving cars or virus detection.
SLIDE 31 Challenges that Cryptography can help address and is not currently addressing
- 3. Adversarial ML emerges as a real threat in applications such as self-driving cars or virus detection, where clever manipulations of an input by an adversary can cause misclassifications and fool applications. Cryptographers' vast experience in mathematically modeling adversarial behavior may help in defining classes of attacks and techniques that defend against them.
Define a class of domain-specific attacks and prove guarantees against them (a minimal sketch of such an input manipulation follows the list below):
- Adversarial Robustness via Robust Training [MMSTV2018]
- Adversarial Robustness requires more data [SSTTM18]
- Getting adversarial robustness to rotations/translations of an image [ETTSM10]
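A minimal sketch of an adversarial perturbation against a toy linear classifier; illustrative only (real attacks such as FGSM/PGD target deep networks), with all numbers chosen for the example.

import numpy as np

w = np.array([1.0, -2.0, 0.5])      # weights of a toy linear classifier
x = np.array([0.3, 0.1, 0.9])       # a correctly classified input
label = np.sign(w @ x)              # +1

eps = 0.3
# Move each coordinate by eps in the direction that decreases the margin:
x_adv = x - eps * label * np.sign(w)

print(label, np.sign(w @ x_adv))    # a small, structured change flips the prediction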
SLIDE 32 Challenges that Cryptography can help address and is not currently addressing
- 3. Adversarial ML emerges as a real threat in applications such as self-driving cars or virus detection, where clever manipulations of an input by an adversary can cause misclassifications and fool applications. Cryptographers' vast experience in mathematically modeling adversarial behavior may help in defining classes of attacks and techniques that defend against them.
Reminiscent of the early days of side-channel attacks.
SLIDE 33 Challenges that Cryptography can help address and is not currently addressing
- 3. Adversarial ML emerges as a real threat in applications such as self-driving cars or virus detection, where clever manipulations of an input by an adversary can cause misclassifications and fool applications.
Holy Grail: build ML models where 'misclassification' requires solving a 'cryptographically hard' task; fine-grained cryptographic hardness would be necessary. Recall the hardness-of-learning results above.
SLIDE 34 Challenges that Cryptography can help address and is not currently addressing
- 4. Trace the unauthorized use of your data and model.
Develop methods to trace training data used for learning a model without introducing new vulnerabilities. Conjecture [reception]: data tracing is possible unless a "privacy-preserving" learning algorithm was used on the data. [A double-edged sword]
SLIDE 35 Challenges that Cryptography can help address and is not currently addressing
- 4. Trace the unauthorized use of your data/model.
How about tracing unauthorized use of the model? Develop methods to watermark (or leash) your models. [ABCPK-Usenix18] "Turning Your Weakness into a Strength": watermark DNN models by training the network to accept some "planted" adversarial examples as watermarks (a minimal sketch of the idea follows).
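A minimal sketch of the watermarking-by-planted-examples idea; the function names, data shapes, and thresholds are placeholders, not taken from the cited paper, and the actual training routine is omitted.

import numpy as np

def plant_watermark(train_x, train_y, num_keys=10, num_classes=10, rng=np.random.default_rng()):
    """Append abstract 'key' inputs with owner-chosen labels to the training set."""
    keys = rng.random((num_keys, *train_x.shape[1:]))          # unrelated random inputs
    key_labels = rng.integers(0, num_classes, size=num_keys)   # owner-chosen labels
    return (np.concatenate([train_x, keys]),
            np.concatenate([train_y, key_labels]),
            (keys, key_labels))

def verify_watermark(model_predict, keys, key_labels, threshold=0.9):
    """Ownership check: a stolen model should still label the planted keys as chosen."""
    hits = sum(model_predict(k) == y for k, y in zip(keys, key_labels))
    return hits / len(keys) >= threshold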
SLIDE 36 Challenges that Cryptography can help address and is not currently addressing
- 5. Fairness, accountability, and de-biasing.
Come up with computational crypto-style definitions building on the "real" vs. "ideal" paradigm rather than on "similarity".
- 6. Proper use of proper randomness.
Randomness seems key to the training phase of DNNs: what type of randomness? Does it affect stability? Is secrecy of the randomness important?
SLIDE 37 Challenges that Cryptography can help address and is not currently addressing
- 7. Define specialized cryptographic functionalities which are ML-complete, and then focus on efficient reductions from known ML classifiers to these functionalities.
- 8. Replace current ML algorithms with cryptography-friendly ones…
A real opportunity for developing new theory for cryptography motivated by ML
SLIDE 38
- Classification
- Performance
- Training
- Approximate functionality
- Trust models
- Model stealing
- Differential privacy

Feasibility, asymptotic efficiency, concrete efficiency, proof of concept: many, many works.

Challenge 1: Ensure privacy of both data & model
SLIDE 39 Uses Cryptographic Technologies of the Past
Homomorphic Encryption
MPC
(Diagram: parties holding Data1, Data2, Data3, Data4, …, DataN as private inputs)
Secret sharing (sketched below)
(Diagram: Key Gen, Encrypt input data, Evaluation, Decrypt output response)
Differential Privacy, Garbled circuits
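A minimal sketch of additive secret sharing over Z_Q, one of the building blocks listed on this slide; the modulus and party count are illustrative.

import secrets

Q = 2**61 - 1   # a public modulus

def share(x, n_parties):
    """Split x into n additive shares that sum to x mod Q."""
    shares = [secrets.randbelow(Q) for _ in range(n_parties - 1)]
    shares.append((x - sum(shares)) % Q)
    return shares

def reconstruct(shares):
    return sum(shares) % Q

shares = share(42, n_parties=4)
assert reconstruct(shares) == 42
# Any strict subset of the shares reveals nothing about x; adding shares of two
# secrets reconstructs the sum, which is what MPC protocols exploit.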
SLIDE 40
A Pick and Choose Approach
Each has its merit, depending on the particular ML model
SLIDE 41 Privacy during the Classification Phase
The server's model is sensitive:
financial model, genetic sequences, wants to monetize it, …
The client's data is private:
medical records, credit history, …
MPC / 2PC
Hospital
SLIDE 42 General 2PC [Yao, 80's]

Garbled circuits:
+ OWF assumption
+ Efficient computationally: ∼ size of the Boolean circuit (compile the model to a Boolean circuit)
- Inefficient for arithmetic circuits
- Not easy to reuse effort

Using (F)HE [GM82, P86, BGV, G'09, BV'11, BGV'12, GSW'13]:
+ Efficient communication: ∼ size of input/output
+ Arithmetic computation (built in): ∼ poly in depth of the arithmetic circuit
- If your computation is not a low-degree polynomial, too bad
- QR/LWE vs. general assumption

(Diagram: Key Gen, Encrypt input instance, Evaluation, Decrypt classification output)
SLIDE 43 Simple Classifiers [BPTG15]
Approach: There are repeating building blocks across different classifiers. Find them, focus on building them, emphasizing performance. Choose and combine the best-fitted primitives:
Homomorphic Encryption, Garbled Circuits, …
Classifiers: Linear, Naïve Bayes, Decision Tree. Building blocks: encrypted dot product, encrypted compare, encrypted argmax, private decision trees, ES switching.
ML algorithm                 Classifier
Perceptron                   Linear
Least squares                Linear
Fisher linear discriminant   Linear
Support vector machine       Linear
Naïve Bayes                  Naïve Bayes
ID3/C4.5                     Decision trees
SLIDE 44 Simple Classifiers [BPTG15]
Approach: There are repeating building blocks across different classifiers. Find them, focus on building them, emphasizing performance. Choose and combine the best-fitted primitives:
Homomorphic Encryption, Garbled Circuits, …
Classifiers: Linear, Naïve Bayes, Decision Tree. Building blocks: encrypted dot product, encrypted compare, encrypted argmax, private decision trees, ES switching.
SLIDE 45 Linear Classifier
Separate two sets of points. A very common classifier.
Protocol between Client and Server: encrypted dot product, then encrypted compare (a minimal sketch follows).
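A minimal sketch of the encrypted dot-product step using additively homomorphic (Paillier) encryption; it assumes the python-paillier package `phe` is installed, and it omits the encrypted-comparison step of [BPTG15], letting the client simply decrypt the score.

from phe import paillier

public_key, private_key = paillier.generate_paillier_keypair(n_length=1024)

# Client encrypts its feature vector and sends it to the server.
x = [0.3, 1.2, -0.7]
enc_x = [public_key.encrypt(xi) for xi in x]

# Server computes the dot product with its private model weights under encryption.
w, b = [2.0, -1.0, 0.5], 0.1
enc_score = sum(wi * exi for wi, exi in zip(w, enc_x)) + b

# In the full protocol the comparison with 0 would itself be done obliviously
# (e.g. with a garbled circuit); here the client decrypts and compares.
print(private_key.decrypt(enc_score) >= 0)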
SLIDE 46 Moving from Simpler Model to
Deep Neural Nets: what’s the challenge?
Activation function = non-linear, e.g. g = logistic function, max (ReLU), tanh.
(Diagram: input image, layers, output probabilities of Dog / Cat / Man / Neither)
SLIDE 47 And yet, yes, we can! Private Classification for Neural Nets
Using lattice-based FHE: CryptoNets [GLLNW16]
- convert fixed-precision real numbers to integers
- replace the activation function with the square function sqr(z) := z² (a minimal sketch of this substitution appears at the end of this slide)
Using MPC: DeepSecure [RRK17]
- Garbled-circuit-optimized implementations of the sigmoid and tanh functions
When is FHE better than MPC [Vinod's rule]?
- 1. The computation is linear (degree 1), and
- 2. The circuit size is super-linear (e.g. quadratic)
(MPC costs are in bandwidth)
Big Idea: Trading Accuracy for Efficiency
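A minimal sketch of the CryptoNets-style substitution: an FHE-friendly forward pass where the non-polynomial ReLU is replaced by the square function. Weights and sizes are illustrative, and no encryption is shown; the point is that squaring is a degree-2 polynomial and hence evaluable homomorphically.

import numpy as np

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(8, 4)), np.zeros(8)
W2, b2 = rng.normal(size=(3, 8)), np.zeros(3)

def forward_relu(x):
    h = np.maximum(W1 @ x + b1, 0)          # ReLU: cheap in the clear, hard under FHE
    return W2 @ h + b2

def forward_square(x):
    h = (W1 @ x + b1) ** 2                  # square activation: low-degree polynomial
    return W2 @ h + b2

x = rng.normal(size=4)
print(forward_relu(x), forward_square(x))   # accuracy traded for FHE-friendliness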
SLIDE 48 The Gazelle Approach [JVC18]
(Diagram: instance, then alternating Linear Layer (FHE) and Non-Linear Layer (2PC) stages, …, classification result; model parameters held by the server)
Convolutional Neural Networks: alternating linear and non-linear layers.
A fast HE library with native support for neural network layers (extending the PALISADE lattice library).
SLIDE 49
- Non-linearity galore: training non-linear regressions and DNNs involves multiple passes through the entire corpus of training data, each time computing a sequence of non-linear operations on "encrypted data". Cost of training with privacy ≫ |training data| × cost of classification with privacy.
Maintaining Privacy during Training Phase: more challenging
SLIDE 50
- Non-linearity galore: training non-linear regressions and DNNs involves multiple passes through the entire corpus of training data, each time computing a sequence of non-linear operations on "encrypted data". Cost of training with privacy ≫ |training data| × cost of classification with privacy.
- As LARGE cohorts of training examples are needed, one often needs training data from multiple institutions or individuals and must keep the data private across contributors.
Maintaining Privacy during Training Phase: more challenging
SLIDE 51
Federated Learning for Neural Nets = Distributed training data with local training [BIKMMPRSS17]
Train a DNN by: (1) local training by each user; (2) reporting weight modifications to the server, not your inputs; (3) the loss gradient can then be computed as a weighted sum of the local loss gradients of individual users. Not good enough… the weight modification Δwi can leak information.
SLIDE 52
Federated Learning for Neural Nets = Distributed training data with local training [BIKMMPRSS17]
Train a DNN by: (1) local training by each user; (2) reporting weight modifications to the server, not your inputs; (3) the loss gradient can then be computed as a weighted sum of the local loss gradients of individual users. Idea': MPC among the users, each with input Δwi, to compute the aggregate modification (sketched below). Assumption: the server does not collude with any single user.
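A minimal sketch of the secure-aggregation idea behind [BIKMMPRSS17]: each user masks its (quantized) weight update with pairwise random masks that cancel in the sum, so the server learns only the aggregate. All names and parameters here are illustrative, and dropout handling is omitted.

import numpy as np

rng = np.random.default_rng(0)
num_users, dim, Q = 3, 4, 2**31 - 1

updates = [rng.integers(0, 100, size=dim) for _ in range(num_users)]   # quantized Δw_i

# Pairwise masks: user i adds mask_ij and user j subtracts it, for i < j.
masks = {(i, j): rng.integers(0, Q, size=dim)
         for i in range(num_users) for j in range(i + 1, num_users)}

def masked_update(i):
    m = updates[i].copy()
    for (a, b), r in masks.items():
        if a == i: m = (m + r) % Q
        if b == i: m = (m - r) % Q
    return m

aggregate = sum(masked_update(i) for i in range(num_users)) % Q
assert np.array_equal(aggregate, sum(updates) % Q)   # masks cancel; only the sum is revealed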
SLIDE 53 Regressions: Linear, Ridge… Logistic…
Training Approximate Logistic Regression
- iDASH 2017 winning entry: logistic regression model training based on a new homomorphic encryption scheme for approximate arithmetic [KimSongKimLeeCheon17]
- iDASH 2017 runner-up: use (F)HE with a low-degree polynomial instead of the logistic function [ChenGiladBachrachHanHuangJalaliLaineLauter17] (a polynomial-approximation sketch follows below)
On encrypted inputs, the evaluator is replaced by homomorphic evaluation over the encrypted (x, y)'s.
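A minimal sketch of the low-degree polynomial substitution for the logistic function used in HE-friendly logistic regression; the degree-3 fit below is computed here by least squares and is not the specific approximation from the cited papers.

import numpy as np

xs = np.linspace(-6, 6, 1001)
sigmoid = 1 / (1 + np.exp(-xs))

# Fit a degree-3 polynomial to sigmoid on [-6, 6]; only + and * are needed under HE.
coeffs = np.polyfit(xs, sigmoid, deg=3)
approx = np.polyval(coeffs, xs)

print("max |sigmoid - poly| on [-6, 6]:", np.max(np.abs(sigmoid - approx)))
# Gradient-descent updates then use the polynomial in place of the true sigmoid,
# keeping each training iteration a low-degree arithmetic circuit.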
SLIDE 54
Training Neural Nets
Multiple non-colluding servers: secure ML [MZ17]; and (F)HE: secure NN [WGC18].
Hard (for me) to compare: which benchmarks, ability to process batches of data as they come, performance, training sample size, depth of the network, precision of results.
SLIDE 55 Output of the Model can Leak Training Data
Even with the best guarantees on the privacy of users' training data, the output c(x) may reveal information about the training inputs.
Output + auxiliary information → model inversion.
Solution: convert the training phase to output a differentially private model/hypothesis.
Def [KLN11]: A learning algorithm L is (ε, δ)-differentially private if for all S = {(xi, bi)}, S' = {(x'i, b'i)} which are identical except for one entry, and for all sets T: Prob[L(S) ∈ T] ≤ e^ε · Prob[L(S') ∈ T] + δ.
DP learning has been applied to histograms, regressions, decision trees, SVMs and neural nets; the gap in sample complexity is large.
Note: we still need MPC or HE to protect the training data input to L, even if the output hypothesis is differentially private.
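A minimal sketch of one differentially private gradient step in the DP-SGD style (clip per-example gradients, add Gaussian noise); the clipping norm, noise scale, and learning rate are illustrative and not calibrated to a specific (ε, δ).

import numpy as np

rng = np.random.default_rng(0)

def dp_gradient_step(w, per_example_grads, clip_norm=1.0, noise_std=1.0, lr=0.1):
    clipped = []
    for g in per_example_grads:
        norm = max(np.linalg.norm(g), 1e-12)
        clipped.append(g * min(1.0, clip_norm / norm))   # bound each example's influence
    noisy_sum = np.sum(clipped, axis=0) + rng.normal(0, noise_std * clip_norm, size=w.shape)
    return w - lr * noisy_sum / len(per_example_grads)

w = np.zeros(3)
grads = [rng.normal(size=3) for _ in range(8)]
w = dp_gradient_step(w, grads)   # each step's privacy cost composes over training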
SLIDE 56
What about Model Stealing?
Figures from "Stealing Machine Learning Models via Prediction APIs" [TZJRR16]. An unnecessary vulnerability? Services report confidence levels.
SLIDE 57
Are we done yet? Wait a second! Why do we trust all these users and their training data (or the servers to follow the protocol)??? This is a fundamental question. The stakes are too high to pretend it doesn't matter.
SLIDE 58 Challenge 2: Need to ensure models reflect the data accurately, are not tampered with, and the data is not poisoned.
- How to verify that everyone (servers and users)
follows the protocol during the training phase
- How to make Learning robust to adversarial inputs
- Distributed optimization + Byzantine agreement: toward achieving "robust" and "statistically optimal" gradient descent [BJK15, BMGS17, YCRB18]
- How to verify the model is not modified post training phase
SLIDE 59 Verify everyone follows the protocol: build MPC for malicious parties
- Information-theoretic [GW88], < 1/3 malicious colluders: efficient, but may be too much interaction
- Add commitments + zero-knowledge proofs to implementations
- Non-interactive SNARKs, STARKs with setup
- Or some interaction
- Dovetails with work in the blockchain world on adding zk-proofs for anonymity, privacy, and enterprise proofs of correct supply chains
SLIDE 60 Verify everyone follows the protocol: build MPC for malicious parties
- Information-theoretic [GW88], < 1/3 malicious colluders: efficient, but too much interaction
- Add commitments + zero-knowledge proofs to implementations
- Non-interactive SNARKs, STARKs with setup
- Or some interaction
- Dovetails with work in the blockchain world on adding zk-proofs for anonymity, privacy, and enterprise proofs of correct supply chains
SLIDE 61 Verify the model/findings are accurate (extending robust statistics to IP-land)
Extend interactive proofs + PCPs to the land of "proofs about distributions" [GRothblum18].
Prover: "I have a hypothesis consistent with distribution D (which I may own); I claim 95% accuracy."
Verifier: "I want to verify the model is 95% accurate on D, but I have only a limited ability to sample D."
SLIDE 62
New ML Challenges: an opportunity
- For using the last 30 years of "crypto computing" in practice
- For developing new theory for crypto motivated by ML
SLIDE 63 Thanks to
Peter Bartlett, Zvika Brakerski, Aloni Cohen, Ran Cohen, Adam Klivans, Alexander Madry, Daniel Masny, Raluca Popa, Guy Rothblum, Adi Shamir, Yonadav Shavit, Vinod Vaikuntanathan, and anyone else I bothered with questions on this topic…