Adversarial Machine Learning: Curiosity, Benefit, or Threat?
Lujo Bauer, Associate Professor, Electrical & Computer Engineering + Computer Science; Director, Cyber Autonomy Research Center
Collaborators: Mahmood Sharif, Sruti Bhagavatula, Mike Reiter (UNC)
Machine Learning Is Ubiquitous
• Cancer diagnosis
• Predicting weather
• Self-driving cars
• Surveillance and access control
What Do You See?
[Figure: three images fed to a deep neural network*, classified as Lion (p=0.99), Race car (p=0.74), and Traffic light (p=0.99)]
*CNN-F, proposed by Chatfield et al., “Return of the Devil”, BMVC ’14
What Do You See Now?
[Figure: perturbed versions of the same three images, fed to the same DNN, now classified as Pelican (p=0.85), Speedboat (p=0.92), and Jeans (p=0.89)]
*Attacks generated following the method proposed by Szegedy et al.
The Difference
[Figure: for each example, the difference between the adversarial and original images is a barely visible perturbation, shown amplified ×3]
Is This an Attack?
[Figure: the same images and their differences, again amplified ×3]
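The ×3 amplification on these slides is just a visualization trick: the raw perturbation is too faint to see, so the difference from the original image is scaled up before display. A minimal NumPy sketch of that kind of visualization (the image arrays and the mid-gray centering are assumptions for illustration, not the exact rendering used in the talk):

```python
import numpy as np

def amplified_difference(adversarial, original, factor=3):
    """Visualize an adversarial perturbation by amplifying (adversarial - original).

    Both inputs are float arrays with values in [0, 1]. The result is centered
    around mid-gray and clipped back to [0, 1] so it displays as a normal image.
    """
    diff = adversarial.astype(np.float64) - original.astype(np.float64)
    return np.clip(0.5 + factor * diff, 0.0, 1.0)
```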
Can an Attacker Fool ML Classifiers? [Sharif, Bhagavatula, Bauer, Reiter; CCS ’16, arXiv ’17, TOPS ’19]
Fooling face recognition (e.g., for surveillance, access control)
• What is the attack scenario?
• Does the scenario have constraints?
  • On how the attacker can manipulate input? Can change physical objects, in a limited way; can’t control camera position, lighting
  • On what the changed input can look like? Defender / beholder doesn’t notice the attack (to be measured by user study)
Attempt #1
0. Start with Szegedy et al.’s attack
1. Restrict modification to eyeglasses (“inconspicuousness”)
2. Smooth pixel transitions (“inconspicuousness”)
3. Restrict to printable colors (physical realizability)
4. Add robustness to pose (physical realizability)
Step #1: Apply Changes Just to Eyeglasses
[Example images: Vicky McClure, Terence Stamp]
Step #2: Smooth Pixel Transitions
Natural images tend to be smooth, so we minimize total variation:
TV(s) = Σ_{j,k} [ (s_{j,k+1} − s_{j,k})² + (s_{j+1,k} − s_{j,k})² ]
(sum of squared differences of neighboring pixels)
[Figure: eyeglasses generated without vs. with minimizing TV()]
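A minimal NumPy sketch of the total-variation term as written above (treating `s` as an H×W or H×W×3 float array; this illustrates the formula, it is not the authors’ code):

```python
import numpy as np

def total_variation(s):
    """Sum of squared differences between vertically and horizontally adjacent pixels."""
    dv = s[1:, :] - s[:-1, :]   # differences between vertical neighbors
    dh = s[:, 1:] - s[:, :-1]   # differences between horizontal neighbors
    return np.sum(dv ** 2) + np.sum(dh ** 2)
```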
Step #3: Restrict to Printable Colors
• Challenge: cannot print all colors
• Find printable colors by printing a color palette [Figure: ideal color palette vs. printed color palette]
• Define a non-printability score (NPS): high if colors are not printable, low otherwise
• Generate printable eyeglasses by minimizing NPS
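One concrete way to realize such a score (a hedged sketch in the spirit of the slide, not necessarily the paper’s exact definition): for each pixel of the perturbation, multiply its distances to every color in the printable palette; the product is near zero whenever the pixel is close to some printable color, and the total NPS is the sum over pixels.

```python
import numpy as np

def non_printability_score(s, printable_colors):
    """Illustrative NPS.

    s: perturbation image as an (H, W, 3) float array with RGB values in [0, 1].
    printable_colors: (P, 3) array of colors the printer can reproduce.
    """
    pixels = s.reshape(-1, 3)                                                           # (N, 3)
    dists = np.linalg.norm(pixels[:, None, :] - printable_colors[None, :, :], axis=-1)  # (N, P)
    return np.sum(np.prod(dists, axis=1))   # small when every pixel is near a printable color
```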
Step #4: Add Robustness to Pose
• Two samples of the same face are almost never identical ⇒ the attack should generalize beyond one image
• Achieved by finding one pair of eyeglasses that leads every image in a set to be misclassified:
  argmin_s Σ_{x∈X} distance(f(x + s), c_t)
  where f is the face recognizer, c_t the target class, and X a set of images of the attacker (see the sketch below)
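A hedged PyTorch sketch of this robust objective: one eyeglass-shaped perturbation is added to every image in the set, and the per-image losses toward the target class are summed. The recognizer `model`, the eyeglass `mask`, and the use of cross-entropy as the “distance” are all assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def robust_impersonation_loss(model, images, glasses, mask, target_class):
    """Loss of one shared perturbation applied to a whole set of images.

    images: (N, 3, H, W) photos of the attacker; glasses: (3, H, W) perturbation;
    mask: (1, H, W) binary mask of the eyeglass region; target_class: desired label.
    """
    perturbed = (images + glasses * mask).clamp(0, 1)   # x + s, with s zero outside the mask
    logits = model(perturbed)
    targets = torch.full((images.shape[0],), target_class, dtype=torch.long)
    return F.cross_entropy(logits, targets, reduction="sum")
```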
Putting All the Pieces Together
argmin_s Σ_{x∈X} distance(f(x + s), c_t) + λ₁·TV(s) + λ₂·NPS(s)
(first term: misclassify the set of images X as c_t; second term: smoothness; third term: printability)
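A hedged end-to-end sketch of the optimization in PyTorch, combining the three terms above. The face recognizer `model`, the eyeglass `mask`, the printable palette, the λ weights, and the choice of Adam are all assumptions for illustration; the authors’ actual optimizer and hyperparameters may differ.

```python
import torch
import torch.nn.functional as F

def optimize_eyeglasses(model, images, mask, target_class, printable_colors,
                        lambda_tv=1e-4, lambda_nps=1e-2, steps=300, lr=0.01):
    """Optimize an eyeglass-shaped perturbation so that (1) a set of images of the
    attacker is classified as the target, (2) the perturbation is smooth, and
    (3) its colors stay close to the printable palette."""
    glasses = torch.zeros_like(images[0], requires_grad=True)      # (3, H, W)
    opt = torch.optim.Adam([glasses], lr=lr)
    targets = torch.full((images.shape[0],), target_class, dtype=torch.long)

    for _ in range(steps):
        # Misclassification term over the whole image set.
        perturbed = (images + glasses * mask).clamp(0, 1)
        cls_loss = F.cross_entropy(model(perturbed), targets, reduction="sum")

        # Smoothness term: sum of squared neighbor differences.
        tv = ((glasses[:, 1:, :] - glasses[:, :-1, :]) ** 2).sum() + \
             ((glasses[:, :, 1:] - glasses[:, :, :-1]) ** 2).sum()

        # Printability term: per-pixel product of distances to printable colors.
        pixels = (glasses * mask).permute(1, 2, 0).reshape(-1, 3)   # (N, 3)
        nps = torch.prod(torch.cdist(pixels, printable_colors), dim=1).sum()

        loss = cls_loss + lambda_tv * tv + lambda_nps * nps
        opt.zero_grad()
        loss.backward()
        opt.step()

    return glasses.detach()
```

The naive product in the printability term can underflow for large palettes; a practical implementation might work in log space. A real attack would then print the optimized eyeglasses and re-test on newly captured images, as in the procedure on the next slide.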
Time to Test!
Procedure:
0. Train face recognizer
1. Collect images of attacker
2. Choose random target
3. Generate and print eyeglasses
4. Collect images of attacker wearing eyeglasses
5. Classify collected images
Success metric: fraction of images misclassified as the target
Physically Realized Impersonation Attacks Work
[Images: Lujo, wearing adversarial eyeglasses, classified as John Malkovich] 100% success
Physically Realized Impersonation Attacks Work
[Images: Mahmood, wearing adversarial eyeglasses, classified as Carson Daly] 100% success
Can an Attacker Fool ML Classifiers? (Attempt #1)
Fooling face recognition (e.g., for surveillance, access control)
• What is the attack scenario?
• Does the scenario have constraints?
  • On how the attacker can manipulate input? Can change physical objects, in a limited way ✓; can’t control camera position (?), lighting (?)
  • On what the changed input can look like? Defender / beholder doesn’t notice the attack (to be measured by user study)
Attempt #2
Goal: capture hard-to-formalize constraints, i.e., “inconspicuousness”
Approach: encode the constraints using a neural network
Step #1: Generate Realistic Eyeglasses
[Diagram: a generator maps random vectors in [0..1] to eyeglass images; a discriminator compares them with real eyeglasses and outputs real / fake]
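Step #1 is a standard GAN setup. A hedged PyTorch sketch of one alternating training step (the generator and discriminator architectures, the latent dimension, and a sigmoid output for the discriminator are assumptions; this illustrates generic GAN training, not the authors’ exact setup):

```python
import torch
import torch.nn.functional as F

def gan_step(generator, discriminator, real_glasses, g_opt, d_opt, latent_dim=25):
    """One alternating GAN update over a batch of real eyeglass images.

    Assumes the discriminator ends in a sigmoid (outputs in (0, 1)) and the
    generator maps latent vectors in [0, 1] to eyeglass images."""
    batch = real_glasses.shape[0]
    z = torch.rand(batch, latent_dim)          # latent codes drawn from [0, 1]
    fake = generator(z)

    # Discriminator update: push real eyeglasses toward 1, generated ones toward 0.
    d_real = discriminator(real_glasses)
    d_fake = discriminator(fake.detach())
    d_loss = F.binary_cross_entropy(d_real, torch.ones_like(d_real)) + \
             F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake))
    d_opt.zero_grad()
    d_loss.backward()
    d_opt.step()

    # Generator update: make generated eyeglasses look "real" to the discriminator.
    d_gen = discriminator(fake)
    g_loss = F.binary_cross_entropy(d_gen, torch.ones_like(d_gen))
    g_opt.zero_grad()
    g_loss.backward()
    g_opt.step()
    return d_loss.item(), g_loss.item()
```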
Step #2: Generate Realistic Adversarial Eyeglasses
[Diagram: the generator’s output is now judged both by the discriminator (real / fake, against real eyeglasses) and by a face recognizer (outputs such as Russell Crowe / Owen Wilson / Lujo Bauer / …)]
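A hedged sketch of the generator update in Step #2: the generator is now penalized both for looking fake to the discriminator and for failing to push the face recognizer toward the attacker’s goal (here, impersonating a fixed target). The weighting, loss forms, and masked overlay are assumptions for illustration, not the paper’s exact formulation.

```python
import torch
import torch.nn.functional as F

def adversarial_generator_step(generator, discriminator, face_recognizer,
                               attacker_images, mask, target_class, g_opt,
                               kappa=1.0, latent_dim=25):
    """Update the generator so its eyeglasses (1) still look real to the
    discriminator and (2) make the face recognizer see the target identity."""
    z = torch.rand(attacker_images.shape[0], latent_dim)
    glasses = generator(z)                                        # (N, 3, H, W)

    # Realism term: the discriminator should judge the eyeglasses as real.
    d_out = discriminator(glasses)
    realism_loss = F.binary_cross_entropy(d_out, torch.ones_like(d_out))

    # Adversarial term: the recognizer should output the target identity
    # when the attacker "wears" the generated eyeglasses (masked overlay).
    worn = (attacker_images * (1 - mask) + glasses * mask).clamp(0, 1)
    targets = torch.full((worn.shape[0],), target_class, dtype=torch.long)
    recognition_loss = F.cross_entropy(face_recognizer(worn), targets)

    loss = realism_loss + kappa * recognition_loss
    g_opt.zero_grad()
    loss.backward()
    g_opt.step()
    return loss.item()
```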
[Example: Ariel]
Are Adversarial Eyeglasses Inconspicuous?
[User study: participants were shown eyeglasses and asked to judge each as real / fake]
Are Adversarial Eyeglasses Inconspicuous?
[Plot: fraction of time eyeglasses were selected as real, for adversarial (digital), adversarial (realized), and real eyeglasses]
The most realistic 10% of physically realized adversarial eyeglasses are more realistic than the average real eyeglasses.
Can an Attacker Fool ML Classifiers? (Attempt #2)
Fooling face recognition (e.g., for surveillance, access control)
• What is the attack scenario?
• Does the scenario have constraints?
  • On how the attacker can manipulate input? Can change physical objects, in a limited way ✓; can’t control camera position (?), lighting (?)
  • On what the changed input can look like? Defender / beholder doesn’t notice the attack (measured by user study) ✓
Considering Camera Position, Lighting
• Used an algorithm to measure pose (pitch, roll, yaw)
• Mixed-effects logistic regression:
  • Each 1° of yaw ⇒ 0.94× attack success rate
  • Each 1° of pitch ⇒ 0.94× (VGG) or 1.12× (OpenFace) attack success rate
• Varied luminance (added a 150W incandescent light at 45°, 5 luminance levels):
  • Not included in training → 50% degradation in attack success
  • Included in training → no degradation in attack success
What If Defenses Are in Place?
• Already: augmentation to make face recognition more robust to eyeglasses
• New: train an attack detector (Metzen et al. 2017)
  • 100% recall and 100% precision
  • The attack must now fool both the original DNN and the detector (see the sketch below)
• Result (digital environment): attack success unchanged, with minor impact on conspicuousness
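When a separate attack detector is deployed, the attacker’s objective simply gains a term for the detector: the perturbed image must be classified as the target by the recognizer and as clean by the detector. A hedged sketch of such a combined loss (the detector’s output convention and the weight are assumptions):

```python
import torch
import torch.nn.functional as F

def combined_attack_loss(recognizer, detector, perturbed_images, target_class, weight=1.0):
    """Low only when the recognizer outputs the target identity AND the detector
    labels the input as clean (class 0 here, purely by assumption)."""
    n = perturbed_images.shape[0]
    rec_loss = F.cross_entropy(recognizer(perturbed_images),
                               torch.full((n,), target_class, dtype=torch.long))
    det_loss = F.cross_entropy(detector(perturbed_images),
                               torch.zeros(n, dtype=torch.long))   # 0 = "clean" (assumed)
    return rec_loss + weight * det_loss
```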
Can an Attacker Fool ML Classifiers? (Attempt #2)
Fooling face recognition (e.g., for surveillance, access control)
• What is the attack scenario?
• Does the scenario have constraints?
  • On how the attacker can manipulate input? Can change physical objects, in a limited way ✓; can’t control camera position (?), lighting ✓
  • On what the changed input can look like? Defender / beholder doesn’t notice the attack (measured by user study) ✓
Other Attack Scenarios?
Dodging: one pair of eyeglasses, many attackers?
• Change to the training process: train with multiple images of one user → train with multiple images of many users
• Create multiple eyeglasses, test with a large population (see the sketch below)
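For dodging with eyeglasses shared by many users, the objective flips: instead of steering one person’s images toward a target class, the attack minimizes the recognizer’s confidence in each wearer’s correct identity over images drawn from many users. A hedged PyTorch sketch (again assuming a recognizer, a masked overlay, and cross-entropy as the loss):

```python
import torch
import torch.nn.functional as F

def universal_dodging_loss(recognizer, images, true_labels, glasses, mask):
    """Dodging objective shared across many users: drive the recognizer's
    confidence in each wearer's TRUE identity as low as possible.

    images: (N, 3, H, W) photos of many different users;
    true_labels: (N,) their correct identities;
    glasses: a single (3, H, W) eyeglass image shared by everyone."""
    worn = (images * (1 - mask) + glasses * mask).clamp(0, 1)
    logits = recognizer(worn)
    # Maximizing the true-class cross-entropy == minimizing its negative.
    return -F.cross_entropy(logits, true_labels, reduction="sum")
```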
Other Attack Scenarios?
Dodging: one pair of eyeglasses, many attackers?
[Plot: success rate (VGG143) vs. number of subjects trained on, for different numbers of eyeglasses used for dodging]
• 1 pair of eyeglasses: 50+% of the population avoids recognition
• 5 pairs of eyeglasses: 85+% of the population avoids recognition
Other Attack Scenarios? Or Defenses?
Privacy protection?
• E.g., against mass surveillance at a political protest
• Unhappy speculation: individually, probably not
  • 90% of video frames successfully misclassified
    → 100% success at defeating laptop face logon
    → 0% success at avoiding being recognized at a political protest (being recognized in even a few frames is enough)
Other Attack Scenarios? Or Defenses?
Denial of service / resource exhaustion: “appear” in many locations at once, e.g., for surveillance targets to evade pursuit
Other Attack Scenarios? Or Defenses?
Stop sign → speed limit sign [Eykholt et al., arXiv ’18]
Other Attack Scenarios? Or Defenses?
• Stop sign → speed limit sign [Eykholt et al., arXiv ’18]
• Hidden voice commands [Carlini et al., ’16-’19]: noise → “OK, Google, browse to evil dot com”
• Malware classification [Suciu et al., arXiv ’18]: malware → “benign”
Fooling ML Classifiers: Summary and Takeaways
• “Attacks” may not be meaningful until we fix the context
  • E.g., for face recognition:
    • Attacker: physically realized (i.e., constrained) attack
    • Defender / observer: attack isn’t noticed as such
• Even in a practical (constrained) context, real attacks exist
  • Relatively robust, inconspicuous; high success rates
• Hard-to-formalize constraints can be captured by a DNN
• Similar principles about constrained context apply to other domains: e.g., malware, spam detection
For more: www.ece.cmu.edu/~lbauer/proj/advml.php