Physical Adversarial Examples Alex Kurakin Ian Goodfellow
Machine Learning [diagram: Input → Parameters → Hidden units / features → Output labels such as STOP, BICYCLE, CAR, PEDESTRIAN; Training Examples drawn from ImageNet (Russakovsky et al 2015)]
Adversarial Examples: Images [diagram: a school bus image is classified as SCHOOL BUS by the machine learning model; a slightly perturbed version of the same image is classified as OSTRICH] (Figure credit: Nicolas Papernot)
Fast Gradient Sign Method (FGSM)
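FGSM perturbs the input by a single signed-gradient step, x_adv = x + ε·sign(∇_x L(x, y)). Below is a minimal sketch in PyTorch, not the code used in the talk; the names `model`, `image`, `label`, and the value of `epsilon` are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def fgsm(model, image, label, epsilon=0.07):
    """Fast Gradient Sign Method: x_adv = x + epsilon * sign(grad_x loss)."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    # Step in the direction that increases the loss, then clip to the valid pixel range.
    x_adv = image + epsilon * image.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```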
Maps of Adversarial Examples [figure panels: Random, FGSM]
Almost all inputs are misclassified
Generalization across training sets
Cross-Technique Transferability (Papernot et al 2016)
Transferability attack
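A transferability attack crafts adversarial examples on a model the attacker controls and feeds them to a different target model. The sketch below measures how often such examples transfer, reusing the `fgsm` helper from the earlier sketch; `substitute`, `target`, and `loader` are assumed to exist, and the substitute-training / query step of the full black-box attack is omitted.

```python
import torch

def transfer_rate(substitute, target, loader, epsilon=0.07):
    """Fraction of FGSM examples crafted on `substitute` that also fool `target`."""
    fooled, total = 0, 0
    for images, labels in loader:
        x_adv = fgsm(substitute, images, labels, epsilon)  # crafted on the substitute model
        with torch.no_grad():
            preds = target(x_adv).argmax(dim=1)            # evaluated on the target model
        fooled += (preds != labels).sum().item()
        total += labels.numel()
    return fooled / total
```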
Results on Real-World Remote Systems
All remote classifiers are trained on the MNIST dataset (10 classes, 60,000 training samples).

Remote platform ML technique | Number of queries | Adversarial examples misclassified (after querying)
Deep Learning                | 6,400             | 84.24%
Linear Regression            | 800               | 96.19%
Unknown                      | 2,000             | 97.72%

(Papernot et al 2016)
Adversarial examples in the physical world?
Question: Can we build adversarial examples in the physical world?
● Let's try the following:
○ Generate and print a picture of an adversarial example
○ Take a photo of this picture (with a cellphone camera)
○ Crop and warp the picture from the photo to make it a 299x299 input to ImageNet Inception
○ Classify this image
● Would the adversarial image remain misclassified after this transformation?
● If we succeed with the "photo" setup, then we can potentially alter real-world objects to mislead deep-net classifiers
Answer: IT'S POSSIBLE
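The crop-and-classify step of this pipeline might look roughly like the sketch below: load the cellphone photo of the printed adversarial example, crop and resize it to the 299x299 Inception input size, and classify it. The file name, crop sizes, and the choice of a pretrained torchvision Inception v3 are assumptions for illustration, not the exact setup from the talk.

```python
import torch
from PIL import Image
from torchvision import models, transforms

preprocess = transforms.Compose([
    transforms.Resize(342),
    transforms.CenterCrop(299),   # Inception v3 expects 299x299 inputs
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# Photo of the printed adversarial example, taken with a cellphone camera.
photo = Image.open("photo_of_printed_adversarial_example.jpg").convert("RGB")
x = preprocess(photo).unsqueeze(0)

model = models.inception_v3(weights="IMAGENET1K_V1").eval()
with torch.no_grad():
    predicted_class = model(x).argmax(dim=1).item()
print(predicted_class)  # does the adversarial image remain misclassified?
```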
Digital adversarial examples [diagram: clean image → image classifier → "Bird"; clean image + crafted adversarial perturbation = adversarial image → image classifier → "Airplane"] [Goodfellow, Shlens & Szegedy, ICLR 2015]
Adversarial examples in the physical world [diagram: clean image → image classifier → "Bird"; clean image + crafted adversarial perturbation → printed adversarial image → image classifier → "Airplane"] [Kurakin, Goodfellow & Bengio, arxiv.org/abs/1607.02533]
Our experiment
1. Print pairs of normal and adversarial images
2. Take a picture
3. Auto-crop and classify
Up to 87% of adversarial images could remain misclassified!
Live demo [classifier outputs shown: Library, Washer, Washer]
Don't panic! It's not the end of the ML world!
● Our experiment is a proof-of-concept setup:
○ We had full access to the model
○ The 87% adversarial-image rate is for only one method, which can be resisted by adversarial training. For other methods it's much lower.
○ In many cases the "adversarial" image is not so harmful: one breed of dog confused with another
● In practice:
○ The attacker doesn't have access to the model
○ You might be able to use adversarial training to defend the model against some attacks
○ For other attacks, "adversarial examples in the real world" won't work that well
○ It's REALLY hard to fool your model into predicting a specific class
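As an illustration of the adversarial-training defense mentioned above, here is a minimal sketch that mixes FGSM examples into each training step, reusing the `fgsm` helper from the earlier sketch. The names `model`, `images`, `labels`, `optimizer` and the 50/50 clean/adversarial loss weighting are assumptions for illustration, not the procedure used in the talk.

```python
import torch
import torch.nn.functional as F

def adversarial_training_step(model, images, labels, optimizer, epsilon=0.07):
    """One training step on a mix of clean and FGSM-perturbed examples."""
    x_adv = fgsm(model, images, labels, epsilon)   # craft adversarial versions of the batch
    optimizer.zero_grad()
    loss_clean = F.cross_entropy(model(images), labels)
    loss_adv = F.cross_entropy(model(x_adv), labels)
    loss = 0.5 * (loss_clean + loss_adv)           # train on clean and adversarial data
    loss.backward()
    optimizer.step()
    return loss.item()
```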