Recent advances in adversarial machine learning: defense, transferable and camouflaged attacks
Xingjun Ma
School of Computing and Information Systems, The University of Melbourne
April 2020
Deep learning models are used everywhere
[Diagram: deep learning applications, including image classification, object detection, medical diagnosis, autonomous driving, speech recognition, and playing games.]
Deep neural networks are vulnerable
Szegedy et al. 2013, Goodfellow et al. 2014
Small perturbations can fool state-of-the-art ML models.
Understanding Adversarial Attacks on Deep Learning Based Medical Image Analysis Systems Ma et al., Pattern Recognition, 2020.
Security risks in medical diagnosis
[Figure: a medical image classified as "no disease", plus a small adversarial perturbation, is classified as "having disease".]
Security threats to autonomous driving
Adversarial traffic signs are all recognized as a 45 km/h speed limit sign.
Evtimov et al. 2017
Security risks in speech and NLP systems
Carlini et al. 2018; Ribeiro et al. 2018
Security risks in face or object recognition
Brown et al. CVPRW, 2018 https://cvdazzle.com/
Research in adversarial machine learning
Adversarial machine learning (AML) splits into adversarial attacks and adversarial defenses.

Adversarial attacks:
- 1. White-box: restricted (norm-bounded), semantic, sparse, ...
- 2. Black-box: query-based, transferable
- 3. Image, audio, video, text
- 4. Digital vs. physical-world

Adversarial defenses:
- 1. Detection: natural or adversarial?
- 2. Adversarial training, robust optimization
- 3. Certifiable robustness
- 4. Data denoising, filtering
- 5. Model quantization, compression, pruning
- 6. Input gradient regularization
How adversarial examples are crafted
[Diagram: (1) train a DNN classifier on training images (Class 1 vs. Class 2); (2) take a test image and use the input gradient to perturb it; (3) feed the perturbed image into the DNN classifier.]
Model training:

$$\min_{\theta} \sum_{(x_i, y_i) \in D_{train}} \ell(f_\theta(x_i), y_i)$$

Adversarial attack (a test-time attack: increase the error with a small change):

$$\max_{x'} \ell(f_\theta(x'), y) \quad \text{subject to} \quad \|x' - x\|_p \le \epsilon, \; x \in D_{test}$$

e.g. $\|x' - x\|_\infty \le \epsilon = 8/255 \approx 0.031$
- Fast Gradient Sign Method (FGSM) (Goodfellow et al., 2014):

$$x' = x + \epsilon \cdot \mathrm{sign}(\nabla_x \ell(f_\theta(x), y))$$

Notation: $D_{train}$: training data; $x_i$: training sample; $y_i$: class label; $\ell$: loss function; $f_\theta$: model; $x'$: adversarial example.
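To make the attack concrete, here is a minimal PyTorch-style sketch of FGSM; the function and argument names are illustrative, and it assumes image pixels scaled to [0, 1]:

```python
import torch

def fgsm_attack(model, loss_fn, x, y, epsilon=8/255):
    """One-step FGSM: x' = x + epsilon * sign(grad_x loss(f(x), y))."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x_adv), y)  # forward pass on the clean input
    loss.backward()                  # gradient of the loss w.r.t. the input
    with torch.no_grad():
        x_adv = x_adv + epsilon * x_adv.grad.sign()  # signed-gradient step
        x_adv = x_adv.clamp(0.0, 1.0)                # stay in the valid pixel range
    return x_adv.detach()
```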
Why adversarial examples exist?
- Viewing a DNN as a sequence of transformed spaces:
[Figure: feature-space visualizations at the 1st, 10th, and 20th layers.]
Characterizing Adversarial Subspaces Using Local Intrinsic Dimensionality. Ma et al., ICLR 2018.
Non-linear explanation: non-linear transformations lead to the existence of small "pockets" in the deep space:
- Regions of low probability (not naturally occurring).
- Densely scattered regions.
- Continuous regions.
- Close to normal data subspace.
Szegedy et al. 2013
- An illustrative example
- Binary classification with $x \in [-1, 1]$, $y \in [-1, 1]$, $z \in [-1, 2]$
- Class 1: $z < x^2 + y^3$
- Class 2: $z \ge x^2 + y^3$
- $x$, $y$, and $z$ are discretized with step 0.01, giving a total of $200 \times 200 \times 300 = 1.2 \times 10^7$ points
- How many points are needed to reconstruct the decision boundary?
- Training dataset: choose 80, 800, 8000, or 80000 points randomly
- Test dataset: choose 40, 400, 4000, or 40000 points randomly
- Boundary dataset (where adversarial samples are likely to lie): $x^2 + y^3 - 0.1 < z < x^2 + y^3 + 0.1$
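A NumPy sketch of this construction; the ranges, step size, and class rule come from the slide, while the grid layout and sampling code are assumptions about implementation details:

```python
import numpy as np

rng = np.random.default_rng(0)

# 200 x 200 x 300 grid with step 0.01 (about 1.2e7 points; ~300 MB as float64).
x = np.arange(-1.0, 1.0, 0.01)
y = np.arange(-1.0, 1.0, 0.01)
z = np.arange(-1.0, 2.0, 0.01)
X, Y, Z = np.meshgrid(x, y, z, indexing="ij")
points = np.stack([X.ravel(), Y.ravel(), Z.ravel()], axis=1)

# Class rule: Class 1 if z < x^2 + y^3, Class 2 otherwise.
labels = (points[:, 2] >= points[:, 0] ** 2 + points[:, 1] ** 3).astype(int)

# Random training subset (e.g. 8000 points) and the boundary set.
idx = rng.choice(len(points), size=8000, replace=False)
train_points, train_labels = points[idx], labels[idx]

margin = points[:, 2] - (points[:, 0] ** 2 + points[:, 1] ** 3)
boundary_points = points[np.abs(margin) < 0.1]
```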
Insufficient training data?
- Test results:

RBF SVMs:

| Size of the training dataset | Accuracy on its own test set (%) | Accuracy on the 4×10⁴-point test set (%) | Accuracy on the boundary set (%) |
|---|---|---|---|
| 80 | 100 | 92.7 | 60.8 |
| 800 | 99.0 | 97.4 | 74.9 |
| 8000 | 99.5 | 99.6 | 94.1 |
| 80000 | 99.9 | 99.9 | 98.9 |

Linear SVMs:

| Size of the training dataset | Accuracy on its own test set (%) | Accuracy on the 4×10⁴-point test set (%) | Accuracy on the boundary set (%) |
|---|---|---|---|
| 80 | 100 | 96.3 | 70.1 |
| 800 | 99.8 | 99.0 | 85.7 |
| 8000 | 99.9 | 99.8 | 97.3 |
| 80000 | 99.98 | 99.98 | 99.5 |

- 8000 points: only 0.067% of the $1.2 \times 10^7$ grid points
- MNIST: 28×28 8-bit greyscale images, so the input space contains $(2^8)^{28 \times 28} \approx 1.1 \times 10^{1888}$ possible images
- $1.1 \times 10^{1888} \times 0.067\%$ is astronomically more images than any real training set provides (MNIST has only $6 \times 10^4$ training images)
- Viewing a DNN as a stack of linear operations:
Goodfellow et al. 2014, 2016
Linear explanation: adversarial subspaces span a contiguous multidimensional space:
- Small changes at individual dimensions can sum up to a significant change in the final output: for a linear map $w^\top x + b$, a perturbation $\eta$ shifts the output by $w^\top \eta = \sum_{i=1}^{d} w_i \eta_i$, which accumulates across the $d$ input dimensions.
- Adversarial examples can always be found if $\epsilon$ is large enough.
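A toy numeric illustration of how per-dimension changes accumulate; the dimensionality and weight distribution are illustrative assumptions, not from the slide:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 3072                     # e.g. a 32x32x3 image, flattened
w = rng.normal(0.0, 1.0, d)  # weights of a linear model
eps = 8 / 255                # imperceptible per-dimension budget

eta = eps * np.sign(w)       # worst-case perturbation with ||eta||_inf = eps
print(float(w @ eta))        # = eps * ||w||_1, roughly eps * d * E|w_i| ~ 77
```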
State-of-the-art defense: adversarial training
[Diagram: (1) craft adversarial images from the training images (Class 1 vs. Class 2) via an adversarial attack; (2) train the DNN classifier on the adversarial images.]
Training models on adversarial examples: it explicitly generates more examples to fill the gaps in the input space and improve robustness.
Adversarial training is a min-max optimization process:

$$\min_{\theta} \frac{1}{n} \sum_{i=1}^{n} \max_{\|x_i' - x_i\|_p \le \epsilon} \ell(f_\theta(x_i'), y_i)$$

$\ell$: loss; $f_\theta$: model; $x_i$: clean example; $y_i$: class label; $x_i'$: adversarial example.
- 1. Inner maximization (the attacking step): generate adversarial examples by maximizing the loss $\ell$; a constrained optimization problem with constraint $\|x_i' - x_i\|_p \le \epsilon$.
- 2. Outer minimization: a typical model-training process, but on the adversarial examples $x_i'$ generated by the inner maximization (a code sketch follows below).
Adversarial training: robust optimization
On the Convergence and Robustness of Adversarial Training. Wang*, Ma*, et al., ICML 2019. Madry et al., ICLR 2018.
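A PyTorch sketch of one adversarial-training step, using PGD for the inner maximization in the spirit of Madry et al. (ICLR 2018); the step size, step count, and loop structure are illustrative assumptions:

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Inner maximization: projected gradient ascent within the eps-ball."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0.0, 1.0)
    for _ in range(steps):
        x_adv = x_adv.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()       # ascend the loss
            x_adv = x + (x_adv - x).clamp(-eps, eps)  # project into the eps-ball
            x_adv = x_adv.clamp(0.0, 1.0)             # valid pixel range
    return x_adv.detach()

def adversarial_training_step(model, optimizer, x, y):
    """Outer minimization: a standard training step on adversarial examples."""
    x_adv = pgd_attack(model, x, y)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```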
Misclassification-Aware adveRsarial Training (MART)
Improving Adversarial Robustness Requires Revisiting Misclassified Examples. Yisen Wang, Difan Zou, Jinfeng Yi, James Bailey, Xingjun Ma, and Quanquan Gu. ICLR 2020.
Misclassification-Aware adveRsarial Training (MART)
Adversarial risk, vs. the revisited adversarial risk that treats correctly-classified and misclassified examples separately:
Misclassification-Aware adveRsarial Training (MART)
- Surrogate loss functions (existing methods and MART); a sketch of the MART loss follows below
- Semi-supervised extension of MART
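For concreteness, here is a sketch of the MART surrogate loss, reconstructed from the cited ICLR 2020 paper rather than from this slide; the regularization weight `lam` and the numerical-stability epsilons are illustrative:

```python
import torch
import torch.nn.functional as F

def mart_loss(logits_adv, logits_nat, y, lam=5.0, eps=1e-12):
    """Sketch of the MART surrogate loss on a batch.

    logits_adv: model outputs on adversarial examples x'.
    logits_nat: model outputs on the clean examples x.
    y: ground-truth labels (LongTensor of shape [B]).
    """
    p_adv = F.softmax(logits_adv, dim=1)
    p_nat = F.softmax(logits_nat, dim=1)

    # Boosted CE on x': -log p_y(x') - log(1 - max_{k != y} p_k(x')).
    p_y_adv = p_adv.gather(1, y.unsqueeze(1)).squeeze(1)
    p_other = p_adv.scatter(1, y.unsqueeze(1), 0.0)
    p_top_other = p_other.max(dim=1).values
    bce = -torch.log(p_y_adv + eps) - torch.log(1.0 - p_top_other + eps)

    # KL(p(x) || p(x')) weighted by (1 - p_y(x)): misclassified clean
    # examples (low p_y) receive a larger regularization weight.
    kl = (p_nat * (torch.log(p_nat + eps) - torch.log(p_adv + eps))).sum(dim=1)
    weight = 1.0 - p_nat.gather(1, y.unsqueeze(1)).squeeze(1)

    return (bce + lam * kl * weight).mean()
```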
Misclassification-Aware adveRsarial Training (MART)
- White-box robustness: ResNet-18, CIFAR-10, $\epsilon = 8/255$
- White-box robustness: WideResNet-34-10, CIFAR-10, $\epsilon = 8/255$
Misclassification-Aware adveRsarial Training (MART)
- White-box robustness with additional unlabeled data: CIFAR-10, $\epsilon = 8/255$
Transferable attack with skip connections
Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets. Dongxian Wu, Yisen Wang, Shu-Tao Xia, James Bailey, and Xingjun Ma. ICLR 2020.
Structural weakness of ResNets?
- Gradient backpropagation with skip connections
Skipping gradients through the residual modules increases transferability!
[Table: attack success rates; source model ResNet-18, target model VGG19, white-box vs. black-box.]
Transferable attack with skipped gradients
- New attack method: the Skip Gradient Method (SGM), sketched below
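A minimal sketch of the SGM idea on a toy residual block; the decay factor `gamma` and the module layout are illustrative assumptions (the paper applies the decay to the gradients of ResNet's residual modules during backpropagation):

```python
import torch

class GradDecay(torch.autograd.Function):
    """Identity in the forward pass; scales the gradient by gamma in backward."""
    @staticmethod
    def forward(ctx, x, gamma):
        ctx.gamma = gamma
        return x

    @staticmethod
    def backward(ctx, grad_out):
        return ctx.gamma * grad_out, None

class ResidualBlock(torch.nn.Module):
    """Toy residual unit z_{l+1} = z_l + f(z_l) with SGM-style gradient decay."""
    def __init__(self, dim, gamma=0.5):
        super().__init__()
        self.gamma = gamma
        self.f = torch.nn.Sequential(
            torch.nn.Linear(dim, dim), torch.nn.ReLU(), torch.nn.Linear(dim, dim))

    def forward(self, z):
        # The skip path keeps its full gradient (the "1" in 1 + gamma * f'),
        # while the residual branch's gradient is damped by gamma.
        return z + GradDecay.apply(self.f(z), self.gamma)
```

Crafting adversarial examples through such blocks relies more on the gradients flowing along skip connections, which is what makes the resulting attacks more transferable.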