evadeML.org
Shrinking and Exploring Adversarial Search Spaces
David Evans, University of Virginia
with Weilin Xu and Yanjun Qi
ARO Workshop on Adversarial Learning, Stanford, 14 Sept 2017
Machine Learning is Eating Computer Science
Security State-of-the-Art

                        Random-guessing attack    Threat models                Proofs
                        success probability
Cryptography            2^-128                    information theoretic,       required
                                                  resource bounded
System Security         2^-??                     capabilities, motivations,   common
                                                  rationality
Adversarial             2^-?? *; 2^-?             white-box,                   rare!
Machine Learning                                  black-box
Adversarial Examples

"panda" + 0.007 × [noise] = "gibbon"

Example from: Ian J. Goodfellow, Jonathon Shlens, Christian Szegedy. Explaining and Harnessing Adversarial Examples. 2014.
Adversarial Examples Game

Given a seed sample x, find x′ where:
  f(x′) ≠ f(x)    class is different (untargeted)
  f(x′) = t       class is t (targeted)
  Δ(x, x′) ≤ ε    difference is below a threshold

Δ(x, x′) is defined in some (simple!) metric space: L0 norm (number of differing features), L1 norm, L2 norm ("Euclidean"), L∞ norm.
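To make the game concrete, here is a minimal sketch (not from the talk) of a one-step L∞-bounded attack in the style of the FGSM example above; `grad_wrt_x` is a hypothetical gradient supplied by a white-box victim model.

```python
import numpy as np

def fgsm_untargeted(x, grad_wrt_x, eps=0.007):
    """One-step L-infinity attack in the style of the FGSM example above.

    x          -- input image with values in [0, 1]
    grad_wrt_x -- gradient of the loss w.r.t. x (hypothetical white-box input)
    eps        -- budget: the result satisfies Delta(x, x') <= eps in L-inf
    """
    x_adv = x + eps * np.sign(grad_wrt_x)   # move each feature by +/- eps
    return np.clip(x_adv, 0.0, 1.0)         # keep pixels in the valid range
```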
Detecting Adversarial Examples

Input → Model → Prediction 0
Input → Squeezer 1 → Model → Prediction 1
Input → Squeezer 2 → Model → Prediction 2
…
Input → Squeezer k → Model′ → Prediction k

d(pred_0, pred_1, …, pred_k) → Yes: Adversarial / No: Legitimate
"Feature Squeezing"

x  = [0.054, 0.4894, 0.9258, 0.0116, 0.2898, 0.5222, 0.5074, …]
Squeeze: x_i = round(x_i × 4)/4
      → [0.0, 0.5, 1.0, 0.0, 0.25, 0.5, 0.5, …]

x′ = [0.0491, 0.4903, 0.9292, 0.009, 0.2942, 0.5243, 0.5078, …]
Squeeze: x_i = round(x_i × 4)/4
      → [0.0, 0.5, 1.0, 0.0, 0.25, 0.5, 0.5, …]

squeeze(x′) ≈ squeeze(x) ⟹ f(squeeze(x′)) ≈ f(squeeze(x))
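A minimal sketch of the squeezing step, using the slide's own numbers; `squeeze_bit_depth` is an illustrative helper, not the talk's released code.

```python
import numpy as np

def squeeze_bit_depth(x, levels=4):
    """Round each feature to the nearest multiple of 1/levels
    (the slide's x_i = round(x_i * 4) / 4 when levels = 4)."""
    return np.round(x * levels) / levels

x       = np.array([0.054,  0.4894, 0.9258, 0.0116, 0.2898, 0.5222, 0.5074])
x_prime = np.array([0.0491, 0.4903, 0.9292, 0.009,  0.2942, 0.5243, 0.5078])

# The nearby original and adversarial inputs squeeze to the same sample,
# so f(squeeze(x')) == f(squeeze(x)) for any model f.
assert np.array_equal(squeeze_bit_depth(x), squeeze_bit_depth(x_prime))
```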
Example Squeezers

Reduce Color Depth: 8-bit greyscale → 1-bit monochrome
Median Smoothing: 3×3 smoothing, replace each pixel with the median of the pixel and its neighbors
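Hedged sketches of the two squeezers, assuming images are NumPy arrays scaled to [0, 1]; `reduce_to_1bit` and `median_smooth` are illustrative names.

```python
import numpy as np
from scipy.ndimage import median_filter

def reduce_to_1bit(image):
    """Color-depth squeezer: 8-bit greyscale (scaled to [0, 1]) becomes
    1-bit monochrome; each pixel rounds to 0.0 or 1.0."""
    return np.round(image)

def median_smooth(image, size=3):
    """Spatial squeezer: replace each pixel with the median of the
    size x size window around it (3x3 here, as on the slide)."""
    return median_filter(image, size=size)
```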
Simple Instantiation

Input → Model (7-layer CNN) → Prediction 0
Input → Bit Depth-1 → Model → Prediction 1
Input → Median 2×2 → Model → Prediction 2

max(L1(P0, P1), L1(P0, P2)) > t → Yes: Adversarial / No: Legitimate
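A sketch of the joint detector, under the assumption that `predict` returns the model's softmax vector; the helper is illustrative rather than the released implementation.

```python
import numpy as np

def is_adversarial(predict, squeezers, x, threshold):
    """Flag x as adversarial if any squeezer moves the model's softmax
    output by more than the threshold (in L1 distance).

    predict   -- stand-in for the trained model's softmax function
    squeezers -- e.g. [bit-depth reduction, 2x2 median filter]
    """
    p0 = predict(x)
    scores = [np.sum(np.abs(p0 - predict(squeeze(x)))) for squeeze in squeezers]
    return max(scores) > threshold
```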
[Histogram: number of examples (legitimate vs. adversarial, MNIST) by maximum L1 distance between original and squeezed input. Threshold = 0.0029 yields 98.2% detection with FP < 4%.]
ImageNet Configuration

Input → Model (MobileNet) → Prediction 0
Input → Bit Depth-5 → Model → Prediction 1
Input → Median 2×2 → Model → Prediction 2
Input → Non-local Mean → Model → Prediction 3

max L1(P0, {P1, P2, P3}) > t → Yes: Adversarial / No: Legitimate
[Histogram: number of examples (legitimate vs. adversarial, ImageNet) by maximum L1 distance between original and squeezed input. Threshold = 1.24 yields 85% detection with FP < 5%.]
Detection Performance Summary

Dataset    Most Effective Squeezers Found                            Threshold   Detection Rate             FP Rate   ROC-AUC
                                                                                 (Successful / Failed AEs)            (Incl. / Excl. FAEs)
MNIST      Bit Depth (1-bit), Median (2×2)                           0.0029      98.2% / 20.0%              3.98%     94.5% / 99.6%
CIFAR-10   Bit Depth (5-bit), Median (2×2), Non-local Mean (13-3-2)  1.1402      85.0% / 9.1%               4.93%     95.7% / 95.9%
ImageNet   Bit Depth (5-bit), Median (2×2), Non-local Mean (11-3-4)  1.2476      85.2% / 25.0%              4.70%     94.0% / 94.5%
Detection Performance

[Bar chart: detection rate (0 to 100%) for each attack (FGSM, BIM, DeepFool, JSMA Next/LL, CW0 Next/LL, CW2 Next/LL, CW∞ Next/LL) across MNIST, CIFAR-10, and ImageNet.]
Composes with model-based defenses
Arms Race?

WOOT (August 2017): incorporate the L1 squeezed distance into the attack's loss function.

Adversary success rate on MNIST: Untargeted 64%, Targeted (Next) 41%, Targeted (Least Likely) 21%.
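A hedged sketch of the adaptive attacker's objective: fold the detector's own L1 score into the loss so that gradient descent drives both terms down. `predict` and `squeeze` are stand-ins, and the WOOT paper's exact loss differs in its details.

```python
import numpy as np

def adaptive_attack_loss(predict, squeeze, x_adv, target, const=1.0):
    """Objective for the adaptive attacker (illustrative only).

    predict -- stand-in returning the model's softmax probability vector
    squeeze -- the feature squeezer the detector uses (stand-in)
    The second term is exactly the detector's score, so minimizing the loss
    pushes the example below the detection threshold while still hitting
    the target class.
    """
    p = predict(x_adv)
    attack_term = -np.log(p[target] + 1e-12)                     # targeted misclassification
    detector_term = np.sum(np.abs(p - predict(squeeze(x_adv))))  # L1 squeezed distance
    return attack_term + const * detector_term
```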
Raising the Bar or Changing the Game?

Metric Space 1: Target Classifier    Metric Space 2: "Oracle"

Before: find a small perturbation that changes the class for the classifier but is imperceptible to the oracle.
Now: the perturbation must change the class for both the original and the squeezed classifier, while remaining imperceptible to the oracle.
"Feature Squeezing" Conjecture

For any distance-limited adversarial method, there exists some feature squeezer that accurately detects its adversarial examples.

Intuition: if the perturbation is small (in some simple metric space), there is some squeezer that coalesces the original and the adversarial example into the same sample.
Defender's Advantage

Entropy: a random seed selects the squeezers, unknown to the adversary.

Input → Model → Prediction 0
Input → Squeezer 1 → Model → Prediction 1
Input → Squeezer 2 → Model → Prediction 2
…
Input → Squeezer k → Model′ → Prediction k

d(pred_0, pred_1, …, pred_k) → Yes: Adversarial / No: Legitimate
More Complex Squeezers + Entropy (CCS 2017)

Pick a random autoencoder as the squeezer.
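A sketch of the entropy idea, with `squeezer_pool` standing in for a set of (say) independently trained autoencoders; the sampling scheme here is illustrative, not the CCS 2017 paper's exact mechanism.

```python
import random

def detect_with_random_squeezer(predict, squeezer_pool, x, threshold, rng=random):
    """Per input, draw one squeezer (e.g. one of many trained autoencoders)
    from the pool; an adversary who cannot predict the draw must craft an
    example that evades every squeezer in the pool simultaneously."""
    squeeze = rng.choice(squeezer_pool)
    p0, p1 = predict(x), predict(squeeze(x))
    l1 = sum(abs(a - b) for a, b in zip(p0, p1))
    return l1 > threshold
```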
Changing the Game

Option 1: Find distance-limited adversarial methods for which it is intractable to find effective feature squeezers.
Option 2: Redefine adversarial examples so distance is not limited in a simple metric space… (the focus of the rest of the talk)
Do Humans Matter?

Metric Space 1: Machine      Metric Space 2: Human
Metric Space 1: Machine 1    Metric Space 2: Machine 2
Malware Classifiers
Automated Classifier Evasion Using Genetic Programming

Malicious PDF + Benign PDFs
  → Clone + Mutate → Variants → Evasive?
       yes → Found Evasive Variants
       no  → Select Promising Variants → back to Clone + Mutate
Generating Variants

Variants are produced by cloning selected variants (starting from the malicious seed PDF) and mutating each clone.
Generating Variants

Each variant is a PDF parse tree (e.g. /Root → /Catalog → /Pages, with an embedded /JavaScript eval("…"); node).
Mutation: select a random node, then randomly transform it: delete, insert, or replace.
Generating Variants

Inserted and replacement material comes from nodes harvested from the benign PDFs; a toy sketch of the mutation operator follows.
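A toy version of the mutation operator, modeling the PDF parse tree as nested dicts (a stand-in for the real PDF object graph); `all_paths`, `walk`, and `mutate` are illustrative helpers, not EvadeML's actual code.

```python
import random

def all_paths(tree, prefix=()):
    """Enumerate the path to every node in a nested-dict parse tree."""
    for key, child in tree.items():
        yield prefix + (key,)
        if isinstance(child, dict):
            yield from all_paths(child, prefix + (key,))

def walk(tree, path):
    """Return (parent_dict, last_key) for a node path."""
    for key in path[:-1]:
        tree = tree[key]
    return tree, path[-1]

def mutate(pdf_tree, benign_subtrees, rng=random):
    """Apply one random transformation: delete, insert, or replace a node.
    Inserted/replacement subtrees are harvested from benign PDFs."""
    path = rng.choice(list(all_paths(pdf_tree)))
    parent, key = walk(pdf_tree, path)
    op = rng.choice(["delete", "insert", "replace"])
    if op == "delete":
        del parent[key]
    elif op == "replace":
        parent[key] = rng.choice(benign_subtrees)
    else:  # insert a benign subtree as a sibling of the chosen node
        parent["benign_%d" % rng.randrange(1 << 16)] = rng.choice(benign_subtrees)
    return pdf_tree
```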
Selecting Promising Variants

Variants that are not yet evasive are scored, and the most promising ones are selected to be cloned and mutated in the next generation.
Selecting Promising Variants

Each candidate variant is scored by a fitness function f(score_oracle, score_classifier): the Oracle checks whether the malicious behavior survives, and the Target Classifier supplies its score. A sketch of one generation step follows.
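One generation of the search, sketched under simple assumptions (keep the fittest half, clone and mutate random survivors to refill); the real system's bookkeeping around stopping and reseeding differs.

```python
import copy
import random

def next_generation(variants, fitness, pop_size, mutate_fn, rng=random):
    """Score all variants, keep the fittest half, then refill the
    population by cloning random survivors and mutating the clones."""
    survivors = sorted(variants, key=fitness, reverse=True)[: max(1, pop_size // 2)]
    children = [mutate_fn(copy.deepcopy(rng.choice(survivors)))
                for _ in range(pop_size - len(survivors))]
    return survivors + children
```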
Oracle

Execute each candidate in a vulnerable Adobe Reader inside a Cuckoo Sandbox virtual environment (https://github.com/cuckoosandbox), with a simulated network (INetSim).
Behavioral signature: HTTP_URL + HOST extracted from API traces; the variant is malicious if the signature matches.
Advantage: we know the target malware behavior.
Fitness Function

Assumes lost malicious behavior will not be recovered.

f(v) = 0.5 − classifier_score(v)   if oracle(v) = "malicious"
     = −∞                          otherwise

(classifier_score(v) ≥ 0.5 means v is labeled malicious)
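The slide's fitness function translated directly into code; `oracle` and `classifier_score` are placeholder callables standing in for the Cuckoo oracle and the target classifier.

```python
def fitness(variant, oracle, classifier_score):
    """The slide's fitness function, directly in code.

    Variants that lose the malicious behavior get fitness -inf (the search
    assumes the behavior will not be recovered); among the rest, lower
    classifier scores are fitter, and a score below 0.5 means the variant
    is already labeled benign."""
    if oracle(variant) == "malicious":
        return 0.5 - classifier_score(variant)
    return float("-inf")
```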
[Plot: seeds evaded (out of 500) vs. number of mutations, for PDFRate and Hidost; PDFRate is evaded with fewer mutations than Hidost.]
Simple transformations often worked. For example, the single mutation (insert, /Root/Pages/Kids, 3:/Root/Pages/Kids/4/Kids/5/) works on 162 of the 500 seeds. Some seeds, however, required complex transformations.
Possible Defenses
Possible Defense: Adjust Threshold Charles Smutz, Angelos Stavrou. When a Tree Falls: Using Diversity in Ensemble Classifiers to Identify Evasion in Malware Detectors. NDSS 2016.
Evading PDFRate

[Chart: PDFRate scores of the original malicious seeds, all above the malicious-label threshold.]
Adjust Threshold?

[Chart: the discovered evasive variants score below the malicious-label threshold.]
Adjust Threshold?

[Chart: variants found with threshold = 0.50 vs. variants found with threshold = 0.25; rerunning the search against the lower threshold still finds evasive variants.]
Possible Defense: Hide Classifier
Hide the Classifier Score?

What if the target classifier does not expose its score, so the fitness function f(score_oracle, score_classifier) cannot use it?
Binary Classifier Output is Enough (ACM CCS 2017)

The search still succeeds when the target classifier returns only a binary malicious/benign label instead of a score.
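A hedged sketch of how the search can be driven by a binary label alone: reward variants that keep the malicious behavior and already flip the label. This illustrates the point rather than reproducing the CCS 2017 paper's algorithm.

```python
def fitness_binary(variant, oracle, classifier_label):
    """Score a variant using only the classifier's binary output."""
    if oracle(variant) != "malicious":
        return float("-inf")      # the mutation destroyed the exploit
    # 1.0 if the label already flipped to benign, else 0.0; coarser than a
    # real-valued score, but still enough signal to steer the search.
    return 1.0 if classifier_label(variant) == "benign" else 0.0
```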
Possible Defense: Retrain Classifier