Understanding and Mitigating the Tradeoff Between Robustness and Accuracy
Aditi Raghunathan*, Sang Michael Xie*, Fanny Yang, John C. Duchi, Percy Liang (Stanford University)
Adversarial examples
• Standard training leads to models that are not robust [Goodfellow et al. 2015]
• Adversarial training is a popular approach to improve robustness: it augments the training set on-the-fly with adversarial examples (see the sketch below)
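As a concrete illustration of the on-the-fly augmentation, here is a minimal sketch of one adversarial training step using a single-step FGSM attack, assuming a PyTorch classifier; `model`, `optimizer`, and `eps` are illustrative placeholders, and the methods discussed in this talk use stronger multi-step (PGD) or TRADES-style training.

```python
import torch
import torch.nn.functional as F

def adversarial_training_step(model, x, y, optimizer, eps=8 / 255):
    # Craft an FGSM adversarial example for each input: one step of
    # gradient ascent on the loss w.r.t. the input. PGD-based training
    # iterates this inner step several times.
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    with torch.no_grad():
        x_adv = (x_adv + eps * x_adv.grad.sign()).clamp(0.0, 1.0)

    # Train on the perturbed batch instead of the clean one.
    optimizer.zero_grad()
    F.cross_entropy(model(x_adv), y).backward()
    optimizer.step()
```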
Adversarial training increases standard error

CIFAR-10:

Method                                           Robust Accuracy   Standard Accuracy
Standard Training                                0%                95.2%
TRADES Adversarial Training (Zhang et al. 2019)  55.4%             84.0%

Robust Accuracy: % of test examples correctly classified after an ℓ∞-bounded adversarial perturbation (see the evaluation sketch below)

Why is there a tradeoff between robustness and accuracy? We only augmented with more data!
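For concreteness, here is a sketch of how robust accuracy is commonly measured, assuming a PyTorch classifier and an ℓ∞ budget; the attack strength (`eps`, `steps`, step size) is illustrative, not the exact evaluation protocol used by the papers above.

```python
import torch
import torch.nn.functional as F

def robust_accuracy(model, loader, eps=8 / 255, steps=20):
    """Fraction of test examples still classified correctly after an
    l_inf-bounded PGD attack with budget eps (illustrative settings)."""
    correct, total = 0, 0
    alpha = 2.5 * eps / steps          # common step-size heuristic
    for x, y in loader:
        x_adv = x.clone().detach()
        for _ in range(steps):
            x_adv.requires_grad_(True)
            loss = F.cross_entropy(model(x_adv), y)
            (grad,) = torch.autograd.grad(loss, x_adv)
            with torch.no_grad():
                x_adv = x_adv + alpha * grad.sign()
                x_adv = x + (x_adv - x).clamp(-eps, eps)  # project to l_inf ball
                x_adv = x_adv.clamp(0.0, 1.0)
        with torch.no_grad():
            correct += (model(x_adv).argmax(dim=1) == y).sum().item()
            total += y.numel()
    return correct / total
```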
Prior hypotheses for the tradeoff
• Optimal predictor not robust to adversarial perturbations [Tsipras et al. 2019]
  • But typical perturbations are imperceptible, so robustness should be possible (more realistic setting: consistent perturbations)
• Hypothesis class not expressive enough [Nakkiran et al. 2019]
  • But neural networks are highly expressive, reaching 100% standard and robust training accuracy (more realistic setting: well-specified model family)
These hypotheses suggest a tradeoff even in the infinite data limit…
No tradeoff with infinite data
• Observations (CIFAR-10):
  • The gap between robust and standard accuracies is large in the small-data regime
  • The gap decreases with labeled sample size
• We ask: if we have consistent perturbations + a well-specified model family (no inherent tradeoff), why do we observe a tradeoff in practice?
Results overview
• Characterize how training with consistent extra data can increase standard error even in well-specified noiseless linear regression
• Analysis suggests robust self-training (RST) to mitigate the tradeoff [Carmon et al. 2019, Najafi et al. 2019, Uesato et al. 2019] (see the sketch after this list)
• Prove that RST improves robust error without hurting standard error in the linear setting with unlabeled data
• Empirically, RST improves robust and standard error across different adversarial training algorithms and adversarial perturbation types
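A minimal sketch of the robust self-training procedure described above, assuming PyTorch tensors; `train_std` and `adv_train` are hypothetical placeholders for any standard and adversarial training routines.

```python
import torch

def robust_self_training(train_std, adv_train, x_labeled, y_labeled, x_unlabeled):
    """Sketch of robust self-training (RST) on labeled + unlabeled data."""
    # Step 1: standard training on the labeled data only.
    standard_model = train_std(x_labeled, y_labeled)

    # Step 2: pseudo-label the unlabeled data with the standard model.
    with torch.no_grad():
        y_pseudo = standard_model(x_unlabeled).argmax(dim=1)

    # Step 3: adversarial training on labeled + pseudo-labeled data.
    x_all = torch.cat([x_labeled, x_unlabeled])
    y_all = torch.cat([y_labeled, y_pseudo])
    return adv_train(x_all, y_all)
```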
Noiseless linear regression
• Model: y = x^⊤ θ*  (well-specified)
• Standard data: X_std ∈ ℝ^{n×d}, y_std = X_std θ*, with n ≪ d (overparameterized)
• Extra data (adversarial examples): X_ext ∈ ℝ^{m×d}, y_ext = X_ext θ*  (consistent)
• We study min-norm interpolants:
  • θ_std = argmin_θ { ‖θ‖₂ : X_std θ = y_std }
  • θ_aug = argmin_θ { ‖θ‖₂ : X_std θ = y_std, X_ext θ = y_ext }
• Standard error: (θ − θ*)^⊤ Σ (θ − θ*) for population covariance Σ
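To make the setup concrete, here is a small NumPy sketch that builds both min-norm interpolants via the pseudoinverse. The Gaussian data, isotropic Σ, and dimensions are illustrative assumptions; whether θ_aug actually has higher standard error than θ_std depends on the geometry of X_ext and Σ, which is exactly what the analysis characterizes.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m, d = 10, 10, 100                       # n, m << d: overparameterized
theta_star = rng.normal(size=d)

X_std = rng.normal(size=(n, d))             # standard training inputs
X_ext = rng.normal(size=(m, d))             # illustrative "extra" inputs
y_std = X_std @ theta_star                  # noiseless, consistent labels
y_ext = X_ext @ theta_star

# Min-norm interpolants: for a consistent underdetermined system
# X theta = y, pinv(X) @ y is the least-norm solution.
theta_std = np.linalg.pinv(X_std) @ y_std
X_aug = np.vstack([X_std, X_ext])
theta_aug = np.linalg.pinv(X_aug) @ np.concatenate([y_std, y_ext])

def std_error(theta, Sigma=np.eye(d)):
    # Standard error (theta - theta*)^T Sigma (theta - theta*),
    # with Sigma = I as an illustrative population covariance.
    e = theta - theta_star
    return e @ Sigma @ e

print(std_error(theta_std), std_error(theta_aug))
```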