Kevin Roth*, Yannic Kilcher*, Thomas Hofmann (ETH Zürich), poster #62
Log-Odds & Adversarial Examples
Adversarial examples cause atypically large feature-space perturbations along the weight-difference direction.
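To make "feature-space perturbation along the weight-difference direction" concrete, here is a minimal sketch (not from the poster) that projects the change in penultimate-layer features onto the weight-difference direction of the final linear layer. The interface names `model.features(x)` and `model.fc.weight` are placeholder assumptions.

```python
import torch

def weight_difference_projection(model, x_nat, x_adv, y_true, y_adv):
    """Project the feature-space perturbation phi(x_adv) - phi(x_nat) onto the
    (normalized) weight-difference direction w_{y_adv} - w_{y_true}.

    Assumes a hypothetical interface: `model.features(x)` returns penultimate-layer
    activations and `model.fc.weight` holds the final linear layer's weight rows.
    """
    with torch.no_grad():
        delta_phi = (model.features(x_adv) - model.features(x_nat)).flatten()  # feature-space shift
        w_diff = (model.fc.weight[y_adv] - model.fc.weight[y_true]).flatten()  # weight-difference direction
        w_diff = w_diff / w_diff.norm()
        # A large positive projection indicates an atypically big shift along w_diff.
        return torch.dot(delta_phi, w_diff)
```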
Adversarial Cone
[Figure: decision landscape around the natural input x* and the adversarial example x_adv, probed along the adversarial direction and random directions; regions where P_y*(.) = 1 and P_y*(.) = 0.]
Adversarial examples are embedded in a cone-like structure.
[Figure: softmax evaluated at x_adv + t · noise for increasing noise magnitude t.]
Noise as a probing instrument.
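The probing idea can be sketched in a few lines: sample isotropic Gaussian noise around x_adv and record how much probability mass the softmax puts back on the presumed true class y*. This is an illustrative sketch, not the poster's code; `model(x)` is assumed to return logits, and the noise scales are placeholder values.

```python
import torch
import torch.nn.functional as F

def probe_with_noise(model, x_adv, y_star, sigmas=(0.02, 0.05, 0.1), n_samples=64):
    """Probe the neighbourhood of x_adv (shape (1, C, H, W)) with Gaussian noise
    and return the mean softmax probability of the presumed true class y*
    for each noise magnitude sigma."""
    results = {}
    with torch.no_grad():
        for sigma in sigmas:
            noise = sigma * torch.randn(n_samples, *x_adv.shape[1:], device=x_adv.device)
            probs = F.softmax(model(x_adv + noise), dim=1)   # softmax(x_adv + noise)
            results[sigma] = probs[:, y_star].mean().item()  # mean P_{y*} under noise
    return results
```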
Main Idea: Log-Odds Robustness
The robustness properties of the pairwise log-odds f_{y,z}(x) under noise differ depending on whether x is natural or adversarial: the noise-induced change f_{y,z}(x + η) - f_{y,z}(x) tends to have a characteristic direction if x is adversarial, whereas it tends not to have a specific direction if x is natural.
Noise can partially undo the effect of the adversarial perturbation and directionally revert the log-odds towards the true class y*.
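A minimal sketch of the quantity behind this observation: the noise-induced change of the pairwise log-odds, g_{y,z}(x, η) = f_{y,z}(x + η) - f_{y,z}(x) with f_{y,z}(x) = F_z(x) - F_y(x), averaged over Gaussian noise. `model(x)` is assumed to return the logits F(x); sigma and n_samples are placeholder values, not the poster's settings.

```python
import torch

def expected_perturbed_log_odds(model, x, y, sigma=0.05, n_samples=256):
    """Estimate E_eta[ f_{y,z}(x + eta) - f_{y,z}(x) ] for all classes z,
    where f_{y,z}(x) = F_z(x) - F_y(x) and eta ~ N(0, sigma^2 I)."""
    with torch.no_grad():
        logits = model(x)                                  # F(x), shape (1, K)
        f_clean = logits - logits[:, y:y + 1]              # f_{y,z}(x) for all z
        noise = sigma * torch.randn(n_samples, *x.shape[1:], device=x.device)
        logits_noisy = model(x + noise)                    # F(x + eta), shape (n_samples, K)
        f_noisy = logits_noisy - logits_noisy[:, y:y + 1]  # f_{y,z}(x + eta)
        # For adversarial x this average tends to point towards the true class.
        return (f_noisy - f_clean).mean(dim=0)             # shape (K,)
```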
Statistical Test & Corrected Classification
We propose to use the noise-perturbed pairwise log-odds to test whether an input x classified as y should be thought of as a manipulated example of true class z: x is flagged as adversarial if the expected standardized noise-induced log-odds ḡ_{y,z}(x) exceed a class-pair dependent threshold τ_{y,z} for some z ≠ y.
Corrected classification: reassign x to ŷ = argmax_z { ḡ_{y,z}(x) - τ_{y,z} }, keeping the original prediction y if no candidate class exceeds its threshold.
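A sketch of how the test and the corrected classification could look, assuming the per class-pair statistics mu, sigma and thresholds tau have been estimated on clean data beforehand; the names below are placeholders, not the poster's code.

```python
import torch

def detect_and_correct(g_bar, y, mu, sigma, tau):
    """g_bar : (K,) expected noise-induced log-odds for an input x classified as y.
       mu, sigma, tau : (K, K) per class-pair statistics / thresholds from clean data.
       Returns (is_adversarial, corrected_label)."""
    scores = (g_bar - mu[y]) / sigma[y] - tau[y]   # standardized log-odds minus threshold
    scores[y] = float('-inf')                      # only candidate classes z != y
    is_adversarial = bool((scores >= 0).any())     # adversarial if any margin is non-negative
    y_corrected = int(scores.argmax()) if is_adversarial else y
    return is_adversarial, y_corrected
```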
Detection Rates & Corrected Classification
● Our statistical test detects nearly all adversarial examples at a false positive rate of ~1%
● Our correction method reclassifies almost all adversarial examples successfully
● The drop in performance on clean samples is negligible
[Figure: detection rate and classification accuracy as a function of attack strength ε.]
The detection rate increases with increasing attack strength; corrected classification compensates for the decay in uncorrected accuracy as the attack strength grows.
Defending against Defense-Aware Attacks
● The attacker has full knowledge of the defense and computes perturbations that work in expectation under the noise source used for detection (see the sketch below)
● Detection rates and corrected accuracies remain remarkably high
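One way such a defense-aware attacker can be realized is an EOT-style PGD attack that ascends the classification loss in expectation over the detection noise. The following is an illustrative sketch with placeholder hyper-parameters, not the exact attack evaluated on the poster.

```python
import torch
import torch.nn.functional as F

def defense_aware_attack(model, x, y_true, eps=8/255, sigma=0.05,
                         steps=40, step_size=0.01, n_noise=16):
    """PGD-style attack whose loss is an expectation over the noise source used
    by the detector (x has shape (1, C, H, W), y_true has shape (1,))."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        noise = sigma * torch.randn(n_noise, *x.shape[1:], device=x.device)
        logits = model(x + delta + noise)                        # evaluate under detection noise
        loss = F.cross_entropy(logits, y_true.repeat(n_noise))   # expected loss under noise
        loss.backward()
        with torch.no_grad():
            delta += step_size * delta.grad.sign()               # ascend the expected loss
            delta.clamp_(-eps, eps)                              # stay inside the eps-ball
        delta.grad.zero_()
    return torch.clamp(x + delta, 0.0, 1.0).detach()             # keep pixels in [0, 1]
```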
Thank You (poster #62)
Kevin Roth, Yannic Kilcher, Thomas Hofmann
Follow-Up Work: "Adversarial Training Generalizes Data-dependent Spectral Norm Regularization", ICML Workshop on Generalization (June 14)
References
The approaches most related to our work are those that detect whether or not the input has been perturbed, either by detecting characteristic regularities in the adversarial perturbations themselves or in the network activations they induce.
● Grosse, Kathrin, et al. "On the (statistical) detection of adversarial examples." (2017).
● Metzen, Jan Hendrik, et al. "On detecting adversarial perturbations." (2017).
● Feinman, Reuben, et al. "Detecting adversarial samples from artifacts." (2017).
● Xu, Weilin, David Evans, and Yanjun Qi. "Feature squeezing: Detecting adversarial examples in deep neural networks." (2017).
● Song, Yang, et al. "PixelDefend: Leveraging generative models to understand and defend against adversarial examples." (2017).
● Carlini, Nicholas, and David Wagner. "Adversarial examples are not easily detected: Bypassing ten detection methods." (2017).
● … and many more