Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers

Hong Liu, Mingsheng Long, Jianmin Wang, Michael I. Jordan
School of Software, Tsinghua University
National Engineering Lab for Big Data Software
University of California, Berkeley
https://github.com/thuml

36th International Conference on Machine Learning, June 8, 2019
Outline
1. Domain Adaptation
2. Hidden Limitations of Adversarial Feature Adaptation
   - The Adaptability
3. Transferable Adversarial Training
   - Generating Transferable Examples
   - Training with Transferable Examples
4. Experiments
Transfer Learning

In real-world applications, the IID assumption is frequently violated. How can we generalize a learner across non-IID distributions $P \neq Q$?

(Figure: a source domain of 2D renderings and a target domain of real images, with $P(x, y) \neq Q(x, y)$; the same model $f: x \to y$ is applied to both domains through a shared representation.)
Domain Adaptation

Transfer knowledge across different domains: the learner is provided with $n_s$ i.i.d. observations $\{x_s^{(i)}, y_s^{(i)}\}_{i=1}^{n_s}$ from a source domain with distribution $P(x_s, y_s)$, and $n_t$ i.i.d. observations $\{x_t^{(i)}\}_{i=1}^{n_t}$ from a target domain with distribution $Q(x_t, y_t)$.

- Learn an accurate model for the target domain.
- Formally bound the target risk with the source risk.
- Adaptation and knowledge transfer.
The $\mathcal{H}\Delta\mathcal{H}$-Divergence

For any hypothesis $h \in \mathcal{H}$, with probability no less than $1 - \delta$,

$$\epsilon_Q(h, f_Q) \le \hat{\epsilon}_P(h, f_P) + D_{\mathcal{H}\Delta\mathcal{H}}(\hat{P}, \hat{Q}) + \lambda + 10\,\hat{R}_P(h) + 8\,\hat{R}_Q(h) + 6\sqrt{\frac{\log(6/\delta)}{m}} + 3\sqrt{\frac{\log(6/\delta)}{n}}, \tag{1}$$

where

$$D_{\mathcal{H}\Delta\mathcal{H}}(P, Q) = \sup_{h, h' \in \mathcal{H}} \left| \epsilon_Q(h, h') - \epsilon_P(h, h') \right|, \qquad \lambda = \epsilon_P(h^*, f_P) + \epsilon_Q(h^*, f_Q), \tag{2}$$

$$h^* = \arg\min_{h \in \mathcal{H}} \epsilon_P(h, f_P) + \epsilon_Q(h, f_Q). \tag{3}$$

Intuitively, the target risk can be bounded by the source risk, plus the discrepancy between the source and the target, plus the risk of the best hypothesis we can expect.

Ben-David, S., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., and Vaughan, J. W. A theory of learning from different domains. Machine Learning, 79(1-2):151–175, 2010.
Adversarial Feature Adaptation

- Minimize the source risk: train the model with supervision from the source domain.
- Minimize the discrepancy term: learn a new feature representation in which the discrepancy is minimized.

The two-player game:
- A domain discriminator tries to discriminate the source and target domains, while the feature extractor tries to confuse it.
- Two classifiers try to maximize their disagreement while the feature extractor tries to minimize it.

Ganin, Y., Ustinova, E., Ajakan, H., Germain, P., Larochelle, H., Marchand, M., and Lempitsky, V. Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17(1):2096–2030, 2016.

Saito, K., Watanabe, K., Ushiku, Y., and Harada, T. Maximum classifier discrepancy for unsupervised domain adaptation. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3723–3732, 2018.
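The discriminator half of this two-player game can be sketched in plain numpy. Everything below (a linear-logistic discriminator, synthetic 2-D features, the learning rate and iteration count) is an illustrative assumption, not the cited DANN/MCD implementations; the feature extractor's reversed update is only indicated in a comment.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def discriminator_loss(w, feat_s, feat_t):
    """Binary cross-entropy: D labels source features 1, target features 0."""
    p_s = sigmoid(feat_s @ w)
    p_t = sigmoid(feat_t @ w)
    return -np.mean(np.log(p_s + 1e-8)) - np.mean(np.log(1.0 - p_t + 1e-8))

rng = np.random.default_rng(0)
feat_s = rng.normal(loc=+1.0, size=(64, 2))  # toy source-domain features
feat_t = rng.normal(loc=-1.0, size=(64, 2))  # toy target-domain features
w = np.zeros(2)

# Discriminator step: analytic gradient descent on its own loss.
for _ in range(200):
    p_s = sigmoid(feat_s @ w)
    p_t = sigmoid(feat_t @ w)
    grad = (-((1.0 - p_s)[:, None] * feat_s).mean(axis=0)
            + (p_t[:, None] * feat_t).mean(axis=0))
    w -= 0.5 * grad

# An adversarial feature extractor would update the *features* in the opposite
# direction (gradient reversal), increasing this loss to confuse D.
print(discriminator_loss(w, feat_s, feat_t))  # well below the chance level 2*log(2)
```

In the full game, this minimization alternates with (or is reversed against) the feature-extractor update, which is exactly the step TAT argues can damage the discriminative structure of the representation.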
The Adaptability: Hidden Limitations of Adversarial Feature Adaptation

Adaptability, quantified by $\lambda$, is an essential prerequisite of domain adaptation: if $\lambda$ is large, we can never expect to adapt a learner trained on the source domain to the target domain.

Simply learning a new feature representation cannot guarantee that the ideal joint risk will not explode: diminishing domain-specific variations inevitably breaks the discriminative structures of the original representations.
Possible Solutions

Since we have no access to target labels, we cannot expect to minimize $\lambda$ directly. Can we at least prevent the adaptability from getting worse?
- Fix the feature representations and adapt classifiers instead.

With the feature representations fixed, how can we adapt to the target domain?
- Adapt deep classifiers: extend the adversarial training paradigm to domain adaptation.
Transferable Adversarial Training

Instead of feature adaptation, associate the source and target domains with transferable examples:
- Generate transferable examples at the feature level.
- Adapt the classifier to the target domain by training on transferable examples.
Generating Transferable Examples

Generate transferable examples to bridge the domain gap: train a classifier and a domain discriminator; transferable examples should confuse both of them.

$$\ell_d(\theta_D, f) = -\frac{1}{n_s}\sum_{i=1}^{n_s}\log\!\left[D(f_s^{(i)})\right] - \frac{1}{n_t}\sum_{i=1}^{n_t}\log\!\left[1 - D(f_t^{(i)})\right], \tag{4}$$

$$\ell_c(\theta_C, f) = \frac{1}{n_s}\sum_{i=1}^{n_s}\ell_{ce}\!\left(C(f_s^{(i)}), y_s^{(i)}\right). \tag{5}$$

Concretely, we generate transferable examples from both domains in an iterative manner:

$$f_{t_{k+1}} \leftarrow f_{t_k} + \beta\,\nabla_{f_{t_k}}\ell_d(\theta_D, f_{t_k}) - \gamma\,\nabla_{f_{t_k}}\ell_2(f_{t_k}, f_{t_0}), \tag{6}$$

$$f_{s_{k+1}} \leftarrow f_{s_k} + \beta\,\nabla_{f_{s_k}}\ell_d(\theta_D, f_{s_k}) - \gamma\,\nabla_{f_{s_k}}\ell_2(f_{s_k}, f_{s_0}) + \beta\,\nabla_{f_{s_k}}\ell_c(\theta_C, f_{s_k}). \tag{7}$$
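The target-side update of Eq. (6) can be sketched as follows. A linear-logistic discriminator $D(f) = \sigma(f \cdot w)$ is assumed so the gradient of $\ell_d$ has a closed form; `beta`, `gamma`, the step count, and all names are illustrative choices, with the $1/n_t$ factor folded into `beta`.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gen_transferable_target(f_t0, w, beta=0.1, gamma=0.1, steps=10):
    """Eq. (6) sketch: gradient ascent on the discriminator loss, with an
    L2 penalty keeping the examples near the original features f_t0."""
    f_t = f_t0.copy()
    for _ in range(steps):
        p_t = sigmoid(f_t @ w)               # D's "source" probability
        grad_d = p_t[:, None] * w[None, :]   # per-example grad of -log(1 - D(f))
        grad_l2 = 2.0 * (f_t - f_t0)         # grad of ||f - f_t0||^2
        f_t = f_t + beta * grad_d - gamma * grad_l2
    return f_t

rng = np.random.default_rng(0)
f_t0 = rng.normal(loc=-1.0, size=(32, 2))  # original target features
w = np.ones(2)                             # toy discriminator weights

f_t_star = gen_transferable_target(f_t0, w)
# Generated examples drift toward the source side of D's boundary:
print(np.mean(sigmoid(f_t_star @ w)) > np.mean(sigmoid(f_t0 @ w)))  # True
```

The source-side update of Eq. (7) is the same step with one additional ascent term, the gradient of the classification loss $\ell_c$, so source transferable examples also confuse the classifier.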
Training with Transferable Examples

Train the classifier and the domain discriminator on transferable examples:
- Require the classifier to make consistent predictions for the transferable examples and their original counterparts.
- Train the domain discriminator to further distinguish transferable examples generated from the source and the target.

$$\ell_{c,adv}(\theta_C, f^*) = \frac{1}{n_s}\sum_{i=1}^{n_s}\ell_{ce}\!\left(C(f_s^{*(i)}), y_s^{(i)}\right) + \frac{1}{n_t}\sum_{i=1}^{n_t}\left\| C(f_t^{*(i)}) - C(f_t^{(i)}) \right\|, \tag{8}$$

$$\ell_{d,adv}(\theta_D, f^*) = -\frac{1}{n_s}\sum_{i=1}^{n_s}\log\!\left[D(f_s^{*(i)})\right] - \frac{1}{n_t}\sum_{i=1}^{n_t}\log\!\left[1 - D(f_t^{*(i)})\right]. \tag{9}$$
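Eqs. (8)-(9) can be rendered in numpy as below. Toy linear maps stand in for the classifier $C$ and discriminator $D$; the consistency term is taken as a per-example L2 norm, and all shapes and names are assumptions, not the paper's implementation.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_c_adv(Wc, f_s_star, y_s, f_t_star, f_t):
    """Eq. (8): source CE on transferable examples + target consistency."""
    p_s = softmax(f_s_star @ Wc)
    ce = -np.mean(np.log(p_s[np.arange(len(y_s)), y_s] + 1e-8))
    consistency = np.mean(np.linalg.norm(f_t_star @ Wc - f_t @ Wc, axis=1))
    return ce + consistency

def loss_d_adv(w_d, f_s_star, f_t_star):
    """Eq. (9): D separates source vs. target transferable examples."""
    p_s = sigmoid(f_s_star @ w_d)
    p_t = sigmoid(f_t_star @ w_d)
    return -np.mean(np.log(p_s + 1e-8)) - np.mean(np.log(1.0 - p_t + 1e-8))

rng = np.random.default_rng(0)
f_s_star = rng.normal(size=(16, 4))            # source transferable examples
y_s = rng.integers(0, 3, size=16)              # 3-class toy labels
f_t = rng.normal(size=(16, 4))                 # original target features
f_t_star = f_t + 0.1 * rng.normal(size=(16, 4))
Wc = np.zeros((4, 3))                          # toy classifier weights
w_d = np.zeros(4)                              # toy discriminator weights

print(loss_c_adv(Wc, f_s_star, y_s, f_t_star, f_t))  # log(3): uniform CE, zero consistency
print(loss_d_adv(w_d, f_s_star, f_t_star))           # 2*log(2): chance-level D
```

At zero weights both losses sit at their chance levels, which is a quick sanity check before plugging the losses into a training loop over $\theta_C$ and $\theta_D$.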
The Overall Optimization Problem

$$\min_{\theta_D, \theta_C} \; \ell_d(\theta_D, f) + \ell_c(\theta_C, f) + \ell_{d,adv}(\theta_D, f^*) + \ell_{c,adv}(\theta_C, f^*). \tag{10}$$

Transferable adversarial training operates on fixed feature representations:
- Fixed feature representations: guaranteed adaptability.
- No need for feature adaptation: lightweight computation, an order of magnitude faster than adversarial feature adaptation.
Analysis

The rotating two-moon problem: the target domain is rotated 30° from the source domain.

(Figure: behavior on the two-moon problem. Purple and yellow "+"s indicate source samples, blue "+"s are target samples, and dots are transferable examples. (a) The source-only model. (b) The decision boundary of TAT. (c) The distribution of the transferable examples.)

As expected, transferable examples bridge the domain gap effectively.
Experimental Setups

Datasets:
- Office-31: standard benchmark
- ImageCLEF: balanced domains
- Office-Home: large domain gap
- VisDA: large-scale synthetic-to-real
- Multi-domain sentiment: sentiment polarity classification

(Figure: example images labeled Synthetic, Real, Product, and Clipart.)