What deep generative models can do for you: Opportunities, challenges, and open questions
1
Giulia Fanti Carnegie Mellon University
What deep generative models can do for you: Opportunities, - - PowerPoint PPT Presentation
What deep generative models can do for you: Opportunities, challenges, and open questions Giulia Fanti Carnegie Mellon University 1 Kiran Zinan Lin Hao Liang Alankar Jain Thekumparampil Chen Wang Sewoong Oh Vyas Sekar 2 Classifying
1
Giulia Fanti Carnegie Mellon University
2
Vyas Sekar Chen Wang Alankar Jain Zinan Lin Hao Liang Kiran Thekumparampil Sewoong Oh
3
Classification
Classifying network traffic
Reinforcement learning
Traffic engineering
Unsupervised methods
Clustering signals
4
5
Noise Learned parameters Model How do we pick π? How to combine noise? Time t π¦[π’ β 1] π¦[0]
Use dom domain n kno nowledg edge to extract high-level insights Design par param ametric model to model those insights Use da data to populate parameters
Network traffic has temporal patterns
π¦ π’ = sin ππ’ + π[π’]
! " = 1 day
Melamed (1993), Denneulin et al (2004), Swing, BURSE, Hierarchical bundling, Di et al (2014), β¦
7
8
π β π!
knowledge
9
Generator G Noise z Discriminator D FAKE! REAL
10
11
Use case 1
12
Vyas Sekar Chen Wang Alankar Jain Zinan Lin
Enterprises Researchers
13
Unreproducible research Limited potential Collaborative opportunities go untapped Division A Division B
14
Generative Model Generative Model Generative Model
Data Clearinghouse (ISAC, ISAO) Enterprises Researchers
Fideli lity Pri rivacy
Real Generated Generative Model Business secrets User data
15
16
Anonymized, raw data Expert-designed parametric models Machine-learned models DoppelGANger
Generating synthetic time series data with GANs
17
Google: task resource usage logs from 12.k machines (2011)
IBM: resource usage measurements from 100k containers
Wikipedia web traffic: # daily views of Wikipedia articles (2016)
FCC Meas asuring Broad adban and America ca: Internet traffic and performance measurements from consumer devices around the country
18
19
RN RNN Noi Noise ο»ΏAu Auxiliary Di Discrimina nator
1: 1: re real 0: 0: fa fake
ο»Ώ Di Discrimina nator
1: 1: re real 0: 0: fa fake
R1,β¦ ,β¦,R ,RS RN RNN Noi Noise RT-s+
s+1,β¦
,β¦,R ,RT
Mi Min/Ma Max Gener Generator (M (MLP) (mi minΒ±ma max/2 /2 Me Metadata Gener Generator (M (MLP) (A (A1, β¦, β¦, Am) Noi Noise
20
Unbatched Batched
21
Day
global min/max
22
max min (min, max) (min, max) (min, max)
23
24
Time series min value
Count
Before: Single generator
25
26
Withou hout auxiliary discriminator Wi With auxiliary discriminator Count Count Time series min value
27
RN RNN Noi Noise ο»ΏAu Auxiliary Di Discrimina nator
1: 1: re real 0: 0: fa fake
ο»Ώ Di Discrimina nator
1: 1: re real 0: 0: fa fake
R1,β¦ ,β¦,R ,RS RN RNN Noi Noise RT-s+
s+1,β¦
,β¦,R ,RT
Mi Min/Ma Max Gener Generator (M (MLP) (mi minΒ±ma max/2 /2 Me Metadata Gener Generator (M (MLP) (A (A1, β¦, β¦, Am) Noi Noise
28
29
30
31
Fidelity
Privacy
32
Use case 2
33
Vyas Sekar Zinan Lin Hao Liang
IoT Devices Servers / Routers Control Units in Vehicles / Manufacturing NO NO so source co code / bi bina nary / pr protoco col fo format / de design do doc
Towards Oblivious Network Analysis using GANs HotNets'19 11/14/2019 34
Towards Oblivious Network Analysis using GANs HotNets'19 11/14/2019
Send packets We We wa want to to id identif tify at attack ack pack packets, but but do do NO NOT ha have so source co code or system descr cript ption
35
Attacker
36
Classification Time Can an attacker identify many y packets with hi high classification times?
37
Thre Threshold hold Slo low p packets Fa Fast pac packets Can can we generate many, d , diverse s slo low packets? Classification Time (ms) Number of packets 2,000 total packets
38
GANs can help!
11/14/2019 Towards Oblivious Network Analysis using GANs HotNets'19
Classification decision tree Random Packets GAN Training Dataset βFastβ packets βSlowβ packets
1% 1%
16
11/14/2019 Towards Oblivious Network Analysis using GANs HotNets'19
GAN Training Dataset βFastβ packets βSlowβ packets Generate packets with condition=βslowβ
18
Random Packets Classification decision tree
Am AmpGAN AN
41
Classification Time (ms) Number of packets Number of packets Random
pac packets Am AmpGAN AN Thre Threshold hold
42
System Calls Fraction of βslowβ packets
Genetic Algorithms Simulated Annealing Generalized SA Bayesian Optimization AmpGAN
2.5x jump 10x jump
43
Use case 3
44
Zinan Lin Kiran Thekumparampil Sewoong Oh
Generator π¨$ π¨% π¨&
β¦
π factors
Vanilla GANs π¨$ π¨% π¨&
β¦
Factor$ Factor% Factor'
β¦
45 Code & Paper: https://github.com/fjxmlzn/InfoGAN-CR
π input noise Disentangled GANs π$ π% π'
β¦
Factor$ Factor% Factor'
β¦
π¨(s
Generator π$ π% π&
β¦
Changing only: π$ π% π) π* π+ hair color rotation lighting background bangs shape scale rotation x-position y-position
(CelebA dataset) (dSprites dataset)
46
π¨(s
* CelebA example is generated by InfoGAN-CR. Dsprites example is synthetic for illustration.
Latent codes The remaining noise dimensions
Generator (π»)
π = 1 π = 2 π = 3 Same shape Same x-position Same y-position
47
Same i-th latent code
/,β¦,π, /,β¦,π. / ) to generate a pair of
images
Equa l
Generator (π»)
π = 1 π = 2 π = 3
Contrastive Regularizer (CR)
1 2 3
Classification task!
48
Same i-th latent code
/,β¦,π, /,β¦,π. / ) to generate a pair of
images
Equa l
InfoGAN-CR loss: min
0,2,3 max 4
π567 π», πΈ β ππ½ π», π β π½π8(π», πΌ)
cl clas assificat cation accu accuracy acy of
CR
πΌ
[1] InfoGAN. Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., & Abbeel, P. (NeurIPS 2016)
! β β! $ β β" % β β#
!
GAN Discriminator InfoGAN Encoder
" #
β" CR β Μ $ β β" Input Noise Latent Factors %β² β β# %β²β² β β#
49
InfoGAN [1] GANβs adversarial loss Mutual info loss
50
51