Model-Assisted Generative Adversarial Networks Leigh Whitehead ICL Seminar 05/06/20
Overview
• What are Generative Adversarial Networks (GANs)?
• The Model-Assisted GAN
• Case Studies
  • Case Study I (from the paper)
  • Case Study II (from the paper)
  • Light simulation in DUNE
• Outlook and Summary
Leigh Whitehead - University of Cambridge
Generative Adversarial Networks
What are GANs?
• GANs are a type of neural network composed of two different networks
• Typically one is known as the generator and the other as the discriminator
• Invented by Ian Goodfellow in 2014 (arXiv:1406.2661)
• They are typically used for generating images
https://medium.com/swlh/face-morphing-using-generative-adversarial-network-gan-c751bba45095
What are GANs?
• A very simple schematic of the network architecture: Input Noise → Generator → Discriminator ← True Image
• The generator takes an input noise vector and produces a generated image
Training GANs
• The process of training GANs is a competition between the two networks
• The generator learns to trick the discriminator into classifying its images as real
• The discriminator learns to tell the difference between real and generated images
• Mathematically speaking, it is a two-player minimax game
• In each training iteration, we need to perform three steps to train these networks
• Repeat these until an equilibrium is reached and accurate generated images are produced
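The two-player minimax game is usually written (following Goodfellow et al., arXiv:1406.2661) as the value function the discriminator maximises and the generator minimises:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\left[\log D(x)\right]
+ \mathbb{E}_{z \sim p_z(z)}\left[\log\left(1 - D(G(z))\right)\right]
```

The first term rewards the discriminator for scoring real images highly; the second rewards it for scoring generated images $G(z)$ low, which is exactly what the generator tries to prevent.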
Training Step 1
• Train the discriminator to identify true images
• We tell it that the images are true (target y = 1)
Schematic: True Image → Discriminator, y = 1
Training Step 2
• Train the discriminator to distinguish true and fake images
• We tell it that the generated images are fake (target y = 0)
Schematic: Input Noise → Generator → Discriminator, y = 0
Training Step 3
• We now train the generator and discriminator as one model
• Set the target y = 1 to let the generator learn to make realistic images
• Discriminator weights are frozen
Schematic: Input Noise → Generator → Discriminator (frozen), y = 1
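The three steps can be sketched in a few lines. This is a toy illustration, not a working GAN: the "networks" are single affine maps on scalars, and no gradient updates are shown; the point is only where each target label (y = 1, 0, 1) enters and which loss each step would backpropagate.

```python
import numpy as np

rng = np.random.default_rng(0)

def bce(p, y):
    # binary cross-entropy for a predicted probability p and target y
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

# Toy stand-ins for the two networks (hypothetical, just affine maps):
def generator(z, theta_g):
    return theta_g[0] * z + theta_g[1]          # noise -> "image"

def discriminator(x, theta_d):
    return 1.0 / (1.0 + np.exp(-(theta_d[0] * x + theta_d[1])))  # "image" -> P(real)

theta_g = np.array([0.5, 0.0])
theta_d = np.array([1.0, 0.0])

x_real = rng.normal(2.0, 0.5)   # a "true image" (here just a number)
z = rng.normal()                # input noise

# Step 1: discriminator on a real sample, target y = 1
loss_d_real = bce(discriminator(x_real, theta_d), 1)

# Step 2: discriminator on a generated sample, target y = 0
x_fake = generator(z, theta_g)
loss_d_fake = bce(discriminator(x_fake, theta_d), 0)

# Step 3: generator trained through the (frozen) discriminator, target y = 1
loss_g = bce(discriminator(generator(z, theta_g), theta_d), 1)
```

In a real implementation each loss is backpropagated in turn: steps 1 and 2 update only the discriminator weights, step 3 updates only the generator.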
Fast moving field
• Things have progressed very quickly
• Back in 2016 there were a lot of horrors created
• I think these are supposed to be dogs
Ian Goodfellow, NIPS 2016 Tutorial: Generative Adversarial Networks
Fast moving field
• Things have progressed very quickly
• We can now see much better images
https://www.kaggle.com/c/generative-dog-images
Fast moving field
• Things have progressed very quickly
• We can now see much better images
https://twitter.com/goodfellow_ian/status/1084973596236144640
Conditional GANs
• The examples I have shown so far have had noise input to the generator
• Conditional GANs have generator outputs that are conditional on the input
• The generator input hence has some meaning
• Conditional GANs have found some applications in high energy physics for fast simulations
Schematic: Input Data → Generator → Discriminator ← True Image
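One common way to make the generator output conditional on a meaningful input is simply to concatenate a condition vector onto the noise. A minimal shape-level sketch (the 100-dim noise and 10-class one-hot condition are illustrative choices, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

batch = 32
z = rng.normal(size=(batch, 100))                 # noise input
labels = rng.integers(0, 10, size=batch)          # e.g. a class per image
cond = np.eye(10)[labels]                         # one-hot condition vectors

# Concatenate condition onto noise so the generator output
# depends on both; the discriminator is usually conditioned the same way
gen_input = np.concatenate([z, cond], axis=1)
```

The generator then sees a 110-dim input whose last 10 entries carry the meaning.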
Useful examples
• GANs can also be used in useful ways
• Image upscaling: increase image resolution
  • Maybe the CSI-style "zoom in, enhance" is on the way
• Robustness and security of image recognition
  • Important for self-driving cars!
• Physics!
• Etc, etc, etc
https://medium.com/@ageitgey/machine-learning-is-fun-part-8-how-to-intentionally-trick-neural-networks-b55da32b7196
Adversarial Attacks
• Application of noise invisible to the eye can completely fool some image recognition neural networks (Jiajun Lu, Hussein Sibai, Evan Fabry, Adversarial Examples that Fool Detectors, arXiv:1712.02494, 2017)
• Physical changes can also cause incorrect classification (Kevin Eykholt et al., Robust Physical-World Attacks on Deep Learning Models, arXiv:1707.08945, 2017)
• Training image-classifying networks adversarially can help to make them more robust
The Model-Assisted GAN S. Alonso-Monsalve and L. H. Whitehead, "Image-Based Model Parameter Optimization Using Model-Assisted Generative Adversarial Networks," in IEEE Transactions on Neural Networks and Learning Systems, 2020
Model-Assisted GAN
• I first had the idea for the MAGAN in 2018
• Image-recognition approaches are now common in HEP, but differences between simulation and data are a concern
• Saw examples of applying a GAN to the simulated images
  • This is effectively arbitrary bin-by-bin reweighting
• I wanted to find a method to modify the simulated images in a physically motivated way
• The MAGAN knows about the physics parameters that are used by the simulation
Model-Assisted GAN - Aims
• Instead of making images directly like a standard GAN, the MAGAN creates a vector of model parameters
• These parameters are the inputs that control the simulation
  • In reality these could be noise, energy scale, etc.
• We want to train a neural network to reproduce the simulation outputs for the whole parameter space
• We also want to be able to extract the model parameter values from a defined data sample
  • This allows us to tune the simulation
Model-Assisted GAN - Details
• To achieve those goals, the Model-Assisted GAN is a bit more complex than a standard GAN:
• Pre-training stage:
  • Train an emulator (E) to mimic the simulation (T) for the same model parameters, using a siamese network (S)
  • This stage is similar to training a conditional GAN
• Training stage:
  • Train a generator (G) against a discriminator (D) to make a model parameter vector such that the emulator makes images that match true data
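The two stages can be sketched structurally. Everything here is a hypothetical stand-in (the real T, E, G and S are the simulation and trained CNNs from the paper); the sketch only shows how the pieces connect: the siamese network compares T(r) and E(r) during pre-training, and the trained emulator is then driven by the generator's parameter vector.

```python
import numpy as np

rng = np.random.default_rng(1)
N_PARAMS = 4          # e.g. gradient, offset, x0, steps (as in Case Study I)
IMG = (28, 28)

# Hypothetical stand-ins for the real components:
def simulator(p):                 # T: model parameters -> image
    return np.outer(np.linspace(0, 1, IMG[0]) * p[0], np.ones(IMG[1]))

def emulator(p):                  # E: trained to mimic T (near-perfect here)
    return simulator(p) + rng.normal(0, 1e-3, IMG)

def generator(z):                 # G: noise -> model parameter vector
    return np.tanh(z[:N_PARAMS])

def siamese_distance(a, b):       # S: similarity score for two images
    return np.mean((a - b) ** 2)

# Pre-training stage: push E to match T for random parameters r
r = rng.uniform(-1, 1, N_PARAMS)
pretrain_loss = siamese_distance(simulator(r), emulator(r))

# Training stage: G proposes parameters; the discriminator D (not shown)
# then compares emulator(G(z)) images against true data images
z = rng.normal(size=16)
fake_image = emulator(generator(z))
```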
Model-Assisted GAN: Overview
• The full architecture of the MAGAN (schematic): the generator is a 1D CNN producing the model parameters; the emulator and the discriminator are 2D CNNs
• Similar to two GANs working together
Model-Assisted GAN
• The full architecture of the MAGAN (schematic): the generator is a 1D CNN producing the model parameters; the discriminator is a 2D CNN; the siamese network contains two 2D CNNs that share their network weights
• NB: Unlike the discriminator, the siamese network always takes two input images
Pretraining step
• This is very similar to a standard GAN
• The siamese network here is doing a similar job to a discriminator
• The inputs are our physics model parameters: choose random values r within some defined parameter space
Post pretraining
• We now have an emulator that produces the same images as the simulation for all model parameter values
• We can use the emulator for fast simulation
  • It is effectively the generator of a conditional GAN
• We can now move on to the second goal of extracting the model parameters from a true data sample
Training step
• This stage allows us to extract the physics parameters that best match the true data
• Here, the true data sample is one we produce using some chosen physics parameters; in an experiment, this would be real data
• G(z) is our generated vector of physics parameters
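To make concrete what "extracting the best-matching parameters" means, here is a stand-in with a hypothetical one-parameter simulation: instead of a generator trained adversarially, a brute-force scan finds the parameter whose emulated image best matches the true data. The MAGAN replaces this scan with G and D, but the objective is the same.

```python
import numpy as np

# Hypothetical 1-parameter "simulation": a horizontal intensity gradient
def make_image(slope):
    return slope * np.linspace(0, 1, 28)[None, :].repeat(28, axis=0)

true_param = 0.7
true_data = make_image(true_param)   # stands in for the real data sample

# Brute-force stand-in for the adversarial training of G:
candidates = np.linspace(0, 1, 101)
losses = [np.mean((make_image(c) - true_data) ** 2) for c in candidates]
best = candidates[int(np.argmin(losses))]
```

The recovered `best` is the tuned parameter value; in the MAGAN this comes out as the converged G(z).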
Case Study I S. Alonso-Monsalve and L. H. Whitehead, "Image-Based Model Parameter Optimization Using Model-Assisted Generative Adversarial Networks," in IEEE Transactions on Neural Networks and Learning Systems, 2020
Case Study I - Outline
• Start simple: an image containing a single line y = mx + c
• Model parameters are hence:
  • Gradient m
  • Offset c
  • Start position in x: x0
  • Length in x: x_steps
• Parameter space limited to values that produce lines that start and stop in the 28 x 28 pixel images
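The four-parameter "simulation" for this case study is simple enough to sketch directly. This is an illustrative reimplementation (the paper's exact rasterisation may differ): light one pixel per x column along y = mx + c, starting at x0 for x_steps columns of a 28 x 28 image.

```python
import numpy as np

def line_image(m, c, x0, x_steps, size=28):
    """Draw y = m*x + c from x0 for x_steps columns in a size x size image."""
    img = np.zeros((size, size))
    for x in range(x0, min(x0 + x_steps, size)):
        y = int(round(m * x + c))
        if 0 <= y < size:          # parameter space should keep this true
            img[y, x] = 1.0
    return img

img = line_image(m=0.5, c=2.0, x0=4, x_steps=16)
```

The constraint on the parameter space in the slide corresponds to choosing (m, c, x0, x_steps) so the `0 <= y < size` guard never actually clips the line.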
Case Study I - Pretraining
• Nice agreement after 500k steps!
• At this stage we now have a well-trained emulator (emulated images shown)