

  1. Deep Learning Techniques for Music Generation Compound and GAN (6) Jean-Pierre Briot Jean-Pierre.Briot@lip6.fr Laboratoire d’Informatique de Paris 6 (LIP6) Sorbonne Université – CNRS Programa de Pós-Graduação em Informática (PPGI) UNIRIO Deep Learning – Music Generation – 2018 Jean-Pierre Briot

2. Architectures

3. Architectures
• Feedforward – mini-bach.py
• Autoencoder – auto-bach.py
  – Variational Autoencoder (VAE) – VRAE
• Recurrent (RNN)
  – LSTM – lstm.py, Celtic
• Generative Adversarial Networks (GAN)
• Restricted Boltzmann Machine (RBM)
• Reinforcement Learning (RL)

4. Compound Architectures
• Autoencoder Stack = Autoencoder^n (diagram layer sizes: 784–400–200–100) – DeepHear, auto-bach.py
• Autoencoder(RNN, RNN) = RNN Encoder-Decoder – VRAE
• RNN Variational Encoder-Decoder – Music-VAE
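The stacked autoencoder above chains encoders of shrinking width (the 784–400–200–100 sizes come from the slide's diagram). A minimal sketch of the encoding side, with illustrative random weights (not taken from DeepHear or auto-bach.py):

```python
import numpy as np

rng = np.random.default_rng(0)
sizes = [784, 400, 200, 100]  # encoder layer widths from the slide's diagram

# One illustrative weight matrix and bias per encoding step
weights = [rng.normal(0, 0.01, (m, n)) for m, n in zip(sizes, sizes[1:])]
biases = [np.zeros(n) for n in sizes[1:]]

def encode(x):
    """Pass x through the stack; each layer compresses it further."""
    for W, b in zip(weights, biases):
        x = np.tanh(x @ W + b)
    return x

x = rng.normal(size=(1, 784))   # e.g. a flattened 28x28 input
z = encode(x)
print(z.shape)  # the innermost, most compressed code: (1, 100)
```

Decoding mirrors the same chain in reverse (100–200–400–784) to reconstruct the input.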

5. Generative Adversarial Networks (GAN) [Goodfellow et al., 2014]
• Training simultaneously 2 neural networks
  – Generator: transforms random noise vectors z into fake samples
  – Discriminator: estimates the probability that a sample came from the training data rather than from G
  – Minimax 2-player game
• Prediction by D:
  – D(x): P_D(x from real data) – correct, target P = 1
  – D(G(z)): P_D(G(z) from real data) – incorrect, target P = 0
  – 1 − D(G(z)): P_D(G(z) from Generator) – correct
[figure: Nam Hyuk Ahn, 2017]

6. GAN Equation
• Binary cross-entropy: H_B(y, ŷ) = −(y log ŷ + (1 − y) log(1 − ŷ))
• Real data, target D(x) = 1 ("x from real data" is correct):
  H_B(1, D(x)) = −(1 · log D(x) + (1 − 1) · log(1 − D(x))) = −log D(x)
• Generated data, target D(G(z)) = 0 ("G(z) from real data" is incorrect):
  H_B(0, D(G(z))) = −(0 · log D(G(z)) + (1 − 0) · log(1 − D(G(z)))) = −log(1 − D(G(z)))
• Discriminator loss = sum of both terms:
  H_B(1, D(x)) + H_B(0, D(G(z))) = −(log D(x) + log(1 − D(G(z))))
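The two cross-entropy terms can be checked numerically. The scores below are made-up example values, not taken from the slides:

```python
import math

def bce(y, y_hat):
    """Binary cross-entropy H_B(y, y_hat) = -(y log y_hat + (1-y) log(1-y_hat))."""
    return -(y * math.log(y_hat) + (1 - y) * math.log(1 - y_hat))

d_real = 0.9   # D(x): discriminator's score on a real sample
d_fake = 0.2   # D(G(z)): discriminator's score on a generated sample

# Target 1 for real data: H_B(1, D(x)) reduces to -log D(x)
loss_real = bce(1, d_real)
# Target 0 for generated data: H_B(0, D(G(z))) reduces to -log(1 - D(G(z)))
loss_fake = bce(0, d_fake)

assert abs(loss_real - (-math.log(d_real))) < 1e-12
assert abs(loss_fake - (-math.log(1 - d_fake))) < 1e-12
print(loss_real + loss_fake)  # the discriminator loss for this pair
```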

7. GAN and Turing Test
[figure, after Goodfellow, 2016: the Generator G(z; θ^(G)) maps a latent vector z ∈ ℝ^M to a generated sample x̂ ∈ ℝ^N (an "artist's rendition"); the Discriminator D(x; θ^(D)) maps a sample x ∈ ℝ^N, real or generated, to the probability that it is real]

8. GAN Basic Training Algorithm
• Initialize θ^(G), θ^(D)
• For t = 1 : b : T
  – Initialize Δθ^(D) = 0
  – For i = t : t + b − 1
    · Sample z_i ~ p(z_i) and a real example x_i
    · Compute D(G(z_i)) and D(x_i)
    · Δθ_i^(D) ← gradient of the Discriminator loss J^(D)(θ^(G), θ^(D))
    · Δθ^(D) ← Δθ^(D) + Δθ_i^(D)
  – Update θ^(D)
  – Initialize Δθ^(G) = 0
  – For j = t : t + b − 1
    · Sample z_j ~ p(z_j)
    · Compute D(G(z_j))
    · Δθ_j^(G) ← gradient of the Generator loss J^(G)(θ^(G), θ^(D))
    · Δθ^(G) ← Δθ^(G) + Δθ_j^(G)
  – Update θ^(G)
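A minimal runnable version of this alternating loop, on a toy 1-D problem of my own (not from the slides): the generator is a line G(z) = a·z + b, the discriminator a logistic unit D(x) = σ(w·x + c), the data is drawn from N(3, 0.5), and the gradients are written out by hand (non-saturating generator loss −log D(G(z))):

```python
import numpy as np

rng = np.random.default_rng(1)
sigmoid = lambda s: 1.0 / (1.0 + np.exp(-s))

a, b = 1.0, 0.0       # generator parameters (theta_G)
w, c = 0.1, 0.0       # discriminator parameters (theta_D)
lr, batch, steps = 0.05, 32, 500

for t in range(steps):
    # --- discriminator step: push D(x) toward 1 and D(G(z)) toward 0 ---
    x = rng.normal(3.0, 0.5, batch)         # real samples
    z = rng.normal(0.0, 1.0, batch)         # noise vectors
    g = a * z + b                           # fake samples G(z)
    dx, dg = sigmoid(w * x + c), sigmoid(w * g + c)
    grad_w = np.mean(-(1 - dx) * x + dg * g)   # d/dw of -log D(x) - log(1-D(g))
    grad_c = np.mean(-(1 - dx) + dg)
    w -= lr * grad_w
    c -= lr * grad_c
    # --- generator step: push D(G(z)) toward 1 ---
    z = rng.normal(0.0, 1.0, batch)
    g = a * z + b
    dg = sigmoid(w * g + c)
    grad_a = np.mean(-(1 - dg) * w * z)        # d/da of -log D(G(z))
    grad_b = np.mean(-(1 - dg) * w)
    a -= lr * grad_a
    b -= lr * grad_b

print(a, b)  # the generator's output drifts toward the real data's region
```

A real GAN replaces the hand-derived gradients with backpropagation through deep networks, but the alternating structure is exactly the one in the algorithm above.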

9. Examples of GAN Generated Images [Brundage et al., 2018]
• Synthetic (generated) celebrity images [Karras et al., 2018]
• Training data: CelebFaces Attributes Dataset (CelebA), > 200K celebrity images

10. C-RNN-GAN [Mogren, 2016] = GAN(Bidirectional-LSTM², LSTM²)
• The Discriminator considers the hidden layer values (forward and backward) to judge whether they are representative of the real data
  – Analogous to the RNN Encoder-Decoder, which considers the hidden layer as the summary of a sequence
• Classical music training dataset
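The idea of the discriminator reading forward and backward hidden states as a sequence summary can be sketched with a toy tanh-RNN. This is illustrative only: C-RNN-GAN uses two-layer LSTMs, and a real bidirectional RNN has separate parameters per direction, while for brevity one weight set is reused here:

```python
import numpy as np

rng = np.random.default_rng(2)
d_in, d_h, T = 4, 8, 16          # note-feature size, hidden size, sequence length
Wx = rng.normal(0, 0.1, (d_in, d_h))
Wh = rng.normal(0, 0.1, (d_h, d_h))

def run_rnn(seq):
    """Simple tanh RNN; returns the final hidden state."""
    h = np.zeros(d_h)
    for x_t in seq:
        h = np.tanh(x_t @ Wx + h @ Wh)
    return h

seq = rng.normal(size=(T, d_in))          # one (real or fake) note sequence
h_fwd = run_rnn(seq)                      # forward pass over the sequence
h_bwd = run_rnn(seq[::-1])                # backward pass over the reversed sequence
summary = np.concatenate([h_fwd, h_bwd])  # what the discriminator scores
print(summary.shape)
```

The discriminator's final layer then maps this fixed-size summary to a real/fake probability.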

11. MidiNet [Yang et al., 2017] = GAN(Conditioning(Convolutional(Feedforward), Convolutional(Feedforward(History, Chord sequence))), Conditioning(Convolutional(Feedforward), History))
• Convolutional
• Conditioning
  – Previous measure
  – Chord sequence
• Pop music training dataset
https://soundcloud.com/vgtsv6jf5fwq/model3

12. VAE vs GAN
• VAE (Variational Autoencoder) and GAN (Generative Adversarial Networks) [Dykeman, 2016]
Some similarities:
• Both are generative architectures
• Both generate from random latent variables
Differences:
• VAE is representative of the whole training dataset; GAN is not
• VAE offers a smooth control interface for exploring the latent space; GAN supports some latent exploration (e.g. interpolation), but not as smoothly as VAE
• GAN produces better quality content (e.g. better resolution images)
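The "smooth control interface" point boils down to interpolating between two latent codes and decoding each intermediate point. A sketch with made-up latent vectors (the decoder itself is omitted):

```python
import numpy as np

rng = np.random.default_rng(3)
z_a = rng.normal(size=16)   # latent code of, say, melody A (illustrative)
z_b = rng.normal(size=16)   # latent code of melody B

# Linear interpolation: each intermediate code would be decoded into a melody.
steps = 5
path = [(1 - t) * z_a + t * z_b for t in np.linspace(0.0, 1.0, steps)]

assert np.allclose(path[0], z_a) and np.allclose(path[-1], z_b)
print(len(path))
```

With a VAE, decoding the points along this line tends to give a gradual, musically plausible morph from A to B, because the training objective organizes the latent space; a plain GAN's latent space supports the same operation but with fewer guarantees of smoothness.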

13. Compound Architectures
• Composition
  – Bidirectional RNN, combining two RNNs, forward and backward in time
  – RNN-RBM [Boulanger-Lewandowski et al., 2012], combining an RNN (horizontal/sequence) and an RBM (vertical/chords)
• Refinement
  – Sparse autoencoder
  – Variational autoencoder (VAE) = Variational(Autoencoder)
• Nested
  – Stacked autoencoder = Autoencoder^n
  – RNN Encoder-Decoder = Autoencoder(RNN, RNN)
• Pattern instantiation
  – C-RBM [Lattner et al., 2016] = Convolutional(RBM)
  – C-RNN-GAN [Mogren, 2016] = GAN(Bidirectional-LSTM², LSTM²)
  – Anticipation-RNN [Hadjeres & Nielsen, 2017] = Conditioning(RNN, RNN)
