MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment Hao-Wen Dong*, Wen-Yi Hsiao*, Li-Chia Yang, Yi-Hsuan Yang Research Center of IT Innovation, Academia Sinica Demo Page https://salu133445.github.io/musegan/ *these authors contributed equally to this work
Outline 。 Goals & Challenges 。 Data 。 Proposed Model 。 Results & Evaluation 。 Future Works Source Code https://github.com/salu133445/musegan Demo Page https://salu133445.github.io/musegan/ 2
Generate pop music 。 of multiple tracks Goals 。 in piano-roll format [Source Code] https://github.com/ salu133445/musegan [Demo Page] https://salu133445. 。 using GAN with CNNs github.io/musegan/ 3
Multi-track GAN Challenge I Multitrack Interdependency vocal piano strings bass drums music & clip by phycause 4
Convolutional Challenge II Neural Networks Music Texture melody chord (harmony) 5
Challenge III Temporal Structure song paragraph 1 paragraph 2 paragraph 3 phrase 1 phrase 2 phrase 2 phrase 3 phrase 4 bar 1 bar 2 bar 3 bar 4 4/4 time beat 1 beat 2 beat 3 beat 4 step 1 step 2 ··· step 24 6
Challenge III Temporal Structure Convolutional Neural Networks Fixed Structure phrase 2 bar 1 bar 2 bar 3 bar 4 4/4 time beat 1 beat 2 beat 3 beat 4 step 1 step 2 ··· step 24 7
Piano-roll (with symbolic timing) Data Representation polyphonic multi-track time step Bar 1 Bar 2 Bar 3 Bar 4 pitch time 8
Piano-roll (with symbolic timing) Data Representation polyphonic multi-track Bar 1 Bar 2 Bar 3 Bar 4 A3 pitch t 0 t 1 time 9
Multi-track Piano-roll (with symbolic timing) Data Representation polyphonic multi-track pitch tracks time 10
Bass Data Representation Drums Strings Piano Guitar 4 bars 84 5 tracks pitches a 4 × 96 × 84 × 5 tensor 96 time steps 11
LPD (Lakh Pianoroll Dataset) 。 >170,000 multi-track piano-rolls 。 Derived from Lakh MIDI Dataset 。 Mainly pop songs Data Pypianoroll (Python package) 。 Manipulation & Visualization 。 Efficient Save/Load [Dataset] https://salu133445.gith 。 Parse/Write MIDI files ub.io/musegan/dataset 。 On PYPI (pip installable) [Pypianoroll] https://salu133445. github.io/pypianoroll/ 12
Generative Adversarial Networks Generator random noise fake data critic Discriminator z ~ p ( z ) G G( z ) (wgan-gp) D real/fake X real data 4-bar phrases of 5 tracks 13
MuseGAN – An Overview temporal bar generator generator G temp G bar 1 random noise 4 piano-roll matrices 4 latent variables 14
MuseGAN Bar Generator z z z G G z G G G z z z z z z z z z 15
MuseGAN Bar Generator Coordination z z z z G track-independent G z G G G No Coordination z z z z z z z z z track-dependent 16
MuseGAN Bar Generator z G z z z z G G z z G z G z z G G z z G G z z G z G z z z z z z z z z 17
MuseGAN Bar Generator z G z z z z G G z z G z G z z G G z z G G z z G z G z z z z z z z z z 18
Time MuseGAN Dependent Independent Dependent Melody Groove Track Independent Chords Style Bar Generator Chords z G z Style z z z G G z z G z G z G z G Melody z z G G z z G z G z z z z Groove z z z z z 19
Bass Line Drum pattern Results Sample 1 Sample 2 Chords More Samples on Demo Page https:// salu133445.github.io/musegan / Bass Drums Guitar Strings Piano Step 0 Step 700 Step 2500 Step 6000 Step 7900 20
Monitor the Training Objective Metrics UPC Negative Critic Loss 10 12 10 10 10 8 10 6 step QN 10 4 0 2000 4000 6000 8000 step UPC number of used pitch classes per bar QN ratio of qualified notes step 21
User Study composer H : harmonious R : rhythmic jamming MS : musically structured C : coherent OR: overall rating hybrid 22
Summary 。 MuseGAN ◦ a novel GAN for multi-track sequence generation ◦ multi-track , polyphonic music ◦ human-AI cooperative scenario (see the paper) 。 Lakh Pianoroll Dataset (LPD) ( new dataset!! ) 。 Pypianoroll ( new package!! ) 23
Full Song Generation Future song Works paragraph 1 paragraph 2 paragraph 3 phrase 1 phrase 2 phrase 2 phrase 3 phrase 4 bar 1 bar 2 bar 3 bar 4 beat 1 beat 2 beat 3 beat 4 step 1 step 2 ··· step 24 Hierarchical Temporal Structure 24
Cross-modal Generation 。 Music + Video Future 。 Music + Lyrics Works 。 Video + Text 25
Source Code https://github.com/salu133445/musegan Demo Page https://salu133445.github.io/musegan/ Q&A MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment
Recommend
More recommend