unsupervised piano music transcription
play

Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick - PowerPoint PPT Presentation

Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein CMU UC Berkeley Piano Music Transcription note time Supervised Transcription Supervised Transcription w Model


  1. Unsupervised Piano Music Transcription Taylor Berg-Kirkpatrick Jacob Andreas and Dan Klein CMU UC Berkeley

  2. Piano Music Transcription note time

  3. Supervised Transcription

  4. Supervised Transcription w Model

  5. Supervised Transcription

  6. Learning to Transcribe Learning w

  7. Prediction w ? Model

  8. Prediction w Model

  9. Piano Sounds

  10. Piano Sounds

  11. Piano Sounds

  12. Piano Sounds

  13. Piano Sounds

  14. Piano Sounds

  15. Piano Sounds freq time

  16. Spectral Shape freq time

  17. Spectral Shape freq time

  18. Spectral Shape freq time

  19. Spectral Shape freq time

  20. Temporal Shape freq time

  21. Temporal Shape freq time

  22. Temporal Shape freq time

  23. Temporal Shape freq time

  24. Temporal Shape freq time

  25. Temporal Shape freq time

  26. Temporal Shape freq time

  27. Temporal Shape freq time

  28. Temporal Shape freq time

  29. Temporal Shape freq time

  30. Temporal Shape freq time

  31. Temporal Shape freq time

  32. Polyphony

  33. Polyphony . . .

  34. Unsupervised Transcription

  35. Unsupervised Transcription Audio signal Symbolic Music

  36. Unsupervised Transcription Learning Piano Parameters ? Audio signal Generative Model Symbolic ? Music

  37. Unsupervised Transcription Learning Piano Parameters Generative ? Model Audio signal Symbolic ? Music

  38. Generative Model note n time Note events velocity M n time

  39. Generative Model Parameters Latent variables Note events PLAY REST M n µ n time duration velocity Activation A n α n time time Component spectrogram freq freq S n σ n time Spectrogram X freq time

  40. Note Event Model M n µ n PLAY Event type REST PLAY REST PLAY E 1 E 2 E 3 Duration duration D 1 D 2 D 3 Velocity velocity V 1 V 2 V 3

  41. Activation Model D 1 D 2 D 3 V 1 V 2 V 3

  42. Activation Model Temporal α n shape D 1 D 2 D 3 V 1 V 2 V 3 copy temporal shape Activation A n

  43. Activation Model Temporal α n shape D 1 D 2 D 3 V 1 V 2 V 3 truncate to duration Activation A n

  44. Activation Model Temporal α n shape D 1 D 2 D 3 V 1 V 2 V 3 scale to velocity Activation A n

  45. Activation Model Temporal α n shape D 1 D 2 D 3 V 1 V 2 V 3 add Gaussian noise Activation A n

  46. Component Spectrogram Model Activation A n Spectral shape Poisson noise σ n S n Component spectrogram

  47. Total Spectrogram Model A 1 A N σ 1 σ N . . . + S 1 S N X Total spectrogram

  48. Learning and Inference Parameters Latent variables Note events PLAY REST M n µ n time duration velocity Activation A n α n time time Component spectrogram freq freq S n σ n time Spectrogram X freq time

  49. Learning and Inference Note events update: Semi-Markov dynamic program M | A, α , µ Temporal shapes update: Closed form update α | A, M Activations update: Exponentiated gradient ascent A | M, X, α , σ Spectral shapes update: Exponentiated gradient ascent σ | A, X

  50. Evaluation Onset F1 note time

  51. Results MAPS Corpus 80 82.1 Onset F1 70 70.4 69.0 68.6 60 58.3 50 O’Hanlon Benetos Vincent Supervised Unsupervised* 2014 2014 2013 [Valentin et al. 2010] [Berg-Kirkpatrick et al. 2014]

  52. Transcription Reference Predicted

  53. Resynthesized Examples Grieg input Grieg resynth piano Grieg resynth guitar

  54. Demo Demo!

  55. Resynthesized Examples Chopin input Chopin resynth piano Chopin resynth guitar

  56. Resynthesized Examples Beethoven input Beethoven resynth piano Beethoven resynth guitar

Recommend


More recommend