Perceptual Evaluation of Source Separation for Remixing Music H. - PowerPoint PPT Presentation

Perceptual Evaluation of Source Separation for Remixing Music H. Wierstorf 1 D. Ward 1 E. M. Grais 1 M. D. Plumbley 1 R. Mason 2 C. Hummersone 2 1 Centre for Vision, Speech and Signal Processing, University of Surrey 2 Institute of Sound Recording, University of Surrey 143rd AES Convention 20.10.2017, CC BY 4.0

Source separation for music Reference: vocals others mixture Source separation: vocals others How to talk about source separation? Sound quality : artifacts and distortion added Interference : not perfect separation achieved 1

Source separation for music How to evaluate source separation? BSS eval : signal decomposition and energy ratios 1 PEASS: signal decomposition and auditory model 2 Open questions Correlation with perception has been questioned 3 1 Vincent, et al. (2006), IEEE TASLP , doi: 10.1109/TSA.2005.858005 2 Emiya, et al. (2011), IEEE TASLP , doi: 10.1109/TASL.2011.2109381 3 e.g. Gupta, et al. (2015), WASPAA , doi: 10.1109/WASPAA.2015.7336923 2

BSS eval Decompose signal into different components s estimated = s original + e interferer + e artifacts || s original + e interferer || 2 SAR = 10 log 10 || e artifacts || 2 || s original || 2 SIR = 10 log 10 || e interferer || 2 3

Source separation for music Reference: vocals others mixture Source separation: vocals others How to talk about source separation? Sound quality : artifacts and distortion added Interference : not perfect separation achieved 4

Source separation for music Reference: vocals others mixture Source separation: vocals others mixture How to talk about source separation? Sound quality : artifacts and distortion added Interference : not perfect separation achieved 4

Remixing using source separation Modify component levels 4 Change positions (upmix) 5 Change frequency content 6 Add effects 7 Mashups 4 Itoyama, et al. (2009), ISMIR , pp. 133–138 5 Cobos, et al. (2008), ISCCSP , doi: 10.1109/ISCCSP.2008.4537423 6 Yoshii, et al. (2005), WASPAA , doi: 10.1049/ic.2005.0733 7 Woodruff, et al. (2006), ISMIR , pp. 314–319 5

Evaluation of remixes Evaluate the actual remix Problem if only asked for preference or naturalness 8 Enable for adjustment by listeners 9 Trade-off between artifacts and level increase 10 Predictions with BSS eval? 8 Gillet and Richard (2005), WASPAA , doi: 10.1109/ASPAA.2005.1540232 9 Yoshii, et al. (2005), WASPAA , doi: 10.1049/ic.2005.0733 10 Pons, et al. (2016), JASA , doi: 10.1121/1.4971424 6

Experiment Start with reference mix Introduce changes in level of vocals Rate sound quality and loudness balance Look for correlations with SAR and SIR 7

Experiment “ Loudness balance describes the relation of the overall loudness of the vocals to the overall loudness of the remaining instruments. It does not include short and abrupt changes in loudness that you might experience for some test sounds. It is more considered with the general balance of the vocals and the accompanying instruments. 8

Experiment MUSHRA inspired experiment using Web Audio Evaluation Tool 11 11 Jillings, et al. (2015), SMC , github: BrechtDeMan/WebAudioEvaluationTool 9

Experiment 2 tasks: sound quality and loudness balance 5 source separation algorithms 6 songs (converted to mono) 3 remixes, level of vocal (0 dB, 6 dB, 12 dB) 3 anchor and references for every task loudness anchor: vocals − 14 dB quality anchor: artifacts, distortions, 3 . 5 kHz low pass 15 participants 10

Stimuli Signal separation evaluation campaign (SiSEC) 12 The MUS task includes 23 algorithms and 100 mixed songs 13 SAR: 7 . 7 6 . 1 2 . 8 6 . 3 − 3 . 4 SIR: 10 . 2 11 . 1 8 . 8 6 . 2 7 . 0 Vocal: UHL3 NUG3 OZE GRA3 KON 12 Liutkus, et al. (2017), LVA/ICA , doi: 10.1007/978-3-319-53547-0_31 13 https://www.sisec17.audiolabs-erlangen.de 11

Results Average across medians of every song same quality sound UHL3 NUG3 OZE GRA3 KON worse same loudness balance different 0 6 12 0 6 12 0 6 12 0 6 12 0 6 12 level / dB 12

Influence of song Song 30 Song 48 same quality sound worse 0 dB same loudness balance different f 3 3 E 3 N r f 3 3 E 3 N r e o e o L G Z A O L G Z A O R h R h H U O R H U O R K c K c G G U N n U N n A A system system 13

Influence of song Connected to level balance of original mix? Song 30, level balance: 1 . 7 dB Song 48, level balance: − 5 . 7 dB Weak correlation with both results for 12 dB Two songs were worse in level balance than song 48 14

BSS eval and remixes Correlation for 12 dB conditions same loudness balance r = 0.75 rs = 0.79 different − 5 0 5 10 15 20 25 SIR / dB 15

BSS eval and remixes Correlation for 12 dB conditions same quality sound r = 0.68 rs = 0.67 worse − 4 − 2 0 2 4 6 8 10 12 14 16 SAR / dB 15

BSS eval and remixes Correlation for all conditions 14 same quality sound r = 0.50 rs = 0.83 worse − 10 0 10 20 30 40 50 60 70 80 SAR mix / dB 14 Liu et al. (2015), EUSIPCO , doi: 10.1109/EUSIPCO.2015.7362551 16

Conclusions Source separation methods suitable for level remixing Trade off between achievable level and sound quality Maximum reachable level BSS eval can be used to pick algorithm Connection to adjustment experiments? https://hagenw.github.io 17

http://cvssp.org/events/lva-ica-2018 18

Perceptual Evaluation of Source Separation for Remixing Music H. - PowerPoint PPT Presentation

Perceptual Evaluation of Source Separation for Remixing Music H. Wierstorf 1 D. Ward 1 E. M. Grais 1 M. D. Plumbley 1 R. Mason 2 C. Hummersone 2 1 Centre for Vision, Speech and Signal Processing, University of Surrey 2 Institute of Sound Recording,

Remixing Gotye Katie Wardrobe Midnight Music What is remixing? A new work made from an

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Separation energies A = 21 isobaric chain one-nucleon separation energies two-nucleon separation

Readings Covered Further Readings Ware: Evaluation Appendix Ware, Appendix C: The Perceptual

Killer Engine for Remixing Games Jesse Himmelstein

2018-02-12 Perceptual organization PSY 525.001 Vision Science 2018 Spring Rick Gilmore

and Retrieval Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H.

A Classification Approach to Single Channel Source Separation CS 6772 Project Ron Weiss

Perceptual Evaluation of Color-to-Grayscale Image Conversions Martin adk Czech Technical

Underdetermined Source Separation Using Speaker Subspace Models Thesis Defense Ron Weiss May 4,

Chapter 12. Evaluation Research Chapter 12. Evaluation Research evaluation research? evaluation

User Interface Evaluation Empirical evaluation Heuristic evaluation 1 CS 349 - UI evaluation

Extending propositional separation logic for robustness properties of separation logic Alessio

Text/Graphics Separation Revisited Karl Tombre, Salvatore Tabbone, Loc Plissier, Bart

Temporary Read-Only Permissions for Separation Logic Making Separation Logics Small Axioms

Decomposition Announcements Modular Design Separation of Concerns 4 Separation of Concerns A

PLN BERSIH: Clean Business! A Civil Society Support for Mainstreaming Corporate Governance in

Policy and Strategy: Accelerating Poverty Reduction, Reducing Unemployment, and Overcoming

4Q17 & FY17 Performance Results Jakarta, 26 February 2018 1 Agenda 1 INTRODUCTION 2

Junior Achievement South Africa Celebrating 40 years of empowering youth Through experiential

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Jeffrey Wennberg Commissioner of Public Works City of Rutland 1 City of Rutland CSO Planning

Learning in Graphical Models Problem Dimensions Model Bayes Nets Markov Nets

WORKSHOP ON ELETRONIC PRODUCT INFORMATION (ePI) Sine Jensen Danish Consumer Council -

Sambuz

Useful Links

Newsletter

Mail Us

Perceptual Evaluation of Source Separation for Remixing Music H. - PowerPoint PPT Presentation

Perceptual Evaluation of Source Separation for Remixing Music H. Wierstorf 1 D. Ward 1 E. M. Grais 1 M. D. Plumbley 1 R. Mason 2 C. Hummersone 2 1 Centre for Vision, Speech and Signal Processing, University of Surrey 2 Institute of Sound Recording,

Remixing Gotye Katie Wardrobe Midnight Music What is remixing? A new work made from an

9.4 Local Perception Filters 9.4 Local Perception Filters Exploiting Exploiting Perceptual

Separation energies A = 21 isobaric chain one-nucleon separation energies two-nucleon separation

Readings Covered Further Readings Ware: Evaluation Appendix Ware, Appendix C: The Perceptual

Killer Engine for Remixing Games Jesse Himmelstein

2018-02-12 Perceptual organization PSY 525.001 Vision Science 2018 Spring Rick Gilmore

and Retrieval Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H. Jegou Source: H.

A Classification Approach to Single Channel Source Separation CS 6772 Project Ron Weiss

Perceptual Evaluation of Color-to-Grayscale Image Conversions Martin adk Czech Technical

Underdetermined Source Separation Using Speaker Subspace Models Thesis Defense Ron Weiss May 4,

Chapter 12. Evaluation Research Chapter 12. Evaluation Research evaluation research? evaluation

User Interface Evaluation Empirical evaluation Heuristic evaluation 1 CS 349 - UI evaluation

Extending propositional separation logic for robustness properties of separation logic Alessio

Text/Graphics Separation Revisited Karl Tombre, Salvatore Tabbone, Loc Plissier, Bart

Temporary Read-Only Permissions for Separation Logic Making Separation Logics Small Axioms

Decomposition Announcements Modular Design Separation of Concerns 4 Separation of Concerns A

PLN BERSIH: Clean Business! A Civil Society Support for Mainstreaming Corporate Governance in

Policy and Strategy: Accelerating Poverty Reduction, Reducing Unemployment, and Overcoming

4Q17 &amp; FY17 Performance Results Jakarta, 26 February 2018 1 Agenda 1 INTRODUCTION 2

Junior Achievement South Africa Celebrating 40 years of empowering youth Through experiential

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Jeffrey Wennberg Commissioner of Public Works City of Rutland 1 City of Rutland CSO Planning

Learning in Graphical Models Problem Dimensions Model Bayes Nets Markov Nets

WORKSHOP ON ELETRONIC PRODUCT INFORMATION (ePI) Sine Jensen Danish Consumer Council -

Sambuz

Useful Links

Newsletter

Mail Us

4Q17 & FY17 Performance Results Jakarta, 26 February 2018 1 Agenda 1 INTRODUCTION 2