
An Efficient Posterior Regularized Latent Variable Model for Interactive Sound Source Separation
Nicholas J. Bryan, Stanford University
Gautham J. Mysore, Adobe Research
ICML 2013

Sound Check

Motivation I
• Real-world sounds are mixtures of many individual sounds

Current State-of-the-Art
• Non-negative matrix factorization (NMF) [Lee & Seung, 2001; Smaragdis & Brown, 2003]
• Related latent variable models (LVM) [Raj & Smaragdis, 2005; Smaragdis et al., 2006]

Latent Variable Model
• Probabilistic latent component analysis (PLCA) [Smaragdis et al., 2006]

$$X \approx P(f, t) = \sum_z P(z)\, P(f \mid z)\, P(t \mid z)$$

• $P(f \mid z)$: basis vectors, frequency components, dictionary
• $P(z)$: latent component weights
• $P(t \mid z)$: time activations or gains

Latent Variable Model

$$X \approx P(f, t) = \sum_z P(z)\, P(f \mid z)\, P(t \mid z)$$

• Solve via an expectation-maximization (EM) algorithm
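The EM procedure for the PLCA model above can be sketched as follows. This is a minimal NumPy illustration, not the authors' code: the function names `plca` and `reconstruct`, the random initialization, and the fixed iteration count are assumptions made for the example.

```python
import numpy as np

def plca(X, n_components, n_iter=100, seed=0):
    """Fit P(f,t) = sum_z P(z) P(f|z) P(t|z) to a non-negative matrix X by EM."""
    rng = np.random.default_rng(seed)
    F, T = X.shape
    eps = 1e-12  # guards against division by zero
    # Random initialization; every distribution is normalized to sum to one.
    Pz = np.full(n_components, 1.0 / n_components)
    Pf_z = rng.random((F, n_components))
    Pf_z /= Pf_z.sum(axis=0, keepdims=True)
    Pt_z = rng.random((T, n_components))
    Pt_z /= Pt_z.sum(axis=0, keepdims=True)
    for _ in range(n_iter):
        # E step: posterior P(z|f,t), stored with shape (F, T, Z).
        joint = Pz * Pf_z[:, None, :] * Pt_z[None, :, :]
        post = joint / (joint.sum(axis=2, keepdims=True) + eps)
        # M step: weight the posterior by the data, then renormalize.
        W = X[:, :, None] * post
        Pf_z = W.sum(axis=1)           # unnormalized, shape (F, Z)
        Pt_z = W.sum(axis=0)           # unnormalized, shape (T, Z)
        Pz = Pf_z.sum(axis=0)          # unnormalized, shape (Z,)
        Pf_z = Pf_z / (Pz + eps)
        Pt_z = Pt_z / (Pz + eps)
        Pz = Pz / (Pz.sum() + eps)
    return Pz, Pf_z, Pt_z

def reconstruct(Pz, Pf_z, Pt_z):
    """Model distribution P(f,t) as an (F, T) matrix that sums to one."""
    return (Pf_z * Pz) @ Pt_z.T
```

Because the model is a probability distribution, `reconstruct` approximates `X / X.sum()` rather than `X` itself; multiply by `X.sum()` to compare against the original magnitudes.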

Problems
• Requires isolated training data (supervised/semi-supervised)
• Does not incorporate auditory/perceptual models of hearing
• One-shot process; cannot correct for poor results
• Very difficult, underdetermined problem

Focus
• Eliminate the need for explicit training data
• Method of user feedback to guide separation
• Algorithm to incorporate the user feedback

Paradigm: Listen, Paint, Remove
[Figure: looping playback; panels: Speech + Cell Phone, Speech, Cell Phone]

Latent Variable Model w/Painting Constraints

$$\tilde{P}(f, t) = \sum_z \tilde{P}(z)\, \tilde{P}(f \mid z)\, \tilde{P}(t \mid z)$$

[Figure labels: $\Lambda_1$, $\Lambda_2$, $P(f \mid z)$, $P(z)$, $P(t \mid z)$]
• Incorporate painting annotations into the model

Constraints
• Constraints are typically encoded on $P(z)$, $P(f \mid z)$, $P(t \mid z)$ as:
  • Prior probabilities on model parameters
  • Direct observations
• Does not (reasonably) allow time-frequency constraints
• Posterior regularization [Graça et al., 2007, 2009]
  • Complementary method that allows time-frequency constraints on $P(z \mid f, t)$
  • Iterative optimization procedure for each E step
  • Well suited for our problem

Expectation Maximization

$$\ln P(X \mid \Theta) = F(Q, \Theta) + \mathrm{KL}(Q \,\|\, P)$$
$$\ln P(X \mid \Theta) \ge F(Q, \Theta)$$

E step:
$$Q^{n+1} = \arg\max_{Q} F(Q, \Theta^{n}) = \arg\min_{Q} \mathrm{KL}(Q \,\|\, P)$$

M step:
$$\Theta^{n+1} = \arg\max_{\Theta} F(Q^{n+1}, \Theta)$$
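The decomposition above can be checked numerically for a single observation with a discrete latent variable. The sketch below is only an illustration: the function name `free_energy_and_kl` and the toy numbers are assumptions, with `joint[z]` standing for $P(x, z \mid \Theta)$ and `q[z]` for an arbitrary distribution $Q(z)$.

```python
import numpy as np

def free_energy_and_kl(joint, q):
    """Return (F(Q, Theta), KL(Q || P(z|x))) for one observation x.

    joint[z] = P(x, z | Theta); q[z] = Q(z), strictly positive, sums to one.
    """
    posterior = joint / joint.sum()                    # P(z | x, Theta)
    F = np.sum(q * (np.log(joint) - np.log(q)))        # lower bound on ln P(x)
    kl = np.sum(q * (np.log(q) - np.log(posterior)))   # gap to the bound
    return F, kl

# Toy values, chosen only for illustration.
joint = np.array([0.10, 0.05, 0.25])
q = np.array([0.5, 0.3, 0.2])
F, kl = free_energy_and_kl(joint, q)
log_lik = np.log(joint.sum())                          # ln P(x | Theta)

# E step without constraints: choosing Q equal to the posterior closes the gap.
F_opt, kl_opt = free_energy_and_kl(joint, joint / joint.sum())
```

Here `F + kl` equals `log_lik` up to floating-point error, and `kl_opt` is zero, which is why the unconstrained E step simply sets $Q$ to the posterior.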

Expectation Maximization w/Posterior Constraints I

$$\ln P(X \mid \Theta) = F(Q, \Theta) + \mathrm{KL}(Q \,\|\, P)$$
$$\ln P(X \mid \Theta) \ge F(Q, \Theta)$$

E step:
$$Q^{n+1} = \arg\max_{Q \in \mathcal{Q}} F(Q, \Theta^{n}) = \arg\min_{Q \in \mathcal{Q}} \mathrm{KL}(Q \,\|\, P)$$

M step:
$$\Theta^{n+1} = \arg\max_{\Theta} F(Q^{n+1}, \Theta)$$

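Linear Grouping Expectation Constraints

$$\arg\min_{Q \in \mathcal{Q}} \mathrm{KL}\big(Q(z \mid f, t) \,\|\, P(z \mid f, t)\big)$$

• For each time-frequency point of $P(z \mid f, t)$, solve

$$\arg\min_{q} \; -q^{T} \ln p + q^{T} \ln q + q^{T} \lambda \quad \text{subject to} \quad q^{T} \mathbf{1} = 1, \; q \succeq 0$$

$$\lambda^{T} = [\, \Lambda_{1,ft} \;\; \Lambda_{1,ft} \;\; \Lambda_{1,ft} \;\; \dots \;\; \Lambda_{2,ft} \;\; \Lambda_{2,ft} \;\; \Lambda_{2,ft} \,]$$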
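For the per-point problem above, setting the gradient of the Lagrangian to zero gives $\ln q = \ln p - \lambda + \text{const}$, so the minimizer is $q \propto p \odot e^{-\lambda}$, renormalized to sum to one (non-negativity then holds automatically). The sketch below illustrates this; the function names and the toy values of `p` and `lam` are assumptions for the example.

```python
import numpy as np

def regularized_posterior(p, lam):
    """Minimizer of -q.ln(p) + q.ln(q) + q.lam over the probability simplex."""
    w = p * np.exp(-lam)
    return w / w.sum()

def objective(q, p, lam):
    """Objective of the per-point problem: KL(q || p) plus the linear penalty."""
    return np.sum(-q * np.log(p) + q * np.log(q) + q * lam)

# Toy point with three components: the first two belong to source 1 and share
# its penalty, the third belongs to source 2 and is not penalized here.
p = np.array([0.5, 0.3, 0.2])
lam = np.array([2.0, 2.0, 0.0])
q_star = regularized_posterior(p, lam)
```

Components of the same source share one penalty value, so the constraint rescales whole groups of components and leaves the ratios within a group unchanged.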

Fast Updates
• With a simple penalty, both the E and M steps are in closed form
• Reduces to simple, fast multiplicative updates, comparable to those of NMF
• Roughly the same computational cost as without constraints
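One way such a constrained EM loop could look is sketched below, assuming the constrained E step has the closed form $Q \propto P(z \mid f, t)\, e^{-\lambda}$ implied by a linear penalty. This is an illustration rather than the authors' implementation: `pr_plca`, `source_estimate`, the `penalties` layout (one non-negative F-by-T array per source) and `comps_per_source` are all hypothetical names and choices.

```python
import numpy as np

def pr_plca(X, penalties, comps_per_source=2, n_iter=50, seed=0):
    """PLCA with per-source time-frequency penalties applied in the E step."""
    rng = np.random.default_rng(seed)
    F, T = X.shape
    eps = 1e-12
    # src[z] is the index of the source that component z belongs to.
    src = np.repeat(np.arange(len(penalties)), comps_per_source)
    Z = src.size
    # exp(-Lambda_s(f,t)) expanded to one slice per component: shape (F, T, Z).
    scale = np.exp(-np.stack(penalties, axis=2))[:, :, src]
    Pz = np.full(Z, 1.0 / Z)
    Pf_z = rng.random((F, Z))
    Pf_z /= Pf_z.sum(axis=0, keepdims=True)
    Pt_z = rng.random((T, Z))
    Pt_z /= Pt_z.sum(axis=0, keepdims=True)
    for _ in range(n_iter):
        # Constrained E step: posterior times exp(-penalty), then renormalize.
        joint = Pz * Pf_z[:, None, :] * Pt_z[None, :, :]
        q = joint * scale
        q = q / (q.sum(axis=2, keepdims=True) + eps)
        # M step: identical to unconstrained PLCA, but using q.
        W = X[:, :, None] * q
        Pf_z, Pt_z = W.sum(axis=1), W.sum(axis=0)
        Pz = Pf_z.sum(axis=0)
        Pf_z = Pf_z / (Pz + eps)
        Pt_z = Pt_z / (Pz + eps)
        Pz = Pz / (Pz.sum() + eps)
    return Pz, Pf_z, Pt_z, src

def source_estimate(X, Pz, Pf_z, Pt_z, src, s):
    """Soft-mask estimate of the share of X explained by source s."""
    joint = Pz * Pf_z[:, None, :] * Pt_z[None, :, :]
    total = joint.sum(axis=2) + 1e-12
    return X * joint[:, :, src == s].sum(axis=2) / total
```

The only extra work relative to plain PLCA is one elementwise multiplication by `scale` per iteration, which is consistent with the cost claim on the slide. With all penalties set to zero the loop reduces to ordinary PLCA.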

Evaluation
• BSS-EVAL metrics [Vincent et al., 2006]
  • Signal-to-Distortion Ratio (SDR)
  • Signal-to-Interference Ratio (SIR)
  • Signal-to-Artifact Ratio (SAR)
• Test material
  • Cell phone + speech (C), drums + bass (D), orchestra + cough (O), piano + wrong note (P), siren + speech (S)
  • Vocals + background music (S1, S2, S3, S4)
• Results
  • Outperformed prior state-of-the-art on tested material
  • Outperformed SiSEC 2011 vocals + background music winner
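To make the three metrics concrete, the sketch below computes them for the simplest distortion model, where the estimate may differ from the true source only by a time-invariant gain. This is an assumption made for brevity: BSS-EVAL toolkits typically allow time-invariant filters, so numbers from this sketch are not comparable to reported results. The function name `bss_eval_gain_only` is hypothetical.

```python
import numpy as np

def bss_eval_gain_only(estimate, sources, j):
    """SDR, SIR, SAR in dB for a gain-only distortion model (no noise term).

    estimate: (N,) estimated signal for source j.
    sources:  (n_src, N) true source signals.
    """
    s = sources[j]
    # Target: projection of the estimate onto the true source j.
    s_target = (estimate @ s) / (s @ s) * s
    # Projection of the estimate onto the span of all true sources.
    coeffs, *_ = np.linalg.lstsq(sources.T, estimate, rcond=None)
    proj = sources.T @ coeffs
    e_interf = proj - s_target      # leakage from the other sources
    e_artif = estimate - proj       # whatever no true source explains

    def db(num, den):
        return 10.0 * np.log10(np.sum(num ** 2) / np.sum(den ** 2))

    sdr = db(s_target, e_interf + e_artif)
    sir = db(s_target, e_interf)
    sar = db(s_target + e_interf, e_artif)
    return sdr, sir, sar
```

Since the interference and artifact terms are orthogonal in this model, SDR can never exceed SIR or SAR; it summarizes both kinds of error in one number.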

Live Demonstration

Jackson 5 Remix
• Jackson 5's "I Want You Back"
• Cher Lloyd's "Want U Back"
• Remix

A Look Back
• Perceptual domain; objective evaluation is difficult
• Human evaluation within the learning process
• Processing training data only

Conclusion
• Sound source separation algorithm
• Time-frequency constraints via posterior regularization
• No explicit training data
• Efficient, interactive algorithm w/closed-form update equations
• Improved separation quality over prior work
• Open source software
• Poster ID: 348
• Demos at ccrma.stanford.edu/~njb/research/iss

