e9 205 machine learning for signal processing
play

E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and - PowerPoint PPT Presentation

E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and Gaussian 28-08-2019 Modeling Instructor - Sriram Ganapathy (sriram@ee.iisc.ernet.in) Teaching Assistant - Prachi Singh (prachisingh@iisc.ac.in). Decision Theory (PRML Chap. 1.5)


  1. E9 205 Machine Learning for Signal Processing ML, MAP, MMSE and Gaussian 28-08-2019 Modeling Instructor - Sriram Ganapathy (sriram@ee.iisc.ernet.in) Teaching Assistant - Prachi Singh (prachisingh@iisc.ac.in).

  2. Decision Theory (PRML Chap. 1.5) ❖ Decision Theory ❖ Inference problem ❖ Finding the joint density ❖ Decision problem ❖ Using the inference to make the classification or regression decision

  3. Decision Problem - Classification ❖ Minimizing the mis-classification error ❖ Decision based on maximum posteriors ❖ Loss matrix ❖ Minimizing the expected loss

  4. Visualizing the Max. Posterior Classifier

  5. Approaches for Inference and Decision I. Finding the joint density from the data. II. Finding the posteriors directly. III. Using discriminant functions for classification.

  6. Approaches for Inference and Decision I. Finding the joint density from the data. II. Finding the posteriors directly. III. Using discriminant functions for classification.

  7. Advantage of Posteriors

  8. Decision Rule for Regression ❖ Minimum mean square error loss ❖ Solution is conditional expectation.

  9. Generative Modeling Non- parametric Generative Classifiers Parametric

  10. Non-parametric Modeling • Non-parametric models do not specify an apriori set of parameters to model the distribution. Example - Histogram # 10 4 3 2.5 2 Bin Count 1.5 1 0.5 0 -0.25 -0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 0.2 0.25 Sample value The density is not smooth and has block like shape.

  11. Non-parametric Modeling • Non-parametric models do not specify an apriori set of parameters to model the distribution. • Example - Kernel Density Estimators Histogram Kernel Density Kernel is a smooth function which obeys certain properties

  12. Non-parametric Modeling • Non-parametric methods are dependent on number of data points • Estimation is difficult for large datasets. • Likelihood computation and model comparisons are hard. • Limited use in classifiers

  13. Parametric Models (Chap 2 PRML) ❖ Collection of probability distributions which are described by a finite dimensional parameter set • Examples - • Poisson Distribution • Bernoulli Distribution • Gaussian Distribution

  14. Gaussian Distribution One of most widely used and well studied model Points of equal probability lie on on contour Diagonal Gaussian with Identical Variance

  15. Gaussian Distribution Insights into two dimensional Gaussian distribution Diagonal Gaussian with different variance

  16. Gaussian Distribution Insights into two dimensional Gaussian Distribution Full covariance Gaussian distribution

  17. Gaussian Distribution Fitting the data with a Gaussian Model

  18. Finding the parameters of the Model ❖ The Gaussian model has the following parameters ❖ Total number of parameters to be learned for D dimensional data is ❖ Given N data points how do we estimate the parameters of model. ❖ Several criteria can be used ❖ The most popular method is the maximum likelihood estimation (MLE).

  19. MLE Define the likelihood function as The maximum likelihood estimator (MLE) is The MLE satisfies nice properties like - Consistency (covergence to true value) - Efficiency (has the least Mean squared error).

  20. MLE For the Gaussian distribution To estimate the parameters

Recommend


More recommend