E9 205 Machine Learning for Signal Processing (04-09-2019)
MLE for Gaussian and Mixture Gaussian Models
Instructor: Sriram Ganapathy (sriramg@iisc.ac.in). Teaching Assistant: Prachi Singh (prachisingh@iisc.ac.in).
Finding the Parameters of the Model
• The Gaussian model has the parameters $\mu$ (mean vector) and $\Sigma$ (covariance matrix).
• The total number of parameters to be learned for $D$-dimensional data is $D + \frac{D(D+1)}{2}$ ($D$ for the mean and $D(D+1)/2$ for the symmetric covariance).
• Given $N$ data points, how do we estimate the parameters of the model?
• Several criteria can be used.
• The most popular method is maximum likelihood estimation (MLE).
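For instance, with $D = 3$ the mean contributes 3 parameters and the symmetric covariance contributes $3 \cdot 4/2 = 6$, giving 9 parameters in total.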
MLE
Define the likelihood function over i.i.d. data $X = \{x_1, \ldots, x_N\}$ as
$L(\theta) = p(X \mid \theta) = \prod_{n=1}^{N} p(x_n \mid \theta)$
The maximum likelihood estimator (MLE) is
$\hat{\theta}_{ML} = \arg\max_{\theta} L(\theta) = \arg\max_{\theta} \log L(\theta)$
The MLE satisfies nice properties like
- Consistency (convergence to the true parameter value as $N \to \infty$)
- Efficiency (asymptotically achieves the least mean squared error)
MLE
For the Gaussian distribution, the log-likelihood is
$\log L(\mu, \Sigma) = -\frac{N}{2}\log\lvert 2\pi\Sigma\rvert - \frac{1}{2}\sum_{n=1}^{N}(x_n - \mu)^{T}\Sigma^{-1}(x_n - \mu)$
To estimate the parameters, set the derivatives with respect to $\mu$ and $\Sigma$ to zero.
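A minimal numpy sketch of evaluating the log-likelihood above (function and variable names are illustrative, not from the slides):

    import numpy as np

    def gaussian_log_likelihood(X, mu, Sigma):
        """Log-likelihood of the (N, D) data matrix X under N(mu, Sigma)."""
        N, D = X.shape
        diff = X - mu                                   # (N, D) residuals
        Sigma_inv = np.linalg.inv(Sigma)
        # Mahalanobis terms (x_n - mu)^T Sigma^{-1} (x_n - mu) for every point
        mahal = np.einsum('nd,de,ne->n', diff, Sigma_inv, diff)
        _, logdet = np.linalg.slogdet(Sigma)
        return -0.5 * (N * (D * np.log(2.0 * np.pi) + logdet) + mahal.sum())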
MLE
Using matrix differentiation rules for the trace and log-determinant, for a symmetric matrix $A$:
$\frac{\partial\, \mathrm{tr}(AB)}{\partial A} = B + B^{T} - \mathrm{diag}(B)$
$\frac{\partial \log\lvert A\rvert}{\partial A} = 2A^{-1} - \mathrm{diag}(A^{-1})$
Setting the derivatives to zero gives the sample mean and sample covariance:
$\hat{\mu} = \frac{1}{N}\sum_{n=1}^{N} x_n \qquad \hat{\Sigma} = \frac{1}{N}\sum_{n=1}^{N}(x_n - \hat{\mu})(x_n - \hat{\mu})^{T}$
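A small sketch of these closed-form estimates (illustrative names; note that the MLE covariance divides by N, not N-1):

    import numpy as np

    def fit_gaussian_mle(X):
        """Closed-form ML estimates for a Gaussian from data X of shape (N, D)."""
        N = X.shape[0]
        mu_hat = X.mean(axis=0)             # sample mean
        diff = X - mu_hat
        Sigma_hat = diff.T @ diff / N       # sample covariance (divide by N, not N-1)
        return mu_hat, Sigma_hat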
Gaussian Distribution
Often the data lies in clusters (2-D example). Fitting a single Gaussian model may be too broad to capture such cluster structure.
Gaussian Distribution
Need mixture models: with enough components, a mixture of Gaussians can approximate essentially arbitrary data distributions.
Gaussian Distribution 1-D example
Gaussian Distribution Summary
• The Gaussian model - a parametric distribution
• Simple and useful properties
• Can model unimodal (single-peak) distributions
• MLE gives intuitive results (sample mean and sample covariance)
• Issues with the Gaussian model
• Cannot model multi-modal data
• Not useful for complex data distributions
• Need for mixture models
Gaussian Mixture Models
A Gaussian Mixture Model (GMM) is defined as
$p(x) = \sum_{k=1}^{K} \pi_k \, \mathcal{N}(x \mid \mu_k, \Sigma_k)$
The weighting coefficients have the property
$\pi_k \ge 0, \qquad \sum_{k=1}^{K} \pi_k = 1$
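A short sketch of evaluating this density at a single point (illustrative names, not from the slides):

    import numpy as np

    def gmm_density(x, weights, means, covs):
        """Evaluate p(x) = sum_k pi_k N(x | mu_k, Sigma_k) at one D-dimensional point x."""
        D = x.shape[0]
        total = 0.0
        for pi_k, mu_k, Sigma_k in zip(weights, means, covs):
            diff = x - mu_k
            norm_const = ((2.0 * np.pi) ** D * np.linalg.det(Sigma_k)) ** -0.5
            total += pi_k * norm_const * np.exp(-0.5 * diff @ np.linalg.inv(Sigma_k) @ diff)
        return total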
Gaussian Mixture Models
• Properties of GMM
• Can model multi-modal data.
• Identify data clusters.
• Can model arbitrarily complex data distributions.
The set of parameters for the model is $\theta = \{\pi_k, \mu_k, \Sigma_k\}_{k=1}^{K}$
The number of parameters is $K\left(D + \frac{D(D+1)}{2}\right) + (K-1)$ (the mixture weights contribute $K-1$ free parameters because they sum to one).
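For example, a GMM with $K = 4$ components in $D = 3$ dimensions has $4(3 + 6) + 3 = 39$ parameters.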
MLE for GMM
• The log-likelihood function over the entire data in this case has a logarithm of a summation:
$\log L(\theta) = \sum_{n=1}^{N} \log\left(\sum_{k=1}^{K} \pi_k \, \mathcal{N}(x_n \mid \mu_k, \Sigma_k)\right)$
• Solving for the optimal parameters using MLE for a GMM is therefore not straightforward (no closed-form solution).
• Resort to the Expectation Maximization (EM) algorithm.
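As a preview of the EM algorithm named above, a compact numpy sketch of the E- and M-steps for a GMM (initialization, regularization, and convergence checks are illustrative assumptions, not from the slides):

    import numpy as np

    def em_gmm(X, K, n_iter=100, seed=0):
        """A simple EM loop for a GMM; a sketch, not a robust implementation."""
        rng = np.random.default_rng(seed)
        N, D = X.shape
        weights = np.full(K, 1.0 / K)
        means = X[rng.choice(N, K, replace=False)]             # K data points as initial means
        covs = np.array([np.cov(X.T) + 1e-6 * np.eye(D)] * K)  # shared initial covariance
        for _ in range(n_iter):
            # E-step: responsibilities gamma_{nk} proportional to pi_k N(x_n | mu_k, Sigma_k)
            resp = np.zeros((N, K))
            for k in range(K):
                diff = X - means[k]
                inv = np.linalg.inv(covs[k])
                mahal = np.einsum('nd,de,ne->n', diff, inv, diff)
                norm_const = ((2.0 * np.pi) ** D * np.linalg.det(covs[k])) ** -0.5
                resp[:, k] = weights[k] * norm_const * np.exp(-0.5 * mahal)
            resp /= resp.sum(axis=1, keepdims=True)
            # M-step: re-estimate weights, means and covariances from the responsibilities
            Nk = resp.sum(axis=0)
            weights = Nk / N
            means = (resp.T @ X) / Nk[:, None]
            for k in range(K):
                diff = X - means[k]
                covs[k] = (resp[:, k, None] * diff).T @ diff / Nk[k] + 1e-6 * np.eye(D)
        return weights, means, covs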
Basics of Information Theory • Entropy of distribution • KL divergence • Jensen’s inequality • Expectation Maximization Algorithm for MLE