Statistical Machine Learning
Lecture 06 Extra: Expectation Maximization
Kristian Kersting, TU Darmstadt, Summer Term 2020
Based on slides from J. Peters
Expectation Maximization: Basic Idea
[Figure: the log-likelihood L(θ) together with the desired lower bound F(q_t, θ), which touches L(θ) at the current parameters θ_t.]
Expectation Maximization
Requirements
1. Guarantee a lower bound (aka surrogate function)
   F(q, θ) ≤ L(θ)  ∀ q, θ
   where q is the "guessed" distribution over the latent variables and θ are the parameters.
2. Choose q* such that the bound touches the log-likelihood at the current parameters:
   F(q*, θ_t) = L(θ_t)
Expectation Maximization
Expectation Step (E-Step): choose q*_{t+1} such that the bound touches the log-likelihood at the current parameters,
   F(q*_{t+1}, θ_t) = L(θ_t)
[Figure: the new bound F(q*_{t+1}, θ) touches L(θ) at θ_t, improving on the previous bound F(q_t, θ).]
Expectation Maximization
Maximization Step (M-Step): find θ* by maximizing the lower bound,
   θ* = arg max_θ F(q*_{t+1}, θ)
[Figure: the maximizer θ* of the bound F(q*_{t+1}, θ).]
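To make the two steps concrete, the following is a minimal sketch of the full EM loop for a two-component 1-D Gaussian mixture. The mixture model, the function name em_gmm_1d, and the initialization are illustrative assumptions, not part of the lecture.

```python
import numpy as np
from scipy.stats import norm

def em_gmm_1d(x, n_iter=50):
    """Minimal EM sketch for a two-component 1-D Gaussian mixture (toy example)."""
    # Rough initial guesses for mixing weight, means, and standard deviations.
    pi = 0.5
    mu = np.array([x.min(), x.max()])
    sigma = np.array([x.std(), x.std()])
    for _ in range(n_iter):
        # E-step: q*(z_i) = p_theta(z_i | x_i), the posterior "responsibilities".
        p0 = pi * norm.pdf(x, mu[0], sigma[0])
        p1 = (1 - pi) * norm.pdf(x, mu[1], sigma[1])
        r = p0 / (p0 + p1)  # responsibility of component 0 for each x_i
        # M-step: theta_{t+1} = arg max_theta F(q*, theta); closed form for Gaussians.
        pi = r.mean()
        mu = np.array([np.average(x, weights=r),
                       np.average(x, weights=1 - r)])
        sigma = np.array([np.sqrt(np.average((x - mu[0]) ** 2, weights=r)),
                          np.sqrt(np.average((x - mu[1]) ** 2, weights=1 - r))])
    return pi, mu, sigma
```

For data drawn from two well-separated Gaussians, the returned means settle on the two cluster centers after a few iterations; both updates are the standard weighted maximum-likelihood estimates.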
Expectation Maximization
Find a lower bound on L(θ):
   L(θ) = Σ_i log p_θ(x_i)
        = Σ_i log ∫ p_θ(x_i, z_i) dz_i
        = Σ_i log ∫ q(z_i) (p_θ(x_i, z_i) / q(z_i)) dz_i
        ≥ Σ_i ∫ q(z_i) log (p_θ(x_i, z_i) / q(z_i)) dz_i ≡ F(q, θ)   (by Jensen's inequality)
   s.t. ∫ q(z_i) dz_i = 1  ∀ i
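A quick numerical sanity check of the bound, assuming a single observation with a discrete latent z ∈ {0, 1} and hypothetical joint values; any choice of q keeps F(q, θ) below L(θ).

```python
import numpy as np

# Toy check of F(q, theta) <= L(theta) for one observation x with z in {0, 1}.
p_joint = np.array([0.1, 0.3])     # hypothetical p_theta(x, z=0), p_theta(x, z=1)
L = np.log(p_joint.sum())          # log p_theta(x) = log sum_z p_theta(x, z)

rng = np.random.default_rng(0)
for _ in range(5):
    q = rng.dirichlet([1.0, 1.0])            # an arbitrary distribution over z
    F = np.sum(q * np.log(p_joint / q))      # sum_z q(z) log p_theta(x, z)/q(z)
    assert F <= L + 1e-12                    # Jensen's inequality holds
```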
Expectation Maximization
Constrained Optimization Problem
   max_q Σ_i ∫ q(z_i) log (p_θ(x_i, z_i) / q(z_i)) dz_i
   s.t. ∫ q(z_i) dz_i = 1  ∀ i
Expectation Maximization
Lagrangian of the constrained problem:
   𝓛 = Σ_i ∫ q(z_i) log (p_θ(x_i, z_i) / q(z_i)) dz_i + Σ_i λ_i (∫ q(z_i) dz_i − 1)

Setting the gradients to zero:
   ∇_{q(z_i)} 𝓛 = log (p_θ(x_i, z_i) / q(z_i)) − 1 + λ_i = 0
   ⟹ q(z_i) = exp(λ_i − 1) p_θ(x_i, z_i)

   ∇_{λ_i} 𝓛 = ∫ q(z_i) dz_i − 1 = 0
   ⟹ ∫ exp(λ_i − 1) p_θ(x_i, z_i) dz_i = 1
   ⟹ λ_i = 1 − log ∫ p_θ(x_i, z_i) dz_i

Hence q(z_i) = p_θ(x_i, z_i) / ∫ p_θ(x_i, z_i) dz_i = p_θ(z_i | x_i)  ≡  E-step
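Reusing the hypothetical two-value toy example from above, one can check numerically that plugging in the posterior q*(z) = p_θ(z | x) makes the bound tight, i.e. F(q*, θ) = L(θ):

```python
import numpy as np

p_joint = np.array([0.1, 0.3])      # hypothetical p_theta(x, z) for z in {0, 1}
L = np.log(p_joint.sum())           # log-likelihood log p_theta(x)

q_star = p_joint / p_joint.sum()    # E-step: q*(z) = p_theta(z | x)
F_star = np.sum(q_star * np.log(p_joint / q_star))
assert np.isclose(F_star, L)        # the bound touches: F(q*, theta) = L(theta)
```

This is exactly requirement 2 from the beginning: with q* equal to the posterior, every ratio p_θ(x, z)/q*(z) collapses to the constant p_θ(x), so Jensen's inequality holds with equality.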
Expectation Maximization
1. We have a lower bound on the likelihood: F(q, θ) ≤ L(θ).
2. We guaranteed that the bound touches: F(q*_{t+1}, θ_t) = L(θ_t).
3. We want to guarantee L(θ_{t+1}) ≥ L(θ_t). Indeed,
   L(θ_{t+1}) ≥ F(q*_{t+1}, θ_{t+1}) = max_θ F(q*_{t+1}, θ) ≥ F(q*_{t+1}, θ_t) = L(θ_t),
   since the maximum over θ is at least the value at θ_t, where the bound is tight.
4. Therefore choose θ_{t+1} as
   θ_{t+1} = arg max_θ F(q*_{t+1}, θ)
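The monotonicity guarantee L(θ_{t+1}) ≥ L(θ_t) can be observed empirically. Below is a sketch that reuses the toy Gaussian-mixture EM from earlier and asserts that the log-likelihood never decreases across iterations; the data and initialization are again hypothetical.

```python
import numpy as np
from scipy.stats import norm

# Empirical check of L(theta_{t+1}) >= L(theta_t) on a toy two-Gaussian mixture.
rng = np.random.default_rng(1)
x = np.concatenate([rng.normal(-2, 1, 100), rng.normal(3, 1, 100)])

pi, mu, sigma = 0.5, np.array([-1.0, 1.0]), np.array([1.0, 1.0])
prev_ll = -np.inf
for _ in range(30):
    p0 = pi * norm.pdf(x, mu[0], sigma[0])
    p1 = (1 - pi) * norm.pdf(x, mu[1], sigma[1])
    ll = np.log(p0 + p1).sum()          # L(theta_t)
    assert ll >= prev_ll - 1e-9         # likelihood never decreases
    prev_ll = ll
    r = p0 / (p0 + p1)                  # E-step: posterior responsibilities
    pi = r.mean()                       # M-step: closed-form updates
    mu = np.array([np.average(x, weights=r), np.average(x, weights=1 - r)])
    sigma = np.array([np.sqrt(np.average((x - mu[0]) ** 2, weights=r)),
                      np.sqrt(np.average((x - mu[1]) ** 2, weights=1 - r))])
```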