4csll5 parameter estimation supervised and unsupervised
play

4CSLL5 Parameter Estimation (Supervised and Unsupervised) Martin - PowerPoint PPT Presentation

4CSLL5 Parameter Estimation (Supervised and Unsupervised) 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Martin Emms September 20, 2019 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Outline Supervised Maximum Likelihood


  1. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Martin Emms September 20, 2019

  2. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Outline Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D 2nd scenario: (toss Z; (then A or B) 10 ) D

  3. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Outline Parameter Estimation

  4. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Outline Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D 2nd scenario: (toss Z; (then A or B) 10 ) D

  5. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’

  6. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known

  7. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known P ( Z = b ): probability of giving ’b’ when tossed – currently not known

  8. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known P ( Z = b ): probability of giving ’b’ when tossed – currently not known Suppose you have data d recording 100 tosses of Z

  9. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known P ( Z = b ): probability of giving ’b’ when tossed – currently not known Suppose you have data d recording 100 tosses of Z if there were (50 a, 50 b) in d , ’common-sense’ says P ( Z = a ) = 50 / 100

  10. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known P ( Z = b ): probability of giving ’b’ when tossed – currently not known Suppose you have data d recording 100 tosses of Z if there were (50 a, 50 b) in d , ’common-sense’ says P ( Z = a ) = 50 / 100 if there were (30 a, 70 b) in d , ’common-sense’ says P ( Z = a ) = 30 / 100

  11. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Common-sense and relative frequency Suppose a 2-sided ’coin’ Z , one side labelled ’a’, other side labelled ’b’ P ( Z = a ): probability of giving ’a’ when tossed – currently not known P ( Z = b ): probability of giving ’b’ when tossed – currently not known Suppose you have data d recording 100 tosses of Z if there were (50 a, 50 b) in d , ’common-sense’ says P ( Z = a ) = 50 / 100 if there were (30 a, 70 b) in d , ’common-sense’ says P ( Z = a ) = 30 / 100 ie. you ’define’ or ’estimate’ the probability by the relative frequency

  12. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values.

  13. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b )

  14. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b ) let #( a ) be the number of ’a’ outcomes in the sequence d

  15. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b ) let #( a ) be the number of ’a’ outcomes in the sequence d let #( b ) be the number of ’b’ outcomes in the sequence d

  16. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b ) let #( a ) be the number of ’a’ outcomes in the sequence d let #( b ) be the number of ’b’ outcomes in the sequence d the probability of d , assuming the probability settings θ a and θ b is

  17. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b ) let #( a ) be the number of ’a’ outcomes in the sequence d let #( b ) be the number of ’b’ outcomes in the sequence d the probability of d , assuming the probability settings θ a and θ b is × θ #( b ) p ( d ) = θ #( a ) (1) a b

  18. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D Data likelihood assuming the tosses of Z are all independent, can work out the probability of the observed data d if Z ’s probabilities had particular values. let θ a and θ b stand for P ( Z = a ) and P ( Z = b ) let #( a ) be the number of ’a’ outcomes in the sequence d let #( b ) be the number of ’b’ outcomes in the sequence d the probability of d , assuming the probability settings θ a and θ b is × θ #( b ) p ( d ) = θ #( a ) (1) a b different settings of θ a and θ b will give different values for p ( d ) following slides investigate this empirically

  19. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 50 a, 50 b X 1.2e−21 as θ a is varied, data prob p ( d ) varies 8.0e−22 4.0e−22 0.0e+00 0.0 0.2 0.4 0.6 0.8 1.0

  20. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 50 a, 50 b X 1.2e−21 as θ a is varied, data prob p ( d ) varies 8.0e−22 max occurs at θ a = 0 . 5 4.0e−22 0.0e+00 0.0 0.2 0.4 0.6 0.8 1.0

  21. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 50 a, 50 b X 1.2e−21 as θ a is varied, data prob p ( d ) varies 8.0e−22 max occurs at θ a = 0 . 5 4.0e−22 50 which is 50 + 50 0.0e+00 0.0 0.2 0.4 0.6 0.8 1.0

  22. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 30 a, 70 b 4e−19 X 3e−19 as θ a is varied, data prob p ( d ; θ a , θ b ) varies 2e−19 1e−19 0e+00 0.0 0.2 0.4 0.6 0.8 1.0

  23. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 30 a, 70 b 4e−19 X 3e−19 as θ a is varied, data prob p ( d ; θ a , θ b ) varies 2e−19 max occurs at θ a = 0 . 3 1e−19 0e+00 0.0 0.2 0.4 0.6 0.8 1.0

  24. 4CSLL5 Parameter Estimation (Supervised and Unsupervised) Supervised Maximum Likelihood Estimation(MLE) First scenario: (toss a ’coin’ Z) D p ( d ) for 30 a, 70 b 4e−19 X 3e−19 as θ a is varied, data prob p ( d ; θ a , θ b ) varies 2e−19 max occurs at θ a = 0 . 3 1e−19 30 which is 30 + 70 0e+00 0.0 0.2 0.4 0.6 0.8 1.0

Recommend


More recommend