martingales
play

Martingales and the Method of Bounded Differences Advanced - PowerPoint PPT Presentation

Martingales and the Method of Bounded Differences Advanced Algorithms Nanjing University, Fall 2018 (Some) Concentration Inequalities Question: probability that X deviates more than from expectation? (Some) Concentration Inequalities


  1. Fundamental Facts about Conditional Expectation Example : 𝑍 : height of the chosen human being π‘Œ : country of origin of the chosen human being π‘Ž : gender of the chosen human being 𝔽 𝑍 = 𝔽 𝔽 𝑍 | π‘Œ average height of all human beings = weighted average of the country-by-country average heights 𝔽 𝑍 | π‘Ž = 𝔽 𝔽 𝑍 | π‘Œ, π‘Ž | π‘Ž average height of all male/female human beings = weighted average of the country-by-country average male/female heights 𝔽 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝔽 𝑔 π‘Œ 𝔽 𝑕 π‘Œ, 𝑍 | π‘Œ

  2. Fundamental Facts about Conditional Expectation Example : 𝑍 : height of the chosen human being π‘Œ : country of origin of the chosen human being π‘Ž : gender of the chosen human being 𝔽 𝑍 = 𝔽 𝔽 𝑍 | π‘Œ average height of all human beings = weighted average of the country-by-country average heights 𝔽 𝑍 | π‘Ž = 𝔽 𝔽 𝑍 | π‘Œ, π‘Ž | π‘Ž average height of all male/female human beings = weighted average of the country-by-country average male/female heights 𝔽 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝔽 𝑔 π‘Œ 𝔽 𝑕 π‘Œ, 𝑍 | π‘Œ once π‘Œ is fixed to some 𝑦 , 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝑦 = 𝑔(𝑦)𝔽 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝑦

  3. Fundamental Facts about Conditional Expectation (Cont.) 𝔽 𝑍 = 𝔽 𝔽 𝑍 | π‘Œ 𝔽 𝑍 | π‘Ž = 𝔽 𝔽 𝑍 | π‘Œ, π‘Ž | π‘Ž 𝔽 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝔽 𝑔 π‘Œ 𝔽 𝑕 π‘Œ, 𝑍 | π‘Œ

  4. Fundamental Facts about Conditional Expectation (Cont.) 𝔽 𝑍 = 𝔽 𝔽 𝑍 | π‘Œ 𝔽 𝑍 | π‘Ž = 𝔽 𝔽 𝑍 | π‘Œ, π‘Ž | π‘Ž 𝔽 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 | π‘Œ = 𝔽 𝑔 π‘Œ 𝔽 𝑕 π‘Œ, 𝑍 | π‘Œ Generalization to the multivariate case: 𝔽 𝑍 = 𝔽 𝔽 𝑍 | Τ¦ π‘Œ 𝔽 𝑍 | Τ¦ π‘Ž = 𝔽 𝔽 𝑍 | Τ¦ π‘Œ, Τ¦ π‘Ž | Τ¦ π‘Ž 𝔽 𝔽 𝑔 Τ¦ π‘Œ 𝑕 Τ¦ π‘Œ, 𝑍 | Τ¦ = 𝔽 𝑔 Τ¦ π‘Œ 𝔽 𝑕 Τ¦ π‘Œ, 𝑍 | Τ¦ π‘Œ π‘Œ

  5. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss”

  6. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0

  7. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0 consider a fair game, with any betting strategy

  8. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0 consider a fair game, with any betting strategy let π‘Œ 𝑗 be our wealth after 𝑗 rounds

  9. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0 consider a fair game, with any betting strategy let π‘Œ 𝑗 be our wealth after 𝑗 rounds 𝔽 π‘Œ 𝑗+1 | π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ 𝑗 =

  10. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0 consider a fair game, with any betting strategy let π‘Œ 𝑗 be our wealth after 𝑗 rounds 𝔽 π‘Œ 𝑗+1 | π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ 𝑗 = π‘Œ 𝑗

  11. Martingales originally refers to a betting strategy: β€œdouble your bet after every loss” π‘œβˆ’1 2 𝑗 = 1 when you get a win after π‘œ losses: 2 π‘œ βˆ’ Οƒ 𝑗=0 consider a fair game, with any betting strategy let π‘Œ 𝑗 be our wealth after 𝑗 rounds 𝔽 π‘Œ 𝑗+1 | π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ 𝑗 = π‘Œ 𝑗 since the game is fair , conditioned on past history, we expect no change to current value after one round

  12. Martingales

  13. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  14. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  15. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  16. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  17. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  18. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  19. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  20. Example: Coin Flipping toss a fair coin for many times measure the differences between # of heads and # of tails

  21. Example: Random Walk a dot starting from the origin in each step, move equiprobably to one of four neighbors

  22. Example: Random Walk a dot starting from the origin in each step, move equiprobably to one of four neighbors after 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance)

  23. Example: Random Walk a dot starting from the origin in each step, move equiprobably to one of four neighbors after 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance)

  24. Example: Random Walk a dot starting from the origin in each step, move equiprobably to one of four neighbors after 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance)

  25. Example: Random Walk a dot starting from the origin in each step, move equiprobably to one of four neighbors after 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How far the dot is away from the origin after π‘œ steps?

  26. Azuma’s Inequality

  27. Azuma’s Inequality π‘Œ 0 , π‘Œ 1 , β‹― are not necessarily independent

  28. Azuma’s Inequality in Action After 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How large is π‘Œ π‘œ ?

  29. Azuma’s Inequality in Action After 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How large is π‘Œ π‘œ ?

  30. Azuma’s Inequality in Action After 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How large is π‘Œ π‘œ ? We know π‘Œ 0 = 0 , and π‘Œ 𝑙 βˆ’ π‘Œ π‘™βˆ’1 ≀ 1

  31. Azuma’s Inequality in Action After 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How large is π‘Œ π‘œ ? We know π‘Œ 0 = 0 , and π‘Œ 𝑙 βˆ’ π‘Œ π‘™βˆ’1 ≀ 1

  32. Azuma’s Inequality in Action After 𝑗 steps, use π‘Œ 𝑗 to denote # of hops to origin (Manhattan distance) How large is π‘Œ π‘œ ? We know π‘Œ 0 = 0 , and π‘Œ 𝑙 βˆ’ π‘Œ π‘™βˆ’1 ≀ 1 Within Ο( π‘œ log π‘œ) w.h.p.

  33. Azuma’s Inequality

  34. Azuma’s Inequality For a sequence of r.v., if in each step: * on average make no change to current value ( martingale ) * no big jump ( bounded difference )

  35. Azuma’s Inequality For a sequence of r.v., if in each step: * on average make no change to current value ( martingale ) * no big jump ( bounded difference ) Then final value does not deviate a lot from the initial value.

  36. Proving Azuma’s Inequality

  37. Proving Azuma’s Inequality Use similar strategy as in proving Chernoff bounds:

  38. Proving Azuma’s Inequality Use similar strategy as in proving Chernoff bounds: (a ) Apply generalized Markov’s inequality to MGF

  39. Proving Azuma’s Inequality Use similar strategy as in proving Chernoff bounds: (a ) Apply generalized Markov’s inequality to MGF (b) * Bound the value of MGF (use Hoeffding’s lemma)

  40. Proving Azuma’s Inequality Use similar strategy as in proving Chernoff bounds: (a ) Apply generalized Markov’s inequality to MGF (b) * Bound the value of MGF (use Hoeffding’s lemma) (c) Optimize the value of MGF

  41. Proving Azuma’s Inequality

  42. Proving Azuma’s Inequality

  43. Proving Azuma’s Inequality

  44. Proving Azuma’s Inequality

  45. for πœ‡ > 0

  46. for πœ‡ > 0

  47. for πœ‡ > 0

  48. for πœ‡ > 0 𝔽 𝑍 = 𝔽 𝔽 𝑍 | π‘Œ

  49. for πœ‡ > 0 𝔽 𝔽 𝑔 π‘Œ 𝑕 π‘Œ, 𝑍 |π‘Œ = 𝔽 𝑔 π‘Œ 𝔽 𝑕 π‘Œ, 𝑍 |π‘Œ

  50. for πœ‡ > 0

  51. for πœ‡ > 0

  52. for πœ‡ > 0

  53. for πœ‡ > 0

  54. for πœ‡ > 0

  55. for πœ‡ > 0

  56. for πœ‡ > 0

  57. for πœ‡ > 0

  58. for πœ‡ > 0 𝑒 minimized when πœ‡ = π‘œ 2 Οƒ 𝑙=1 𝑑 𝑙

  59. Proving Azuma’s Inequality

  60. Proving Azuma’s Inequality ???

  61. Proving Azuma’s Inequality ???

  62. Generalized Martingales

  63. Generalized Martingales betting on a fair game π‘Œ 𝑗 : gain/loss of the i th bet 𝑗 : wealth after the i th bet 𝑍

  64. Generalized Martingales betting on a fair game π‘Œ 𝑗 : gain/loss of the i th bet 𝑗 : wealth after the i th bet ← martingale (since game is fair) 𝑍

  65. Generalized Azuma’s Inequality

  66. Azuma’s Inequality martingale π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ π‘œ martingale π‘Œ 0 , π‘Œ 1 , β‹― 𝔽 π‘Œ 𝑗 π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ π‘—βˆ’1 ) = π‘Œ π‘—βˆ’1 with π‘Œ 𝑙 βˆ’ π‘Œ π‘™βˆ’1 ≀ 𝑑 𝑙 , generalization then β„™ π‘Œ π‘œ βˆ’ π‘Œ 0 β‰₯ 𝑒 ≀ β‹― martingale 𝑍 0 , 𝑍 1 , β‹― w.r.t. π‘Œ 0 , π‘Œ 1 , β‹― generalization 𝑍 𝑗 = 𝑔(π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ 𝑗 ) 𝔽 𝑍 π‘Œ 0 , π‘Œ 1 , β‹― , π‘Œ π‘—βˆ’1 ) = 𝑍 Generalized Azuma’s Inequality 𝑗 π‘—βˆ’1 martingale 𝑍 0 , 𝑍 1 , β‹― w.r.t. π‘Œ 0 , π‘Œ 1 , β‹― with 𝑍 𝑙 βˆ’ 𝑍 π‘™βˆ’1 ≀ 𝑑 𝑙 , then β„™ 𝑍 π‘œ βˆ’ 𝑍 0 β‰₯ 𝑒 ≀ β‹―

  67. Doob Sequence

  68. Doob Sequence 𝑔( ) , , ,

  69. Doob Sequence 𝑔( ) , , , average over no information 𝔽 𝑔

Recommend


More recommend