online stochastic matching with unequal probabilities
play

Online Stochastic Matching with Unequal Probabilities Aranyak Mehta - PowerPoint PPT Presentation

Online Stochastic Matching with Unequal Probabilities Aranyak Mehta Bo Waggoner Harvard Morteza Zadimoghaddam SODA 2015 1 Outline Problem and motivation Prior work, our main result Key idea: Adaptivity Ideas behind


  1. Online Stochastic Matching with Unequal Probabilities Aranyak Mehta Bo Waggoner Harvard Morteza Zadimoghaddam SODA 2015 1

  2. Outline ● Problem and motivation ● Prior work, our main result ● Key idea: Adaptivity ● Ideas behind algorithm/analysis 2

  3. Motivation: Search ads advertisers search queries Time 3

  4. Motivation: Search ads advertisers search queries Simplified problem: - display one ad per query - have estimate of click Time probabilities - advertisers pay $1 if click, $0 if no click - advertisers have budget for one click per day How to assign ads? 4

  5. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals Time 5

  6. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals p 11 p 31 Time p 41 6

  7. Online Stochastic Matching [Mehta and Panigrahi, 2012] Pr[ searcher clicks if we show this ad ] fixed, online offline vertices arrivals p 11 p 31 Time p 41 7

  8. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online offline vertices arrivals p 31 Time Assign to vertex 3 ! Alg 8

  9. Online Stochastic Matching [Mehta and Panigrahi, 2012] With prob p 31 : match succeeds fixed, online offline vertices arrivals With prob 1 - p 31 : match fails p 31 Time Alg 9

  10. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online match succeeded offline vertices arrivals Time cannot be matched again Alg 10

  11. Online Stochastic Matching [Mehta and Panigrahi, 2012] fixed, online match failed offline vertices arrivals Time may be matched again later disappears (cannot re-try) Alg 11

  12. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = # successes Alg 12

  13. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = E[ # successes ] Alg 13 13

  14. Measuring algorithm performance fixed, online offline vertices arrivals Alg’s performance = E[ # successes ] Opt’s performance = size of max weighted assignment, budget 1 Alg Opt 14 14

  15. Measuring algorithm performance fixed, online Competitive ratio = offline vertices arrivals min Alg Alg’s performance = E[ size of matching ] Opt Opt’s performance = size of max weighted over all input instances. assignment, budget 1 (Note: Opt is a bit funky … not achievable even with foreknowledge of instance.) Alg Opt 15 15

  16. Prior Work ● Online Matching with Stochastic Rewards Mehta, Panigrahi, FOCS 2012. ○ Greedy = 0.5 . Opt ○ For case where all p are equal and vanishing : Alg ≥ 0.567 . Opt Open: anything better than Greedy for unequal p 16

  17. This work For unequal, vanishing edge probabilities: Alg ≥ 0.534 Opt 17

  18. This work For unequal, vanishing edge probabilities: So what? Alg ≥ 0.534 algorithmic ideas to beat Opt Greedy 18

  19. Outline ● Problem and motivation ● Prior work, our main result ● Key idea: Adaptivity ● Ideas behind algorithm/analysis 19

  20. Adaptive: sees whether or not assignment succeeds fixed, online offline vertices arrivals 20

  21. Our Approach 1. Start with an optimal non-adaptive alg that is straightforward to analyze 2. Add a small amount of adaptivity (second choices) 3. Analysis remains tractable by limiting amount of adaptivity 21

  22. An optimal non-adaptive algorithm ● MP-2012: nonadaptive algs have upper bound of 0.5 ● How to achieve 0.5 ? (Previously unknown.) Seems nonobvious. 22

  23. Maximize marginal expected gain offline online Assign first arrival to vertices arrivals vertex with largest p i1 0.3 0.4 0.2 23

  24. Maximize marginal expected gain offline online Assign next arrival to vertices arrivals max 0.1 Pr[ i available ] p i2 0.3 0.2 24

  25. Maximize marginal expected gain offline online = (1 - 0.4) * 0.3 Assign next arrival to vertices arrivals = 0.18 max 0.1 Pr[ i available ] p i2 0.3 0.2 = (1) * 0.2 = 0.2 25

  26. NonAdaptive Theorem: NonAdaptive has a competitive ratio of 0.5 for the general online stochastic matching problem. Does not require vanishing probabilities. 26

  27. Why do we like NonAdaptive ? ● On a given instance, an arrival has the same “first choice” every time (regardless of previous realizations) ● Algorithm tracks/uses competitive ratio (probabilities of success) 27

  28. Add Adaptivity (but not too much) Proposed SemiAdaptive : Assign next arrival to max Pr[ i available ] p ij unless already taken, in which case assign to second-highest. 28

  29. Why do we like SemiAdaptive ? ● On a given instance, an arrival has the same first and second choices every time (regardless of previous realizations) ● Algorithm tracks/uses competitive ratio (probabilities of success) These allow us to analyze SemiAdaptive -- almost... 29

  30. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 30

  31. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 ● Problem: correlation between availability of first and second choice . Perhaps when first choice is not available, most likely second choice is not available either. → cannot guarantee improvement over NonAdaptive 31

  32. (Analysis?) Roadblock ● Want: when first-choice is not available, get measurable benefit by assigning to second choice → giving improvement over NonAdaptive ’s 0.5 ● Problem: correlation between availability of first and second choice . Perhaps when first choice is not available, most likely second choice is not available either. → cannot guarantee improvement over NonAdaptive ● Fix: introduce independence / even less adaptivity. (no time to say more! sorry!) 32

  33. RECAP Online stochastic matching problem: - edges succeed probabilistically - maximize expected number of successes - input instance chosen adversarially New here: p 11 - edge probabilities may be unequal p 31 33

  34. RECAP Results: - optimal 0.5 -competitive NonAdaptive - 0.534 -competitive SemiAdaptive (with tweak) for vanishing probabilities Key idea: p 11 - control adaptivity to control analysis p 31 34

  35. Future Work Everything about Online Stochastic Matching: ● Vanishing probabilities: ○ Equal: 0.567 … ? … 0.62 ○ Unequal: 0.534 … ? … 0.62 ● Large probabilities: ○ Equal: 0.53 … ? … 0.62 ○ Unequal: 0.5 … ? … 0.62 35

  36. Future Work Everything about Online Stochastic Matching: ● Vanishing probabilities: ○ Equal: 0.567 … ? … 0.62 ○ Unequal: 0.534 … ? … 0.62 ● Large probabilities: ○ Equal: 0.53 … ? … 0.62 ○ Unequal: 0.5 … ? … 0.62 Thanks! 36

  37. Additional slides 37

  38. Final Algorithm “ SemiAdaptive ” Assign next arrival to max Pr[ i available ] p ij * unless already taken, in which case assign to second-highest. * “it would have already been taken by a previous first-choice” (key point: even less adaptive, more independence) 38

  39. Ideas behind analysis Pr[ available ] Either first choice is q 1 p 12 the same as Opt’s... p 22 q 2 q 3 p 42 q 4 q 5 39

  40. Ideas behind analysis Pr[ available ] Either first choice is q 1 p 12 the same as Opt’s... p 22 q 2 ...or both first and second choice would q 3 give at least as much p 42 “gain” as Opt’s. q 4 q 5 40

  41. Ideas behind analysis Pr[ available ] Either first choice is Very good because gains q 1 p 12 the same as Opt’s... “compound”. p 22 q 2 ...or both first and second choice would q 3 Good because we get give at least as much p 42 “second-choice gains”. “gain” as Opt’s. q 4 q 5 41

  42. Note: Can only get 1 - 1/e ≈ 0.632 even with knowledge of instance online Weighted matching: 1 arrivals 1/n E[ # of matches ] = 1 - Pr[ all fail ] 1/n = 1 - (1 - 1/n) n 1/n → 1 - 1/e 1/n 1/n 1/n Alg Opt 42 42

  43. Example of defining Opt fixed, online offline vertices arrivals 1/2 Opt gets 1 Alg’s performance = 2/3 E[ size of matching ] Opt’s performance = size of max weighted assignment, budget 1 1/4 1/4 Opt gets 1/2 Alg Opt 43 43

Recommend


More recommend