online learning via convex geometry with applications to
play

Online Learning via Convex Geometry, with Applications to Pricing - PowerPoint PPT Presentation

Online Learning via Convex Geometry, with Applications to Pricing Adrian Vladu MIT Dynamic Pricing Problem ? ? ? $0.5 $2.0 $1.0 ? ? ? $0.5 $2.0 $1.0 ? ? ? day 1 x 1 x 3 x 0 ? $0.5 $2.0 $1.0 ? ? ?


  1. Online Learning via Convex Geometry, with Applications to Pricing Adrian Vladu MIT

  2. Dynamic Pricing Problem ? ? ?

  3. $0.5 $2.0 $1.0 🙋 ? ? ?

  4. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ?

  5. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ? $7

  6. $0.5 $2.0 $1.0 🙋 ? ? ? day 1 x 1 x 3 x 0 💶 ? $7 ❌

  7. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 x 1 x 3 x 0 💶 ? $7 ❌

  8. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $7 ❌

  9. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=6.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $2 $7 ❌

  10. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 x 2 x 1 x 0 x 3 x 2 x 0 💶 ? 💶 ? $2 $7 ✅ ❌

  11. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 ✅ ❌

  12. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=7.5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌

  13. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10. 5 day 1 day 2 day 3 x 2 x 1 x 0 x 0 x 3 x 1 x 2 x 0 x 1 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌ ❌

  14. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10.5 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 ✅ ❌ ❌

  15. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=10.5 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 $3 ✅ ❌ ❌

  16. $0.5 $2.0 $1.0 🙋 ? ? ? Regret=11 day 1 day 2 day 3 day 4 x 2 x 1 x 0 x 1 x 0 x 3 x 1 x 1 x 2 x 0 x 1 x 1 💶 ? 💶 ? 💶 ? 💶 ? $2 $7 $4 $3 ✅ ❌ ❌ ✅

  17. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work

  18. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work ?

  19. Dynamic Pricing Problem • Warm-up: 1-dimensional case • this work ? $0 $1

  20. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? $0 $1

  21. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? ✂ $0 $1 $0.5

  22. Dynamic Pricing Problem • Warm-up: 1-dimensional case ✅ • this work ? ✂ $0 $1 $0.5

  23. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? $0 $1

  24. Dynamic Pricing Problem • Warm-up: 1-dimensional case 💶 ? • this work ? ✂ $0 $1 $0.75

  25. Dynamic Pricing Problem • Warm-up: 1-dimensional case ❌ • this work ? ✂ $0 $1 $0.75

  26. Dynamic Pricing Problem • Warm-up: 1-dimensional case ❌ • this work ? ✂ $0 $1 $0.75

  27. Dynamic Pricing Problem • Warm-up: 1-dimensional case ⌛ O(log 1/ ε ) days • this work ? $0 $1

  28. Dynamic Pricing Problem • Warm-up: 1-dimensional case ⌛ O(log 1/ ε ) days • this work ? ≤ε $0 $1

  29. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy ⌛ O(log 1/ ε ) days ? • this work ≤ε $0 $1

  30. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  31. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  32. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  33. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  34. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  35. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • Can we bridge the gap? • this work

  36. Dynamic Pricing Problem • Warm-up: 1-dimensional case • O(log T) is easy • O(log log T) is a cute puzzle [Kleinberg, Leighton ‘03] • Back to d-dimensional case • O(d 2 log T) [Cohen, Lobel, Paes Leme ’16] • Ω (d log log T) lower bound • O(d log T) [this work] • this work

  37. Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining

  38. Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price ≥ε • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly Cut • repeat until no valid directions remaining perpendicular to this direction

  39. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price ≥ε • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly Cut • repeat until no valid directions remaining perpendicular to this direction

  40. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half

  41. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half

  42. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half Theorem : there exists a strategy for cutting such that w(K,u) ≤ ε for all u after O(d log (d/ ε )) iterations. Set ε =d log T / T , regret is at most T ε + O(d log (d/ ε )) = O(d log T) .

  43. ✂ Model • start with d -dimensional convex body (pos- sible prices for items) • customer gives vector of item quantities u , k u k = 1, with the promise that w ( K, u ) � ✏ , asks for price • player chooses price c , and partitions K = K 1 [ K 2 , where K 1 = { x 2 K : x > u  c } and K 2 = K \ K 1 • customer picks which half contains the true prices, update K accordingly • repeat until no valid directions remaining Keep this half Theorem : there exists a strategy for cutting such that w(K,u) ≤ ε for all u after O(d log (d/ ε )) iterations. Set ε =d log T / T , regret is at most T ε + O(d log (d/ ε )) = O(d log T) .

  44. Convex Geometry Toolbox

  45. Convex Geometry Toolbox ✂ Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

  46. Convex Geometry Toolbox ✂ Not quite sufficient Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

  47. Convex Geometry Toolbox ✂ Not quite sufficient Grunbaum’s Theorem : any cut through the centroid partitions K in two pieces of roughly equal volume (largest/ smallest ≤ e ).

Recommend


More recommend