reserve pricing in repeated second price auctions with
play

Reserve Pricing in Repeated Second-Price Auctions with Strategic - PowerPoint PPT Presentation

Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders Alexey Drutsa Setup Second-Price (SP) Auction with Reserve Prices Setting A good (e.g., an ad space) is offered for sale by a seller to buyers Each buyer


  1. Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders Alexey Drutsa

  2. Setup

  3. Second-Price (SP) Auction with Reserve Prices β–Œ Setting β€Ί A good (e.g., an ad space) is offered for sale by a seller to 𝑁 buyers β€Ί Each buyer 𝑛 holds a private valuation 𝑀 $ ∈ [0,1] for this good ( 𝑀 $ is unknown to the seller) β–Œ Actions β€Ί The seller selects a reserve price π‘ž $ for each buyer 𝑛 β€Ί Each buyer 𝑛 submits a bid 𝑐 $ β–Œ Allocation and payments β€Ί Determine actual buyer-participants: 𝕅 = {𝑛 ∣ 𝑐 $ β‰₯ π‘ž $ } β€Ί The good is received by the buyer 𝑛 4 = argmax $βˆˆπ•… 𝑐 $ (that has the highest bid) β€Ί This buyer pays π‘ž $ 4 = max {π‘ž $ 4 , max $βˆˆπ•…βˆ–{$ 4} 𝑐 $ }

  4. Repeated Second-Price Auctions with Reserve Equal goods (e.g., ad spaces) are repeatedly offered for sale β€Ί by a seller (e.g., RTB platform) to 𝑁 buyers (e.g., advertisers) β€Ί over π‘ˆ rounds (one good per round). Each buyer 𝑛 β€Ί holds a private fixed valuation 𝑀 $ ∈ [0,1] for each of those goods, β€Ί 𝑀 $ is unknown to the seller. At each round 𝑒 = 1, … , π‘ˆ , the seller conducts SP auction with reserves: β€Ί the seller selects a reserve price π‘ž > $ for each buyer 𝑛 β€Ί and a bid 𝑐 > $ is submitted by each buyer 𝑛 .

  5. Seller’s pricing algorithm β€Ί The seller applies a pricing algorithm 𝐡 that sets reserve prices {π‘ž > B,C $ } >@A,$@A B,C $ } >@A,$@A in response to bids 𝐜 = {𝑐 > of buyers 𝑛 = 1, … , 𝑁 β€Ί A price π‘ž > $ can depend only on past bids {𝑐 E >GA,C F } E@A,F@A and the horizon π‘ˆ .

  6. Strategic buyers β–Œ The seller announces her pricing algorithm 𝐡 in advance In each round 𝑒 , each buyer 𝑛 β€Ί observes a history of previous rounds (available to this buyer) and β€Ί chooses his bid 𝑐 > $ s.t. it maximizes his future 𝛿 $ -discounted surplus: B 4 O (𝑀 $ βˆ’ π‘ž E $ ) Sur > 𝐡, 𝑀 $ , 𝛿 $ , {𝑐 E $ } : = 𝔽 M EGA 𝕁 $@$ 𝛿 $ , 𝛿 $ ∈ 0,1 , E@> where 𝕁 $@$ 4 O is the indicator of the event when buyer 𝑛 is the winner in round 𝑑 $ is the payment of the buyer 𝑛 in this case π‘ž E

  7. Seller’s goal The seller’s strategic regret: $ 𝑀 $ βˆ’ 𝕁 𝕅 W Xβˆ… π‘ž > 4 W ) $ SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ : = βˆ‘ B (max >@A She seeks for a no-regret pricing for worst-case valuation: sup \ ] ,…,\ ^ ∈ _,A SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ = 𝑝 π‘ˆ Optimality : the lowest possible upper bound for the regret of the form 𝑃 𝑔(π‘ˆ) .

  8. Background, Research question & Main contribution

  9. Background: 1-buyer case (posted-price auctions) If one buyer ( 𝑁 = 1 ), a SP auction reduces to a posted-price auction: β€Ί the buyer either accepts or rejects a currently offered price π‘ž > A β€Ί the seller either gets payment equal to π‘ž > A or nothing [Kleinberg et al., FOCS’2003] Optimal algorithm against myopic buyer with truthful regret Θ(log log π‘ˆ) . [Amin et al., NIPS’2013] The strategic setting is introduced. βˆ„ no-regret pricing for non-discount case 𝛿 = 1 . [Drutsa, WWW’2017] Optimal algorithm against strategic buyer with regret Θ(log log π‘ˆ) for 𝛿 < 1 .

  10. Research question The known optimal algorithms (PRRFES & prePRRFES) from posted-price auctions cannot be directly applied to set reserve prices in second-price auctions β€Ί buyers in SP auctions have incomplete information due to presence of rivals β€Ί the proofs of optimality of [pre]PRRFES strongly rely on complete information β–Œ In this study, I try to find an optimal algorithm for the multi-buyer setup

  11. Main contribution A novel algorithm for our strategic buyers with regret upper bound of Θ(log log π‘ˆ) for 𝛿 < 1 A novel transformation that maps any pricing algorithm designed for posted-price auctions to a multi-buyer setup

  12. Main ideas

  13. Two learning processes $ 𝑀 $ βˆ’ 𝕁 𝕅 W Xβˆ… π‘ž > 4 W ) $ B SReg π‘ˆ, 𝐡, 𝑀 $ $ , 𝛿 $ $ : = βˆ‘ (max >@A Find which buyer has Find the buyers’ valuations the maximal valuation Learning process #1 Learning process #2

  14. Learning proc.#1: an idea to localize a valuation PRRFES is an optimal learner of a valuation in posted-price auctions. However, its core localization technique relies on: β–Œ The buyer completely knows the outcomes of current and all future rounds β–Œ given their bids (due to absence of rivals) Can we use PRRFES in the second-price scenario where each buyer does not know perfectly the outcomes of rounds?

  15. Barrage pricing β€Ί Reserve prices are personal (individual) in our setup β€Ί Thus, we are able to β€œeliminate” particular buyers from particular rounds β€Ί Namely, a buyer 𝑛 will not bid above 1/(1 βˆ’ 𝛿 $ ) β€Ί We call this price as β€œbarrage” one and denote it by ∞ Let β€œeliminate” all buyers except some buyer 𝑛 in a round 𝑒 Then the buyer 𝑛 will have com round 𝑒 complete i ete inf nfor ormati tion on abo about outcome of this s ro

  16. Learning proc.#2: an idea to find max valuation The search algorithm works by maintaining a feasible interval [𝑣 $ , π‘₯ $ ] that β€Ί is aimed to localize the valuation 𝑀 $ , i.e. 𝑀 $ ∈ [𝑣 $ , π‘₯ $ ] β€Ί shrinks as 𝑒 β†’ ∞ [𝑣 o , π‘₯ o ] [𝑣 A , π‘₯ A ] [𝑣 p , π‘₯ p ] 𝑀 A 𝑀 p 𝑀 o 0 1 round 𝑒 A round 𝑒 p round 𝑒 o β–Œ If, in a round 𝑒 , it becomes that π‘₯ $ < 𝑣 m for some buyers 𝑛 and π‘œ , β–Œ then buyer 𝑛 has non-maximal valuation which should not be searched anymore

  17. Dividing algorithms

  18. Key instrument that implements the ideas transformation di div

  19. Transformation di div : cyclic elimination Let 𝐡 be an algorithm designed for repeated posted-price auctions β–Œ Its transformation 𝐞𝐣𝐰 𝐡 is an algorithm for repeated SP auctions as follows Buyers: Reserve prices are set by: Reserve Prices (only one non-barrage in a round): A A A π‘ž A ∞ ∞ Algorithm 𝐡 π‘ž t ∞ ∞ π‘ž u ∞ . . . p p p ∞ ∞ Algorithm 𝐡 π‘ž p ∞ ∞ ∞ . . . π‘ž v π‘ž w o o ∞ ∞ Algorithm 𝐡 π‘ž o ∞ ∞ ∞ ∞ . . . π‘ž x Rounds, 𝑒 = 3 1 2 4 5 6 7 8 1 2 3 Periods, 𝑗 =

  20. Transformation di div : stopping rule We stop considering a buyer 𝑛 in periods when π‘₯ $ < 𝑣 m for some buyer π‘œ. β–Œ The number of periods with buyer 𝑛 is referred to as subhorizon, 𝐽 $ . We stopped learning of 𝑀 A and 𝐽 A = 𝑙 , when π‘₯ A < 𝑣 p Buyers: Reserve prices are set by: Reserve Prices : A π‘ž E ∞ ∞ ∞ Algorithm 𝐡 ∞ ∞ ∞ ∞ . . . p p p p ∞ ∞ Algorithm 𝐡 π‘ž E|A ∞ ∞ π‘ž E|u . . . π‘ž E|o π‘ž E|v o o o ∞ ∞ Algorithm 𝐡 π‘ž E|p ∞ ∞ ∞ . . . π‘ž E|t π‘ž E|x 𝑑 + 2 𝑑 𝑑 + 1 𝑑 + 3 𝑑 + 4 𝑑 + 5 𝑑 + 6 𝑑 + 7 Rounds, 𝑒 = 𝑙 + 3 𝑙 𝑙 + 1 𝑙 + 2 Periods, 𝑗 =

  21. οΏ½ οΏ½ Transformation div: regret decomposition Lemma 1. For the described transformation, strategic regret has decomposition: SReg π‘ˆ, 𝐞𝐣𝐰 𝐡 , 𝑀 $ $ , 𝛿 $ $ = 𝑀 m βˆ’ 𝑀 $ ) = M Reg $ (π‘ˆ, 𝐡, 𝑀 $ , 𝛿 $ ) 𝐽 $ (max + M m $ $ Individual regrets Deviation regret Measure how the algorithm 𝐡 learns Measures how fast we stop learning of non-maximal valuations the valuation of each buyer

  22. Key challenge against strategic buyer Strategic buyer may lie and mislead algorithms, thus a good algorithm must Extract correct information about a buyer’s valuation from his actions (bids) β–Œ Dividing structure in a round allows to construct a tool to locate valuations: β–Œ it is enough to make complete information situation in a round

  23. Upper bound on valuation of strategic buyer Let buyer 𝑛 is the non-”eliminated ” one in a round 𝑒 . β–Œ If the buyer accepts (bids above) the current reserve price π‘ž > $ B 4 W (𝑀 $ βˆ’ π‘ž > 4 O (𝑀 $ βˆ’ π‘ž E >GA 𝕁 $@$ $ ) + 𝔽 M EGA 𝕁 $@$ $ ) Surplus > = 𝔽 𝛿 $ 𝛿 $ E@>|A = ≀ Ε½ (𝑀 $ βˆ’ π‘ž > >GA (𝑀 $ βˆ’ π‘ž > >GA 𝕁 β€’ W $ ) = 𝛿 $ $ ) 0 𝛿 $ Ε½ β€˜β€™ W β–Œ If the buyer rejects (bids below) the current reserve price π‘ž > $ >|†GA B ≀ 𝛿 $ 4 O (𝑀 $ βˆ’ π‘ž E (𝑀 $ βˆ’ [lowest_price]) $ ) EGA 𝕁 $@$ Surplus > = 𝔽 M 𝛿 $ 1 βˆ’ 𝛿 $ E@>|† If we observe that a buyer rejects non-”barrage” reserve price, then: β€’ 𝑀 $ βˆ’ π‘ž > $ < $ βˆ’ [lowest_price]) β€’ Ε½ β€’ (π‘ž > AGβ€’ Ε½ Gβ€’ Ε½

  24. Optimal algorithm

  25. Pricing algorithm divPRRFES Apply the transformation div div to PRRFES algorithm

  26. divPRRFES: individual and deviation regrets β–Œ Individual regrets Our tool to locate valuations provides the upper bound (as in 1-buyer case): Reg $ π‘ˆ, 𝐡, 𝑀 $ , 𝛿 $ = 𝑃 log p log p π‘ˆ βˆ€π‘› β–Œ Deviation regrets β€Ί For each buyer 𝑛 with non-maximal valuation (i.e., 𝑀 $ < max 𝑀 m ) m β€Ί We can upper bound its subhorizon 𝐽 $ : 𝐷 𝐽 $ ≀ 𝑀 m βˆ’ 𝑀 $ max m

Recommend


More recommend