a learning framework for distribution based game
play

A Learning Framework for Distribution- Based Game-Theoretic Solution - PowerPoint PPT Presentation

A Learning Framework for Distribution- Based Game-Theoretic Solution Concepts Tushant Jha Yair Zick IIIT Hyderabad UMass Amherst Learn models of compute prefs/valuations/etc solution Economic Data directly learn approx. solutions


  1. A Learning Framework for Distribution- Based Game-Theoretic Solution Concepts Tushant Jha Yair Zick IIIT Hyderabad UMass Amherst Learn models of compute prefs/valuations/etc solution Economic Data directly learn approx. solutions (This work was conducted while both authors were at National University of Singapore, and was supported by Singapore National Research Foundation Fellowship.)

  2. General Model of PAC Solution Concepts Observations from ๐’  over game space ๐’ด probability โ‰ฅ 1 โˆ’ ฮด , over training samples The probability that we observe a loss is low Find solution ๐‘ก โˆ— โˆˆ ๐•‹ ๐‘Œโˆผ๐’  ๐œ‡ ๐‘•, ๐‘Œ, ๐‘ก โˆ— Pr < ๐œ Evaluated by an โ€ข Dimension theory for economic unknown game solution concepts ๐‘•: ๐’ด โ†’ ๐’ต โ€ข Sample complexity bounds โ€ข Are consistent algorithms enough?

  3. Example: PAC Solutions for Markets Observations, of item bundles and the utilities of n players for them, sampled i.i.d. from ๐’  A sampled set of goods ๐‘ค 1 ๐‘ก 1 , ๐‘ค 2 ๐‘ก 1 โ€ฆ ๐‘ค ๐‘œ (๐‘ก 1 ) Find an ๐‘‡ โŠ† ๐ป is not demanded allocation by any player ๐‘— โˆˆ ๐‘‚ and prices ๐‘ค 1 ๐‘ก 2 , ๐‘ค 2 ๐‘ก 2 โ€ฆ ๐‘ค ๐‘œ (๐‘ก 2 ) โŒฉ๐ต โˆ— , ิฆ ๐‘ž โˆ— โŒช ๐‘‡โˆผ๐’  Demands i ๐‘‡; ๐ต โˆ— , ิฆ ๐‘ž โˆ— Pr < ๐œ ๐‘ค 1 ๐‘ก 3 , ๐‘ค 2 ๐‘ก 3 โ€ฆ ๐‘ค ๐‘œ (๐‘ก 3 ) โ€ข We prove linear solution dimension. โ€ข Therefore, prove polynomial sample complexity Evaluated by bound for PAC solvability. โ‹ฎ โ€ข Also, we show existence of PAC solutions for unknown player Fisher markets (ie. budget constrained) and preferences ๐‘ค ๐‘— ๐‘—โˆˆ๐‘‚ markets with endowments.

  4. Main Takeaway Recovers previous results such as: Enables i) PAC Core Stability in TU generalization of Cooperative Games approach to other (Balcan et al. IJCAI 2015), ii) in Hedonic Games domains like Markets, (Sliwinski et al. IJCAI 2017; Voting, Auctions, etc. Igarashi et al. AAAI 2019), iii) And othersโ€ฆ Provides a unified formal approach for proving learnability in economics, and associated distribution-agnostic sample complexity bounds.

Recommend


More recommend