Introduction Instructor: Haifeng Xu Outline Course Overview - PowerPoint PPT Presentation

CS6501: Topics in Learning and Game Theory (Fall 2019)
Introduction
Instructor: Haifeng Xu

Outline
- Course Overview
- Administrivia
- An Example

Single-Agent Decision Making
- A decision maker picks an action $y \in Y$, resulting in utility $g(y)$
- Typically an optimization problem:
      minimize (or maximize)  $g(y)$
      subject to              $y \in Y$
  • $y$: decision variable
  • $g(y)$: objective function
  • $Y$: feasible set/region
  • Optimal solution, optimal value
- Example 1: minimize $y^2$, s.t. $y \in [-1, 1]$
- Example 2: pick a road to school
- Example 3: invest in a subset of stocks
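Example 1 is small enough to solve by brute force. The following is a minimal sketch, assuming we approximate the feasible set [-1, 1] by an evenly spaced grid (the grid and the variable names are illustrative choices, not part of the slides):

```python
def g(y):
    """Objective function of Example 1."""
    return y * y

# Assumption: approximate the feasible set [-1, 1] by 201 grid points.
feasible = [-1 + k / 100 for k in range(201)]

# The optimal solution is the feasible point with the smallest objective;
# the optimal value is the objective evaluated at that point.
optimal_solution = min(feasible, key=g)
optimal_value = g(optimal_solution)

print(optimal_solution, optimal_value)
```

The point here is only the vocabulary: `feasible` plays the role of the feasible set, `g` the objective function, and the pair printed at the end is the optimal solution and the optimal value.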

Multi-Agent Decision Making
- Usually, your payoffs are affected not only by your own actions, but also by others’
- Agent $j$’s utility $g_j(y_j, y_{-j})$ depends on his own action $y_j$, as well as the other agents’ actions $y_{-j}$
- Is this still an optimization problem? Should each agent $j$ just pick $y_j \in Y_j$ to minimize $g_j(y_j, y_{-j})$?
  • $y_{-j}$ is not under $j$’s control
  • Think of the rock-paper-scissors game
- Examples: stock investment, routing, sales, even taking courses…
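The rock-paper-scissors remark can be made concrete. The sketch below assumes the usual win/tie/loss utilities of +1/0/-1 and treats utility as something to maximize; these numbers and the function names are illustrative assumptions, not taken from the slides:

```python
ACTIONS = ["rock", "paper", "scissors"]
BEATS = {"rock": "scissors", "paper": "rock", "scissors": "paper"}

def utility(mine, other):
    """Assumed utility: +1 for a win, 0 for a tie, -1 for a loss."""
    if mine == other:
        return 0
    return 1 if BEATS[mine] == other else -1

def best_response(other):
    """The action that maximizes my utility against a fixed opponent action."""
    return max(ACTIONS, key=lambda mine: utility(mine, other))

for other in ACTIONS:
    print(other, "->", best_response(other))
```

Each opponent action has a different best response, so no single action is optimal on its own. This is why the agent's problem is no longer a plain optimization problem.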

Example I: Prisoner’s Dilemma
- Two members A, B of a criminal gang are arrested
- They are questioned in two separate rooms
  • No communication between them
Q: How should each prisoner act?
- Both of them betray
- (-1,-1) is the best outcome, but it is not a stable status
  • Selfish behaviors result in inefficiency
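The payoff table of this slide did not survive extraction; only the outcome (-1,-1) is visible. The sketch below therefore assumes a common textbook payoff table (the values 0, -2 and -3 are assumptions) and checks each prisoner's best response:

```python
SILENT, BETRAY = 0, 1

# PAYOFF[a][b] = (payoff to A, payoff to B) when A plays a and B plays b.
# Assumption: only (-1, -1) comes from the slide; the rest are textbook values.
PAYOFF = [
    [(-1, -1), (-3, 0)],   # A stays silent
    [(0, -3), (-2, -2)],   # A betrays
]

def best_response_A(b):
    """A's payoff-maximizing action when B plays b."""
    return max((SILENT, BETRAY), key=lambda a: PAYOFF[a][b][0])

def best_response_B(a):
    """B's payoff-maximizing action when A plays a."""
    return max((SILENT, BETRAY), key=lambda b: PAYOFF[a][b][1])

for other in (SILENT, BETRAY):
    print(best_response_A(other), best_response_B(other))
```

Under these assumed payoffs, betraying is the best response to either action of the other prisoner, so both betray, even though both would be better off at (-1,-1).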

Example II: Markets on Amazon
- Assume people will buy if the book price ≤ $200
- Product cost = $20
If the market has only one book seller…
Q: What price should this monopoly set?
A: $200!

Example II: Markets on Amazon
- Assume people will buy if the book price ≤ $200
- Product cost = $20
What if the market has two book sellers…
Q: What price should each seller set?
- The sellers keep undercutting each other: $200 → $199 → $198 → … → $100 → … → $20

Example II: Markets on Amazon
- Assume people will buy if the book price ≤ $200
- Product cost = $20
What if the market has two book sellers…
Q: What price should each seller set?
- The market reaches a “stable status” (a.k.a. equilibrium): both sellers price at $20
- Nobody can benefit via unilateral deviation
  • Bertrand competition
  • Selfish behaviors result in inefficiency (to sellers)
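The “no unilateral deviation” claim can be checked numerically. This is a rough sketch under assumptions that are not in the slides: prices are whole dollars, buyers purchase from the cheaper seller, and the sellers split demand evenly on a tie.

```python
COST = 20        # product cost, from the slide
MAX_PRICE = 200  # buyers purchase only if the price is at most $200

def profit(own, other):
    """Profit per unit of demand for a seller charging `own` against `other`."""
    if own > MAX_PRICE or own > other:
        return 0.0                          # too expensive, or undercut
    share = 0.5 if own == other else 1.0    # assumption: ties split demand
    return share * (own - COST)

def deviation_gain(own, other):
    """Largest profit increase available by unilaterally changing `own`."""
    best = max(profit(p, other) for p in range(0, MAX_PRICE + 1))
    return best - profit(own, other)

def is_equilibrium(p1, p2):
    """True if neither seller can gain by deviating alone."""
    return deviation_gain(p1, p2) <= 0 and deviation_gain(p2, p1) <= 0

print(is_equilibrium(200, 200), is_equilibrium(20, 20))
```

At ($200, $200) either seller gains by undercutting to $199, while at ($20, $20) no deviation helps. One caveat of the whole-dollar grid: ($21, $21) also passes this check, which is an artifact of discretization rather than something the slides claim.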

Game Theory
Game theory studies multi-agent decision making in competitive scenarios where an agent’s payoff depends on other agents’ actions.
- Fundamental concept: equilibrium
  • A “stable status” at which no agent can improve his payoff through unilateral deviation
  • If it exists, it should be what we expect to happen
  • Resembles the “optimal decision” in the single-agent case
- A central theme in game theory is the study of equilibria
  • Different definitions of equilibria
  • May not exist; even if one exists, it is not necessarily unique
  • Understand properties of equilibria, compute equilibria, reduce the inefficiency of equilibria, …
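To illustrate “may not exist” and “not necessarily unique”, here is a small sketch that enumerates equilibria in pure actions of a two-player game. The two example games (matching pennies and a coordination game) and the function name are illustrative choices, not from the slides, and agents here maximize payoff:

```python
from itertools import product

def pure_equilibria(payoffs):
    """payoffs[r][c] = (row player's payoff, column player's payoff).

    Returns every action pair from which neither player can improve
    by deviating unilaterally.
    """
    rows, cols = range(len(payoffs)), range(len(payoffs[0]))
    result = []
    for r, c in product(rows, cols):
        row_ok = all(payoffs[r][c][0] >= payoffs[r2][c][0] for r2 in rows)
        col_ok = all(payoffs[r][c][1] >= payoffs[r][c2][1] for c2 in cols)
        if row_ok and col_ok:
            result.append((r, c))
    return result

matching_pennies = [[(1, -1), (-1, 1)], [(-1, 1), (1, -1)]]
coordination = [[(1, 1), (0, 0)], [(0, 0), (1, 1)]]

print(pure_equilibria(matching_pennies))
print(pure_equilibria(coordination))
```

Matching pennies has no equilibrium in pure actions, while the coordination game has two. Other equilibrium definitions (for example, allowing randomized actions) change the answer, which is the “different definitions of equilibria” point.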

Machine Learning
- Difficult to give a universal definition
- At a high level, the task is to learn a function $g: Y \to Z$, where $(y, z) \in Y \times Z$ is drawn from some distribution $E$
  • Input: a set of samples $\{(y_j, z_j)\}_{j=1,2,\dots,n}$ drawn from $E$
  • Output: an algorithm $B: Y \to Z$ such that $B(y) \approx g(y)$ (usually measured by some loss function)
- Examples
  • Classification: $Y$ = feature vectors; $Z = \{0, 1\}$
  • Regression: $Y$ = feature vectors; $Z = \mathbb{R}$
  • Reinforcement learning has a slightly different setup, but can be thought of as $Y$ = state space, $Z$ = action space
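As a toy instance of the regression setup, the sketch below fits a line to samples by least squares. The target function, the five noise-free samples, and the name `fit_line` are assumptions made for illustration; in the setup above the samples would be drawn from a distribution.

```python
def fit_line(samples):
    """Least-squares line through (input, label) pairs; returns a predictor."""
    n = len(samples)
    mean_in = sum(y for y, _ in samples) / n
    mean_out = sum(z for _, z in samples) / n
    cov = sum((y - mean_in) * (z - mean_out) for y, z in samples)
    var = sum((y - mean_in) ** 2 for y, _ in samples)
    slope = cov / var
    intercept = mean_out - slope * mean_in
    return lambda y: slope * y + intercept

def target(y):
    """Hypothetical unknown function the learner tries to recover."""
    return 2 * y + 1

# Assumption: noise-free samples at inputs 0..4.
samples = [(y, target(y)) for y in range(5)]
predictor = fit_line(samples)

print(predictor(10), target(10))
```

Here `samples` is the input, `predictor` is the learned output, and the squared error minimized by `fit_line` is one example of the loss function mentioned above.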

Problems at the Interface of Learning and Game Theory
- If a game is unknown or too complex, can players learn to play the game optimally?
  • Yes, sometimes: no-regret learning and convergence to equilibrium
- Can game-theoretic models inspire machine learning models?
  • Yes, e.g., GANs, which are zero-sum games
- Data is the fuel for ML. Can we collect high-quality data from the crowd?
  • Yes, via information elicitation mechanisms
- We know how to learn to recognize faces or languages, but can we also learn to design games to achieve some goal?
  • Yes, e.g., learning optimal auction mechanisms
- Are there game-theoretic/strategic behaviors in ML? How do we handle them?
  • Yes, e.g., when learning whether to give a loan to someone or whether to admit a student to UVA based on their features
- …

Main Topics of This Course
First Half: Machine learning for game theory
- No-regret learning and its convergence to equilibrium
- Learning optimal auction mechanisms
Second Half: Game theory for machine learning
- Incentivize high-quality data via information elicitation (a.k.a. crowdsourcing)
- Handle strategic behaviors in machine learning
  • Particularly, learning from strategic data sources, and fairness
We will only cover the fundamentals of each direction.

Course Goal
- Get familiar with the basics of game theory and learning
- Understand machine learning questions in game-theoretic settings, and how to deal with some of them
- Understand strategic aspects of machine learning tasks, and how to deal with some of them
- Be able to understand cutting-edge research papers in relevant areas

Targeted Audience of This Course
- Anyone planning to do research at the interface of game theory (or algorithm design) and machine learning
  • This is a new research direction with many opportunities/challenges
  • The recent breakthrough in no-limit poker is an example
- Anyone interested in theoretical ML, game theory, human factors in learning, AI
  • As more and more ML systems interact with human beings, such game-theoretic reasoning becomes increasingly important
  • Techniques developed for ML have also broadened our toolkit for designing and solving games
- Anyone interested in understanding the basics of game theory and learning

Who May Not Be Suitable for This Course?
- Those who do not satisfy the prerequisites “in practice”
- Those who are looking for a recipe to implement ML/DL algorithms, or want to learn how to use TensorFlow, PyTorch, etc.
  • This is primarily a theory course
  • We will mostly focus on simple/basic yet theoretically insightful problems
  • The course is proof based – we will not write code

Basic Information
- Course time: Tuesday/Thursday, 3:30 pm – 4:45 pm
- Lecture place: Thornton Hall E303
- Instructor: Haifeng Xu
  • Email: hx4ad@virginia.edu
  • Office: Rice Hall 522
  • Office hour: Mon 4 – 5 pm
- TAs
  • Minbiao Han: office hour Thur 11 – 12 pm, Olsson Hall 001
  • Jing Ma: office hour Tue 11 – 12 pm, Rice Hall 442
- Depending on demand, we can add more office hours (let us know!)
- Course website: http://www.haifeng-xu.com/cs6501fa19/
- References: linked papers/notes on website, no official textbooks
  • Slides will be posted after lecture

Prerequisites
- Mathematical maturity: be comfortable with proofs
- Sufficient exposure to algorithms/optimization
  • CS 6161 or equivalent, or
  • CS 4102 and you did really well
  • We will cover some basics of optimization

Requirements and Grading
- 3-4 homeworks, 60% of grade
  • Proof based
  • Will be challenging
  • Discussion is allowed, even encouraged, but you must write up solutions independently
  • Must be written up in LaTeX – hand-written solutions will not be accepted
  • One late homework allowed, at most 2 days late
- Research project, 40% of grade. Project instructions will be posted on the website later.
  • Team up: 2 – 4 people per team
  • Can thoroughly survey a research field, or
  • Study a relevant research question, e.g., one arising from your own research
  • Presentation form: a report in PDF
- FYI: you should not worry about your grade if you do invest time
