Social Learning in Multi Agent Multi Armed Bandits Abishek - PowerPoint PPT Presentation

Feb 11, 2024 •427 likes •973 views

Social Learning in Multi Agent Multi Armed Bandits Abishek Sankararaman, UC Berkeley April 9, 2020 Joint Work with - Sanjay Shakkottai, Ronshee Chawla, UT Austin - Ayalvadi Ganesh, University of Bristol Multi Armed Bandit Problem A set of

Social Learning in Multi Agent Multi Armed Bandits Abishek Sankararaman, UC Berkeley April 9, 2020 Joint Work with - Sanjay Shakkottai, Ronshee Chawla, UT Austin - Ayalvadi Ganesh, University of Bristol
Multi Armed Bandit Problem A set of possible drugs with a-priori unknown cure rates
Multi Armed Bandit Problem A set of possible drugs with a-priori unknown cure rates Task - Prescribe one of these to new incoming patients to both (i) cure them and (ii) collect data about their cure rates
Multi Armed Bandit Problem A set of possible drugs with a-priori unknown cure rates Task - Prescribe one of these to new incoming patients to both (i) cure them and (ii) collect data about their cure rates Explore/Exploit Tradeo ff for each new patient [Thompson’ 33] Prescribe a drug that has shown the best promise so far Exploit Explore Try a new drug to discover more promising alternatives Run a risk of not curing these patients
Outline 1. Single Agent MAB 2. The Multi-Agent Setup 3. The Gossiping Insert-Eliminate (Gosine) Algorithm 4. Insights

Recommend

Cooperative Multi-Agent Bandits with Heavy Tails Introduction K-Armed Bandits Cooperation

Cooperative Bandits with Heavy Tails Dubey and Pentland ICML 2020 Cooperative Multi-Agent Bandits with Heavy Tails Introduction K-Armed Bandits Cooperation Summary Abhimanyu Dubey and Alex Pentland Background K-Armed Bandits

350 views • 16 slides

The Contextual Bandits Problem The Contextual Bandits Problem The Contextual Bandits Problem The

The Contextual Bandits Problem The Contextual Bandits Problem The Contextual Bandits Problem The Contextual Bandits Problem The Contextual Bandits Problem A New, Fast, and Simple Algorithm A New, Fast, and Simple Algorithm A New, Fast, and

1.56k views • 134 slides

About this class An example Bandit problems in general Two-armed bandits Multi-armed bandits

About this class An example Bandit problems in general Two-armed bandits Multi-armed bandits and Gittins indices 1 An Example [Most of this lecture from Berry & Fristedt] You want to maximize the sum of two observa- tions. The process

407 views • 13 slides

Multi-armed Bandits Prof. Kuan-Ting Lai 2020/3/12 k-armed Bandit Problem Playing k armed

Multi-armed Bandits Prof. Kuan-Ting Lai 2020/3/12 k-armed Bandit Problem Playing k armed bandit machines and find a way to win most money! Note: assume you have unlimited money and never go bankrupt!

342 views • 21 slides

Reinforcement Learning n-armed bandit Kevin Spiteri April 21, 2015 n-armed bandit n-armed

Reinforcement Learning n-armed bandit Kevin Spiteri April 21, 2015 n-armed bandit n-armed bandit 0.9 0.5 0.1 0.9 0.5 0.1 0.0 0.0 0.0 0.0 estimate n-armed bandit n-armed bandit 0.9 0.5 0.1 0.9 0.5 0.1 0 0.0 0.0 0.0 0.0

677 views • 21 slides

Econ 2148, fall 2019 Multi-armed bandits Maximilian Kasy Department of Economics, Harvard

Bandits Econ 2148, fall 2019 Multi-armed bandits Maximilian Kasy Department of Economics, Harvard University 1 / 25 Bandits Agenda Thus far: Supervised machine learning data are given. Next: Active learning

665 views • 25 slides

On conditional versus marginal bias in multi-armed bandits Jaehyeok Shin 1 , Aaditya Ramdas 1,2

On conditional versus marginal bias in multi-armed bandits Jaehyeok Shin 1 , Aaditya Ramdas 1,2 and Alessandro Rinaldo 1 Dept. of Statistics and Data Science 1 , Machine Learning Dept. 2 , CMU Stochastic Multi-armed bandits (MABs) K 2 1

1k views • 56 slides

Advanced Econometrics 2, Hilary term 2021 Multi-armed bandits Maximilian Kasy Department of

Bandits Advanced Econometrics 2, Hilary term 2021 Multi-armed bandits Maximilian Kasy Department of Economics, Oxford University 1 / 25 Bandits Agenda Thus far: Supervised machine learning data are given. Next: Active

425 views • 25 slides

Muti-armed Bandits,Online Learning and Sequential Prediction Jian Li Institute for

2016 NDBC Muti-armed Bandits,Online Learning and Sequential Prediction Jian Li Institute for Interdisciplinary Information Sciences Tsinghua University Outline Online Learning Stochastic Multi-armed Bandits UCB Combinatorial

789 views • 47 slides

Adaptations of the Thompson Sampling Algorithm for Multi-Armed Bandits Ciara Pike-Burke

Adaptations of the Thompson Sampling Algorithm for Multi-Armed Bandits Adaptations of the Thompson Sampling Algorithm for Multi-Armed Bandits Ciara Pike-Burke Supervisor: David Leslie 24th April 2015 1 / 14 Adaptations of the Thompson

586 views • 20 slides

Introduction to Bandits R emi Munos SequeL project: Sequential Learning

Introduction to bandits Games Hierarchical bandits Lipschitz optimization X -armed bandits Planning Conclusion Introduction to Bandits R emi Munos SequeL project: Sequential Learning http://researchers.lille.inria.fr/ munos/ INRIA

1.1k views • 67 slides

Multi-armed bandits S Bubeck, N Cesa-Bianchi Foundations and Trends in Machine Learning 2012 *

Multi-armed bandits S Bubeck, N Cesa-Bianchi Foundations and Trends in Machine Learning 2012 * Real title: regret analysis of stochastic and nonstochastic multi-armed bandit problems Overview Stochastic, adversarial, extensions &

281 views • 7 slides

Reinforcement Learning Kevin Spiteri April 21, 2015 n-armed bandit n-armed bandit 0.9 0.5

Reinforcement Learning Kevin Spiteri April 21, 2015 n-armed bandit n-armed bandit 0.9 0.5 0.1 n-armed bandit 0.9 0.5 0.1 0.0 0.0 0.0 0.0 estimate n-armed bandit 0.9 0.5 0.1 0 0.0 0.0 0.0 0.0 estimate 0 0 0 0.0 0

995 views • 84 slides

Module 13 Bayesian Bandits CS 886 Sequential Decision Making and Reinforcement Learning

Module 13 Bayesian Bandits CS 886 Sequential Decision Making and Reinforcement Learning University of Waterloo Multi-Armed Bandits Problem: bandits with unknown average reward () Which arm should we play at each

414 views • 15 slides

Multi-Armed Bandits: Non-adaptive and Adaptive Sampling Instructor: Sham Kakade 1 The

CSE 547/Stat 548: Machine Learning for Big Data Lecture Multi-Armed Bandits: Non-adaptive and Adaptive Sampling Instructor: Sham Kakade 1 The (stochastic) multi-armed bandit problem The basic paradigm is as follows: K Independent Arms: a

273 views • 5 slides

Multi-Player Bandits Revisited Decentralized Multi-Player Multi-Arm Bandits Lilian Besson Joint

Multi-Player Bandits Revisited Decentralized Multi-Player Multi-Arm Bandits Lilian Besson Joint work with milie Kaufmann PhD Student Team SCEE, IETR, CentraleSuplec, Rennes & Team SequeL, CRIStAL, Inria, Lille CMAP Seminar 31 st

1.47k views • 96 slides

Refresh Your Knowledge Fast RL Part II The prior over arm 1 is Beta(1,2) (left) and arm 2 is a

Lecture 13: Fast Reinforcement Learning 1 Emma Brunskill CS234 Reinforcement Learning Winter 2020 1 With a few slides derived from David Silver Lecture 13: Fast Reinforcement Learning 1 Emma Brunskill (CS234 Reinforcement Learning ) Winter 2020

762 views • 40 slides

Working Texas Style: Do You Have The Skills To Pay The Bills Central East Texas Alliance

Working Texas Style: Do You Have The Skills To Pay The Bills Central East Texas Alliance Regional economic development, business and workforce development leaders Aug. 26, 2014, in Navasota, Texas Presentation by Mick Normington Data

997 views • 84 slides

ALL THINGS Lindy Strong DNA & OUR THINKING DNA & Our Thinking The following section

RESTORATION OF ALL THINGS Lindy Strong DNA & OUR THINKING DNA & Our Thinking The following section has several quotes directly for this book It also contains a 21 day brain detox plan DNA & Our Thinking To quote Caroline

1.4k views • 106 slides

The purpose of life, after all, is to live it, to taste experience to the utmost, to reach

What is a Meaningful Life? The purpose of life, after all, is to live it, to taste experience to the utmost, to reach out eagerly and without fear for newer and richer experience. Eleanor Roosevelt What is a Meaningful Life?

987 views • 84 slides

Wireless & Mobile Health to Address COVID-19 Fadel Adib Wireless & Mobile Health to

Talk given at the NSF NeTS Call to Arms Workshop on April 22, 2012 Wireless & Mobile Health to Address COVID-19 Fadel Adib Wireless & Mobile Health to Address COVID-19 Technologies that have been already deployed Solutions that

282 views • 27 slides

GALAXY SPIRAL ARMS, DISK DISTURBANCES AND STATISTICS Part I: NGC3081 to build background for

GALAXY SPIRAL ARMS, DISK DISTURBANCES AND STATISTICS Part I: NGC3081 to build background for NGC4622. Co-authors for Parts I and II: G. Byrd (Univ. of Alabama, Tuscaloosa), T. Freeman (Bevill State Comm. Coll. Fayette, AL), R.

519 views • 36 slides

Introduction to Multi-Armed Bandits and Reinforcement Learning Training School on Machine

Introduction to Multi-Armed Bandits and Reinforcement Learning Training School on Machine Learning for Communications Paris, 23-25 September 2019 Who am I ? . Hi, Im Lilian Besson finishing my PhD in telecommunication and machine

1.73k views • 127 slides

Adpative MAMS Design Lingyun Liu 27 April 2019 Lingyun Liu Stat4Onc 27 April 2019 1 / 28

Adpative MAMS Design Lingyun Liu 27 April 2019 Lingyun Liu Stat4Onc 27 April 2019 1 / 28 Outline 1 Introduction 2 Group Sequential (GS) Approach 3 P-value Combination Approach 4 Group Sequential Approach vs P-value Combination Approach 5

567 views • 32 slides