PAC Identification of Many Good Arms in Stochastic Multi-Armed - PowerPoint PPT Presentation

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits Arghya Roy Chaudhuri under the guidance of Prof. Shivaram Kalyanakrishnan Indian Institute of Technology Bombay, India 1 / 8

What Is It All About? 2 / 8

What Is It All About? 3 / 8

What Is a Multi-Armed Bandit? 1.0 0.9 0.5 0.5 0.2 0.0 Mean (Unknown) Bandits: Slot machines Mean: Pr[Reward = 1] 4 / 8

What Is a Multi-Armed Bandit? To identify the best arm: � n 1.0 ǫ 2 log 1 � E [SC] = Ω 0.9 δ To identify the best subset of size 0.5 0.5 m : � n ǫ 2 log m � E [SC] = Ω 0.2 δ 0.0 Mean (Unknown) Bandits: Slot machines Mean: Pr[Reward = 1] 4 / 8

What Is a Multi-Armed Bandit? To identify the best arm: � n 1.0 ǫ 2 log 1 � E [SC] = Ω 0.9 δ To identify the best subset of size 0.5 0.5 m : � n ǫ 2 log m � E [SC] = Ω 0.2 δ 0.0 Mean (Unknown) We need an alternative. Bandits: Slot machines Mean: Pr[Reward = 1] 4 / 8

Large Bandit Instances Difficulty for n ≫ T : ǫ 2 log 1 lim n →∞ n δ = ∞ . 5 / 8

Large Bandit Instances Difficulty for n ≫ T : ǫ 2 log 1 lim n →∞ n δ = ∞ . Get around: Identifying 1 from the best ρ -fraction is possible. 5 / 8

Large Bandit Instances Difficulty for n ≫ T : ǫ 2 log 1 lim n →∞ n δ = ∞ . Get around: Identifying 1 from the best ρ -fraction is possible. Redefine the problem to identify 1 from the best m arms. Defining ρ = m n , generalise the problem. What if we n is relatively small? 5 / 8

Finite-Armed Bandit Instances ( k , m , n ): To identify any distinct k arms from the best m arms in a set of n arms. 6 / 8

Finite-Armed Bandit Instances ( k , m , n ): To identify any distinct k arms from the best m arms in a set of n arms. k = 1 : Any 1 arm out of the best subset of size m . 6 / 8

Finite-Armed Bandit Instances ( k , m , n ): To identify any distinct k arms from the best m arms in a set of n arms. k = m : Best subset identification. 6 / 8

Finite-Armed Bandit Instances ( k , m , n ): To identify any distinct k arms from the best m arms in a set of n arms. k = m = 1 : Best arm identification. 6 / 8

Finite-Armed Bandit Instances ( k , m , n ): To identify any distinct k arms from the best m arms in a set of n arms. k = 1 : Any 1 arm out of the best subset of size m . k = m : Best subset identification. k = m = 1 : Best arm identification. Contributions: LUCB -k-m (Fully sequential + Adaptive). Worst case upper and lower bound. 6 / 8

Infinite-Armed Bandit Instances ( k , ρ ): To identify any distinct k arms from the best ρ fraction of arms. 7 / 8

Thank You! Poster: #54 Email: arghya@cse.iitb.ac.in 8 / 8

PAC Identification of Many Good Arms in Stochastic Multi-Armed - PowerPoint PPT Presentation

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits Arghya Roy Chaudhuri under the guidance of Prof. Shivaram Kalyanakrishnan Indian Institute of Technology Bombay, India 1 / 8 What Is It All About? 2 / 8 What Is It All

Sergeant at Arms Training Session www.toastmasters.org Sergeant at Arms TRAINING DISTRICT 37

Guiding Financial Controls and Practices for PACs and PAC Treasurers PAC Treasurers Workshop

NAPSLO PAC Contributions How contributing to the NAPSLO PAC will benefit you, your company and the

WELCOME June 2011 PAC Presentation Opening Remarks Introductions June 2011 PAC

AAOS Orthopaedic PAC The Orthopaedic PAC is the only national political action committee

LArIAT Fermilab PAC Meeting November 11, 2016 Jen Raaf PAC Charge Fermilab PAC Meeting, J.

Small Arms Survey 2012 New York, 1-5 June 2015 Small arms, new technologies, and MGE2 Moving

HERITAGE SQUARE CONSIDERATIONS Public Process Project Advisory Committee Meetings: PAC Meeting

Interferometric Sensor (MAGIS-100) PAC Meeting Jason Hogan on behalf of the MAGIS

Architecture Aromatique Good Taste Good Food Good Health Based on sustainability Technical

The PAC Learning Framework Guoqing Zheng January 20, 2015 Guoqing Zheng The PAC Learning

Toward Efficient Many-to-Many Broadcast in Dynamic Wireless Networks Fabian Mager , Carsten

Tie development of MEDIEVAL ARMOR over time WORCESTER ART MUSEUM ARMS & ARMOR PRESENTATION

EVENTS AND CELEBRATIONS THE LYTTELTON ARMS WWW.THELYTTELTONARMS.CO.UK WELCOME TO THE LYTTELTON

PROSPECTS FOR A STRONG ARMS TRADE TREATY Ben Donaldson 27.10.2012 Why am I talking about the Arms

The U.S.- -Russian Nuclear Arms Reduction Russian Nuclear Arms Reduction The U.S. Dialog:

M 3 : INTEGRATING ARBITRARY COMPUTE UNITS AS FIRST-CLASS CITIZENS OS: Nils Asmussen, Hermann H

ARM memory generator Arm Memory generator Make sure you create a folder similar to what you

ARM EDITION Matt Spisak REcon 2016, Montreal RECON 2016 ABOUT Offense-based approach to

Probabilis)c Reasoning for Assembly-Based 3D Modeling

NEVE: Nested Virtualization Extensions for ARM Jin Tack Lim, Christo ff er Dall, Shih-Wei Li, Jason

Extending the swsusp Hibernation Framework to ARM Russell Dill 1 2 Introduction Russ Dill

HIGHLEVELMANIPULATION PRIMITIVESFORAROBOTARM Supported by National

ARM Exception Handling CS2253 Owen Kaser, UNBSJ Overview Warning: hardest parts of CS2253.

Sambuz

Useful Links

Newsletter

Mail Us

PAC Identification of Many Good Arms in Stochastic Multi-Armed - PowerPoint PPT Presentation

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits Arghya Roy Chaudhuri under the guidance of Prof. Shivaram Kalyanakrishnan Indian Institute of Technology Bombay, India 1 / 8 What Is It All About? 2 / 8 What Is It All

Sergeant at Arms Training Session www.toastmasters.org Sergeant at Arms TRAINING DISTRICT 37

Guiding Financial Controls and Practices for PACs and PAC Treasurers PAC Treasurers Workshop

NAPSLO PAC Contributions How contributing to the NAPSLO PAC will benefit you, your company and the

WELCOME June 2011 PAC Presentation Opening Remarks Introductions June 2011 PAC

AAOS Orthopaedic PAC The Orthopaedic PAC is the only national political action committee

LArIAT Fermilab PAC Meeting November 11, 2016 Jen Raaf PAC Charge Fermilab PAC Meeting, J.

Small Arms Survey 2012 New York, 1-5 June 2015 Small arms, new technologies, and MGE2 Moving

HERITAGE SQUARE CONSIDERATIONS Public Process Project Advisory Committee Meetings: PAC Meeting

Interferometric Sensor (MAGIS-100) PAC Meeting Jason Hogan on behalf of the MAGIS

Architecture Aromatique Good Taste Good Food Good Health Based on sustainability Technical

The PAC Learning Framework Guoqing Zheng January 20, 2015 Guoqing Zheng The PAC Learning

Toward Efficient Many-to-Many Broadcast in Dynamic Wireless Networks Fabian Mager , Carsten

Tie development of MEDIEVAL ARMOR over time WORCESTER ART MUSEUM ARMS &amp; ARMOR PRESENTATION

EVENTS AND CELEBRATIONS THE LYTTELTON ARMS WWW.THELYTTELTONARMS.CO.UK WELCOME TO THE LYTTELTON

PROSPECTS FOR A STRONG ARMS TRADE TREATY Ben Donaldson 27.10.2012 Why am I talking about the Arms

The U.S.- -Russian Nuclear Arms Reduction Russian Nuclear Arms Reduction The U.S. Dialog:

M 3 : INTEGRATING ARBITRARY COMPUTE UNITS AS FIRST-CLASS CITIZENS OS: Nils Asmussen, Hermann H

ARM memory generator Arm Memory generator Make sure you create a folder similar to what you

ARM EDITION Matt Spisak REcon 2016, Montreal RECON 2016 ABOUT Offense-based approach to

Probabilis)c Reasoning for Assembly-Based 3D Modeling

NEVE: Nested Virtualization Extensions for ARM Jin Tack Lim, Christo ff er Dall, Shih-Wei Li, Jason

Extending the swsusp Hibernation Framework to ARM Russell Dill 1 2 Introduction Russ Dill

HIGHLEVELMANIPULATION PRIMITIVESFORAROBOTARM Supported by National

ARM Exception Handling CS2253 Owen Kaser, UNBSJ Overview Warning: hardest parts of CS2253.

Sambuz

Useful Links

Newsletter

Mail Us

Tie development of MEDIEVAL ARMOR over time WORCESTER ART MUSEUM ARMS & ARMOR PRESENTATION