By Stian Berg Supervisor Ole-Christoffer Granmo, University of - PowerPoint PPT Presentation

Aug 05, 2023 •169 likes •240 views

Solving Dynamic Bandit Problems and Decentralized Games using the Kalman Bayesian Learning Automaton By Stian Berg Supervisor Ole-Christoffer Granmo, University of Agder Introduction Thesis topic: Evaluation of a novel approach to dynamic

Solving Dynamic Bandit Problems and Decentralized Games using the Kalman Bayesian Learning Automaton By Stian Berg Supervisor Ole-Christoffer Granmo, University of Agder
Introduction • Thesis topic: Evaluation of a novel approach to dynamic bandit problems • Bandit problem example: Link relevance 2
Stationary bandit problem 3
Dynamic bandit problem 4
The Kalman Bayesian Learning Automaton (KBLA) • Kalman filtering • Position tracking • Robot navigation • Electronic equipment • Stock estimation • Forecasting • Computer vision • KBLA • Kalman filtering adapted to work in a bandit setting 5
Summary of results • Among the top performers in all experiments • Scaled rather well with the number of options • Could handle various types of feedback However.... • May need significant tuning for good performance 6
Conclusion • Empirical evaluation of the KBLA • Performance • Scalability • Robustness • Overall we believe this is a very promising approach • Further work • Parameter problem • Combining ideas from other bandit algorithms 7

Recommend

Game programming in Haskell Alexander Berntsen Stian A. Ellingsen September 18, 2013 Alexander

Outline Our project Welcome to the functional paradigm of Haskell Game programming in Haskell Alexander Berntsen Stian A. Ellingsen September 18, 2013 Alexander Berntsen, Stian A. Ellingsen Game programming in Haskell Outline Our project

874 views • 44 slides

C2 language Bas van den Berg Fosdem 2015, Brussels Bas van den Berg C2 language Goal Goal of

C2 language Bas van den Berg Fosdem 2015, Brussels Bas van den Berg C2 language Goal Goal of this presentation: show the C2 language show how you can re-use LLVM/Clang components get feedback/ideas Bas van den Berg C2 language

686 views • 40 slides

The he Role ole of of Strengt ength The he Role ole of of Strengt ength Exer xercis

The he Role ole of of Strengt ength The he Role ole of of Strengt ength Exer xercis cise e in in Weight Weight Exer xercis cise e in in Weight Weight Los Loss and and Wellnes Wellness Los Loss and and Wellnes Wellness

994 views • 78 slides

OLE extension from OT extension Manoj Prabhakaran joint work with Guru Vamsi

OLE extension from OT extension Manoj Prabhakaran joint work with Guru Vamsi Policharla Rajeev Raghunath Parjanya Vyas IIT Bombay New Results for OLE over GF ( 2 n ) Random OLE over GF ( 2 n ) : Alice gets ( a, t )

248 views • 13 slides

Object-Oriented Programming for Scientific Computing Dynamic Polymorphism Ole Klein

Object-Oriented Programming for Scientific Computing Dynamic Polymorphism Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 19. Mai 2015 Ole Klein (IWR) Object-Oriented

642 views • 32 slides

Object-Oriented Programming for Scientific Computing Formalia, Introduction and Quick Recap Ole

Object-Oriented Programming for Scientific Computing Formalia, Introduction and Quick Recap Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 14. April 2015 Ole Klein (IWR)

958 views • 63 slides

Object-Oriented Programming for Scientific Computing Namespaces and Inheritance Ole Klein

Object-Oriented Programming for Scientific Computing Namespaces and Inheritance Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 5. Mai 2015 Ole Klein (IWR) Object-Oriented

514 views • 29 slides

Object-Oriented Programming for Scientific Computing Traits and Policies Ole Klein

Object-Oriented Programming for Scientific Computing Traits and Policies Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 23. Juni 2015 Ole Klein (IWR) Object-Oriented

488 views • 46 slides

Object-Oriented Programming for Scientific Computing Error Handling and Exceptions Ole Klein

Object-Oriented Programming for Scientific Computing Error Handling and Exceptions Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 12. Mai 2015 Ole Klein (IWR)

613 views • 37 slides

Object-Oriented Programming for Scientific Computing Dynamic Memory Management Ole Klein

Object-Oriented Programming for Scientific Computing Dynamic Memory Management Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 21. April 2015 Ole Klein (IWR) Object-Oriented

1.15k views • 35 slides

Object-Oriented Programming for Scientific Computing Templates and Static Polymorphism Ole Klein

Object-Oriented Programming for Scientific Computing Templates and Static Polymorphism Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 2. Juni 2015 Ole Klein (IWR)

867 views • 48 slides

IETF Trust Report Ole Jacobsen IETF Trust Chair (for a few more minutes) IETF 89 London March

IETF Trust Report Ole Jacobsen IETF Trust Chair (for a few more minutes) IETF 89 London March 2014 Musical Leadership Chairs l Ole followed Marshall as Trust Chair l Chris followed Ole as Trust Chair l Bob became ISOC BoT Chair l

241 views • 9 slides

Object-Oriented Programming for Scientific Computing Template Metaprogramming Ole Klein

Object-Oriented Programming for Scientific Computing Template Metaprogramming Ole Klein Interdisciplinary Center for Scientific Computing Heidelberg University ole.klein@iwr.uni-heidelberg.de 30. Juni 2015 Ole Klein (IWR) Object-Oriented

736 views • 51 slides

Fail-Safe Strategies for FPGA Devices Targeted for Critical Applications Melanie Berg, AS&D

Fail-Safe Strategies for FPGA Devices Targeted for Critical Applications Melanie Berg, AS&D in support of NASA/GSFC Melanie.D.Berg@NASA.gov Kenneth LaBel, NASA/GSFC Jonathan Pellish, NASA/GSFC Presented by Melanie Berg at the Single Event

917 views • 67 slides

Reliable Design Versus Trust Melanie Berg AS&D in support of NASA/GSFC

Unclassified Reliable Design Versus Trust Melanie Berg AS&D in support of NASA/GSFC Melanie.D.Berg@NASA.gov Kenneth A. LaBel ken.label@nasa.gov 1 Presented by Melanie Berg at the Field Programmable Gate Array Symposium, Chantilly, VA,

465 views • 22 slides

Challenges Regarding IP Core Functional Reliability. Melanie Berg 1 , Kenneth LaBel 2 1.AS&D

Challenges Regarding IP Core Functional Reliability. Melanie Berg 1 , Kenneth LaBel 2 1.AS&D in support of NASA/GSFC Melanie.D.Berg@NASA.gov 2. NASA/GSFC Kenneth.A.LaBel@NASA.gov 1 To be presented by Melanie Berg at the Microelectronics

409 views • 20 slides

City of Somerville Zoning Amendment Union Square Zoning Amendment Meeting #11 4-12-17

City of Somerville Zoning Amendment Union Square Zoning Amendment Meeting #11 4-12-17 Todays Agenda 1. Website Review 2. USQ Covenant Review of similar projects / benefits Some perspective on Exhibit C 3. USQ Zoning Open

917 views • 69 slides

Lessons from Discrete Mathematics Kirsten Nelson Carleton University October 14, 2017 Contact:

Lessons from Discrete Mathematics Kirsten Nelson Carleton University October 14, 2017 Contact: kirsten.nelson@carleton.ca Kirsten Nelson (Carleton University) Lessons from Discrete Mathematics October 14, 2017 1 / 63 Introduction 1

932 views • 75 slides

Community Development Block Grant FY2020 RFP Application Workshop City of New Bedford Office

Community Development Block Grant FY2020 RFP Application Workshop City of New Bedford Office of Housing & Community Development 1/17/2020 1 AGENDA Understanding the Request for Proposal (RFP) Application CDBG Program Basics

952 views • 62 slides

COMPLETE OR BALANCED? Providing variable treatments will not make a street incomplete! Plan for

COMPLETE OR BALANCED? Providing variable treatments will not make a street incomplete! Plan for all uses, but balance the solutions Share the road, it is a public rights-of-way Many issues, but can only afford a few solutions Demand

1.15k views • 96 slides

Bandit-based Search for Constraint Programming Manuel Loth 1 , 2 , 4 , Mich` ele Sebag 2 , 4 , 1 ,

Bandit-based Search for Constraint Programming Manuel Loth 1 , 2 , 4 , Mich` ele Sebag 2 , 4 , 1 , Youssef Hamadi 3 , 1 , Marc Schoenauer 4 , 2 , 1 , Christian Schulte 5 1 Microsoft-INRIA joint centre 2 LRI, Univ. Paris-Sud and CNRS 3 Microsoft

402 views • 37 slides

Internship Defense David Taralla University of Lige Thursday 19 December 2013 Contents

Internship Defense David Taralla University of Lige Thursday 19 December 2013 Contents Introduction Context Basic idea From the idea to the theoretical implementation Conclusion Internship Defense David Taralla University of Lige

647 views • 27 slides

Data Science methods for treatment personalization in Persuasive Technology Prof. dr. M.C.Kaptein

Data Science methods for treatment personalization in Persuasive Technology Prof. dr. M.C.Kaptein Professor Data Science & Health Principal Investigator @ JADS 12 April 2019 Undergraduate Topics in Computer Science Maurits Kaptein Edwin

792 views • 32 slides

Lecture #1: Introduction to CS109A aka STAT121A, AC209A, CSCIE-109A CS109A Introduction to Data

Lecture #1: Introduction to CS109A aka STAT121A, AC209A, CSCIE-109A CS109A Introduction to Data Science Pavlos Protopapas, Kevin Rader and Chris Tanner 1 Lecture Outline Why data science? Why taking CS109A? What is data science?

863 views • 67 slides