Better Transfer Learning with Inferred Successor Maps
Tamas Madarasz 1,2, Tim Behrens 1,2
1: University of Oxford, 2: UCL
arXiv:1906.07663 · Spotlight, NeurIPS 2019
The successor representation (SR)
Dayan, 1993, Neural Computation
Reward function
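The SR from the slide can be sketched in a few lines: for a fixed policy, it is the matrix of expected discounted future state occupancies, so state values factorize into the SR times the reward function. The 3-state chain and its transition matrix below are an assumed toy example, not from the talk.

```python
import numpy as np

# Tabular successor representation (Dayan, 1993) on a toy 3-state chain.
gamma = 0.9
P = np.array([[0.5, 0.5, 0.0],
              [0.5, 0.0, 0.5],
              [0.0, 0.5, 0.5]])  # assumed policy-induced transition matrix

# SR: M = (I - gamma * P)^{-1}, expected discounted future state occupancies.
M = np.linalg.inv(np.eye(3) - gamma * P)

# Values factorize as V = M @ r: a new reward function r only needs a fast
# re-evaluation, not re-learning of the dynamics, which is what makes the SR
# attractive for transfer.
r = np.array([0.0, 0.0, 1.0])  # reward only in the last state
V = M @ r
```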
Main approach
• Cluster tasks, and map the current task to the cluster under which the SR is easiest to adapt
• Use the SR’s flexibility to approximate the optimal value function
Wilson et al. 2007, ICML; Lazaric and Ghavamzadeh 2010, ICML; Finn et al. 2017, ICML
Generative model over reward functions
Dirichlet Process mixture model of kernel-smoothed rewards
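The clustering side of the generative model can be sketched via the Chinese-restaurant-process view of a Dirichlet Process mixture: each new task's (kernel-smoothed) reward map joins an existing cluster in proportion to its size, or opens a new one in proportion to the concentration parameter. The value of `alpha` and the number of tasks below are assumed for illustration.

```python
import numpy as np

# CRP sketch of a DP mixture prior over tasks' reward maps (illustrative;
# hyperparameter values are assumptions, not the paper's settings).
rng = np.random.default_rng(0)
alpha = 1.0           # DP concentration parameter (assumed)
cluster_sizes = []    # number of tasks assigned to each cluster so far

def crp_assign(sizes, alpha, rng):
    """Sample a cluster index for a new task under the CRP prior;
    returning len(sizes) means 'open a new cluster'."""
    weights = np.array(sizes + [alpha], dtype=float)
    return rng.choice(len(weights), p=weights / weights.sum())

for task in range(10):
    k = crp_assign(cluster_sizes, alpha, rng)
    if k == len(cluster_sizes):
        cluster_sizes.append(1)   # new cluster of reward maps
    else:
        cluster_sizes[k] += 1     # join an existing cluster
```

In the full model, each cluster would additionally carry a kernel-smoothed reward map that the assignment probabilities are weighted by (the likelihood term), omitted here for brevity.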
Bayesian Successor Representation (BSR)
M: successor representation; CR: convolved reward map
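A minimal sketch of the BSR idea, using the slide's names (M for successor map, CR for convolved reward map): keep one successor map and one reward belief per inferred cluster, pick the cluster that best explains the rewards observed in the current task, and evaluate states with that cluster's SR. The toy maps and the squared-error selection rule below are simplifying assumptions, not the paper's inference procedure.

```python
import numpy as np

# Toy per-cluster successor maps and convolved reward maps (assumed values).
n_states = 4
M = {0: 2.0 * np.eye(n_states),
     1: np.ones((n_states, n_states))}
CR = {0: np.array([1.0, 0.0, 0.0, 0.0]),
      1: np.array([0.0, 0.0, 0.0, 1.0])}

observed = np.array([0.0, 0.0, 0.0, 1.0])  # rewards seen in the current task

# Simplified cluster selection: pick the CR map closest to the observations
# (a stand-in for the posterior over clusters).
k = min(CR, key=lambda j: np.sum((CR[j] - observed) ** 2))

# Values under the selected cluster's successor map.
V = M[k] @ CR[k]
```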
Results Barreto et al. 2017 NeurIPS
Multi-task exploration bonus by offsetting the reward belief vector w
• UCB-inspired constant offset (Auer 2002, JMLR)
• Offset using CR maps, acting as priors for rewards
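The two offsets on this slide can be contrasted in a short sketch: a constant UCB-style bonus inflates the reward belief everywhere, while the CR-based offset concentrates the bonus where the inferred cluster's prior expects reward. The vectors and the scaling constant `c` below are assumed illustrative values.

```python
import numpy as np

c = 0.5                                      # bonus scale (assumed)
w = np.array([0.2, 0.0, 0.1, 0.0])           # current reward belief vector
cr_prior = np.array([0.0, 0.6, 0.0, 0.4])    # cluster's convolved reward map

# UCB-inspired constant offset: uniform optimism across all states.
w_ucb = w + c

# CR-based offset: optimism concentrated where the prior expects reward.
w_bsr = w + c * cr_prior
```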
Results: Hippocampus
Blum and Abbott 1996; Levy et al. 2005; Stachenfeld et al. 2017; Boccara et al. 2019, Science; Jezek et al., Nature; Grieves et al. 2016, eLife
Thank you!
arXiv:1906.07663
Transfer and Multi-task Learning, Poster #52, 10:45 AM - 12:45 PM