Understanding the impact of entropy on policy optimization
Zafarali Ahmed, Nicolas Le Roux, Mohammad Norouzi, Dale Schuurmans
bit.ly/2HQvGoQ · zafarali.ahmed@mail.mcgill.ca
Why should we understand policy optimization?
What is policy optimization? Find a parameterized policy that maximizes rewards.
(1) Collect data + calculate objective
(2) Take gradient + update policy parameters
Why is it difficult? Bad gradient estimates? Difficult geometry? Poor conditioning? Not enough "exploration"?
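As a concrete illustration of the two-step loop above, here is a minimal REINFORCE-style sketch on a toy two-armed bandit. The environment, batch size, and learning rate are illustrative assumptions, not the paper's setup.

import numpy as np

rng = np.random.default_rng(0)
true_rewards = np.array([1.0, 0.0])   # arm 0 pays more (toy assumption)
theta = np.zeros(2)                    # softmax policy parameters

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

for step in range(500):
    # (1) Collect data + calculate objective
    probs = softmax(theta)
    actions = rng.choice(2, size=32, p=probs)
    rewards = true_rewards[actions] + rng.normal(0.0, 0.1, size=32)

    # (2) Take gradient + update policy parameters
    # For a softmax policy, grad log pi(a) = one_hot(a) - probs
    grad = np.zeros(2)
    for a, r in zip(actions, rewards):
        grad += r * (np.eye(2)[a] - probs)
    theta += 0.1 * grad / len(actions)

print("final policy:", softmax(theta))  # should put most mass on arm 0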
Contribution 1: How do we study high-dimensional objective functions?
STEP 1: Collect random perturbations of the objective around θ₀
STEP 2: How does the objective change along those random perturbations?
[Figure: objective values along random directions around θ₀, with curvature on either side labelled (+, +), (+, −), (−, −)]
Contribution 1: How do we study high-dimensional objective functions?
Examples: at a local optimum; at a saddle point
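One way to read this contribution is as repeated one-dimensional probes of the objective: sample random unit directions around θ₀ and evaluate the objective along each. The sketch below does this for an illustrative saddle-shaped surface; the stand-in objective and the number of probes are assumptions, not the paper's evaluation code.

import numpy as np

rng = np.random.default_rng(1)
dim = 100
theta0 = np.zeros(dim)

def objective(theta):
    # Illustrative saddle: positive curvature in half the coordinates,
    # negative curvature in the other half (stand-in for an RL objective).
    signs = np.concatenate([np.ones(dim // 2), -np.ones(dim // 2)])
    return float(np.sum(signs * theta ** 2))

alphas = np.linspace(-1.0, 1.0, 21)   # step sizes along each direction
for _ in range(5):
    d = rng.normal(size=dim)
    d /= np.linalg.norm(d)            # random unit direction
    values = [objective(theta0 + a * d) for a in alphas]
    # Change relative to theta0 on either side hints at the local curvature
    print(f"left: {values[0] - values[10]:+.3f}, "
          f"right: {values[-1] - values[10]:+.3f}")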
Contribution 2: Why does entropy regularization help?
Experiments on exact grid worlds and MuJoCo.
Conclusion: Even in the absence of gradient estimation error, policy entropy helps by smoothing the objective function.
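To make the entropy-regularized objective concrete, a small sketch on the same toy bandit: J_τ(θ) = E_π[r] + τ·H(π_θ). Larger τ pulls the policy toward uniform and flattens sharp regions of the landscape. The bandit, τ values, and parameter scan are illustrative assumptions, not the paper's experiments.

import numpy as np

true_rewards = np.array([1.0, 0.0])   # toy two-armed bandit

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

def entropy_regularized_objective(theta, tau):
    probs = softmax(theta)
    expected_reward = float(probs @ true_rewards)
    entropy = float(-(probs * np.log(probs + 1e-12)).sum())
    return expected_reward + tau * entropy

# Scan the objective along one parameter direction for different tau;
# with larger tau the scanned values vary less (a smoother slice).
thetas = np.linspace(-5.0, 5.0, 11)
for tau in (0.0, 0.1, 1.0):
    values = [entropy_regularized_objective(np.array([t, 0.0]), tau)
              for t in thetas]
    print(f"tau={tau}: range of objective = {max(values) - min(values):.2f}")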
Understanding the impact of entropy on policy optimization
Read the paper! bit.ly/2HQvGoQ
Come see the poster! Poster #29, TODAY, 6:30 PM, Pacific Ballroom
Chat with me! zafarali.ahmed@mail.mcgill.ca