Surprising Negative Results for Generative Adversarial Tree Search
Kamyar Azizzadenesheli (UC Irvine, Stanford University, Caltech), Brandon Yang (Stanford University), Weitang Liu (UC Davis), Emma Brunskill (Stanford University), Zachary C. Lipton (Carnegie Mellon University), Animashree Anandkumar (Caltech)
Introduction: Deep Q-Network (DQN)
[Figure: DQN architecture with layers Conv1, Conv2, FC1 mapping game frames to per-action Q-values, e.g. Up 0.5, Down 2.0, Stay 1.5]
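For reference, below is a minimal sketch of a network with this shape: two convolutional layers followed by a fully connected layer that outputs one Q-value per action (Up, Down, Stay). The layer sizes and the 84x84, four-frame input are illustrative assumptions, not the authors' exact configuration.

    import torch
    import torch.nn as nn

    class DQN(nn.Module):
        # Conv1 -> Conv2 -> FC1 -> Q-values, mirroring the slide's diagram.
        def __init__(self, in_channels=4, n_actions=3):
            super().__init__()
            self.conv1 = nn.Conv2d(in_channels, 16, kernel_size=8, stride=4)
            self.conv2 = nn.Conv2d(16, 32, kernel_size=4, stride=2)
            self.fc1 = nn.Linear(32 * 9 * 9, 256)   # assumes 84x84 input frames
            self.head = nn.Linear(256, n_actions)   # one Q-value per action

        def forward(self, x):
            x = torch.relu(self.conv1(x))
            x = torch.relu(self.conv2(x))
            x = torch.relu(self.fc1(x.flatten(start_dim=1)))
            return self.head(x)                     # e.g. [Q(Up), Q(Down), Q(Stay)]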
Introduction: DQN
The DQN estimate of the Q-function can be arbitrarily biased (Thrun & Schwartz 1993; Antos et al. 2008). We empirically observe this phenomenon in DQN on Pong.
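A small numerical illustration (not from the paper) of one source of such bias: taking a max over noisy, individually unbiased per-action estimates inflates the value, which is exactly what the bootstrapped DQN target does. The Q-values below reuse the Up/Down/Stay numbers from the earlier slide; the noise level is an arbitrary assumption.

    import numpy as np

    rng = np.random.default_rng(0)
    true_q = np.array([0.5, 2.0, 1.5])                  # Up, Down, Stay
    noise = rng.normal(0.0, 1.0, size=(100_000, 3))     # zero-mean estimation noise
    noisy_q = true_q + noise                            # unbiased per-action estimates

    print(true_q.max())                # 2.0: the quantity we want to bootstrap on
    print(noisy_q.max(axis=1).mean())  # noticeably above 2.0: the max over noise is biased upward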
Generative Adversarial Tree Search
Given a model of the environment:
1. Do Monte-Carlo Tree Search (MCTS) for a limited horizon
2. Bootstrap with the Q-function at the leaves
Generative Adversarial Tree Search
[Prop. 1] Let e_Q be an upper bound on the error in the estimate of the Q-function. In GATS with roll-out horizon H, this error contributes to the error in the estimate of the return as γ^H e_Q, where γ is the discount factor.
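Below is a minimal sketch of this return estimate. The interfaces model.step and q_fn, and the exhaustive depth-limited expansion used here in place of full MCTS, are hypothetical simplifications, not the authors' implementation. The point of Prop. 1 is visible in the recursion: a Q-error committed at a leaf is multiplied by gamma once per level, so it reaches the root scaled by γ^H.

    def gats_value(state, model, q_fn, actions, depth, gamma=0.99):
        """Estimated return from `state` with `depth` model roll-out steps left."""
        if depth == 0:
            # Leaf: bootstrap with the Q-function. An error of at most e_Q here
            # is discounted gamma**H times on its way back to the root (Prop. 1).
            return max(q_fn(state, a) for a in actions)
        best = float("-inf")
        for a in actions:
            next_state, reward = model.step(state, a)   # learned dynamics model
            best = max(best, reward + gamma * gats_value(next_state, model, q_fn,
                                                         actions, depth - 1, gamma))
        return best

    def gats_action(state, model, q_fn, actions, horizon, gamma=0.99):
        """Pick the action whose depth-`horizon` expansion looks best."""
        def score(a):
            next_state, reward = model.step(state, a)
            return reward + gamma * gats_value(next_state, model, q_fn, actions,
                                               horizon - 1, gamma)
        return max(actions, key=score)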
Generative Dynamics Model
Generates next frames conditioned on the current frames and actions.
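A minimal sketch of such a conditional next-frame generator, assuming frames are stacked as channels and the action is broadcast as extra one-hot channels. The tiny convolutional network below is an illustrative stand-in, not the GAN generator trained in the paper.

    import torch
    import torch.nn as nn

    class DynamicsGenerator(nn.Module):
        def __init__(self, n_frames=4, n_actions=3):
            super().__init__()
            self.n_actions = n_actions
            self.net = nn.Sequential(
                nn.Conv2d(n_frames + n_actions, 32, 3, padding=1), nn.ReLU(),
                nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(),
                nn.Conv2d(32, 1, 3, padding=1),   # predicted next frame
            )

        def forward(self, frames, action):
            # frames: (B, n_frames, H, W); action: LongTensor of action indices, shape (B,)
            b, _, h, w = frames.shape
            one_hot = torch.zeros(b, self.n_actions, h, w, device=frames.device)
            one_hot[torch.arange(b), action] = 1.0   # broadcast the action as channels
            return self.net(torch.cat([frames, one_hot], dim=1))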
Negative Results
The Goldfish and the Gold Bucket
Conclusions
- We develop a sample-efficient generative model for RL using GANs.
- Given a fixed Q-function, GATS reduces the worst-case error in the return estimate due to the Q-function exponentially in the roll-out depth, as γ^H e_Q.
- Even with perfect modeling, GATS can impede learning of the Q-function.
- This study of GATS highlights important considerations for combining model-based and model-free reinforcement learning.