Guided Policy Search
Sergey Levine
Learning on PR2
Shape sorting cube
Visuomotor Policies
Guided Policy Search: alternate trajectory optimization with supervised learning
Objective: expectation under the current policy and the trajectory distribution(s), coupled by a Lagrange multiplier
Supervised Learning Objective
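A minimal sketch of what the supervised step does: regress the policy onto the actions of the optimized trajectory controller, over states sampled from the trajectory distribution. The linear policy, random data, and learning rate below are illustrative assumptions, not details from the talk.

```python
import numpy as np

# Illustrative data: states sampled from the trajectory distribution and
# actions produced by a (stand-in, time-invariant) trajectory controller.
rng = np.random.default_rng(0)
states = rng.normal(size=(100, 4))
K_traj = rng.normal(size=(2, 4))
actions = states @ K_traj.T

# Linear policy u = W x, fit by gradient descent on a squared-error
# surrogate for matching the trajectory controller's actions.
W = np.zeros((2, 4))
lr = 0.1
for _ in range(1000):
    pred = states @ W.T
    grad = (pred - actions).T @ states / len(states)
    W -= lr * grad
```

With noiseless data the fitted policy recovers the controller's gains; in GPS the same regression is weighted by the Lagrange multipliers that tie the policy to the trajectory distribution(s).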
Trajectory Optimization (without GPS)
Trajectory Optimization
Trajectory Optimization: keep the new trajectory distribution close to the old one [see Levine & Abbeel '14 for details]
[see L. et al. NIPS ‘14 for details]
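As a stand-in for the trajectory optimization step, the sketch below runs a finite-horizon LQR backward pass (Riccati recursion) on linear dynamics. The cited paper fits time-varying linear-Gaussian controllers to linearized dynamics; this fixed-matrix version only illustrates the core backward computation, and the matrices are hypothetical.

```python
import numpy as np

def lqr_backward(A, B, Q, R, T):
    """Finite-horizon discrete LQR: Riccati recursion returning the
    time-varying feedback gains K_t such that u_t = K_t x_t."""
    V = Q.copy()                            # terminal cost-to-go
    gains = []
    for _ in range(T):
        Quu = R + B.T @ V @ B               # action-value curvature in u
        Qux = B.T @ V @ A                   # cross term between u and x
        K = -np.linalg.solve(Quu, Qux)      # optimal linear feedback
        V = Q + A.T @ V @ A + A.T @ V @ B @ K   # Riccati cost-to-go update
        gains.append(K)
    return gains[::-1]                      # order gains from t=0 to t=T-1
```

Example use: for double-integrator dynamics (a made-up test system), rolling out the closed loop with these gains drives the state to the origin.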
Trajectory Optimization (with GPS)
[see L. et al. NIPS ‘14 for details]
Instrumented Training: training time vs. test time
Visuomotor policy network: ~92,000 parameters (with Chelsea Finn)
Experimental Tasks
Shape sorting cube
Hanger
Hammer
Bottle
Locomotion (Igor Mordatch): better trajectory optimization + large-scale simulation
Darwin Robot (Igor Mordatch): better trajectory optimization + large-scale simulation + adaptation to real-world dynamics [Mordatch, Mishra, Eppner, Abbeel]
Guided Policy Search Applications:
manipulation (with N. Wagener and P. Abbeel)
dexterous hands (with V. Kumar and E. Todorov)
locomotion (with V. Koltun)
aerial vehicles (with G. Kahn, T. Zhang, P. Abbeel)
tensegrity robot (with M. Zhang, K. Caluwaerts, P. Abbeel)
DAGGER: mixing weight β_i is typically 0.0, except β_1 = 1.0 (the first iteration rolls out the expert)
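The β schedule above can be sketched as a DAgger loop: roll out the expert on iteration 1, the learned policy afterwards, and label every visited state with the expert's action before retraining on the aggregated dataset. The environment and training interfaces here are hypothetical placeholders.

```python
import random

def dagger(env_reset, env_step, expert, train, horizon, iterations):
    """DAgger sketch with beta_1 = 1 and beta_i = 0 for i > 1."""
    dataset = []                 # aggregated (state, expert action) pairs
    policy = expert              # iteration 1 acts with the expert anyway
    for i in range(1, iterations + 1):
        beta = 1.0 if i == 1 else 0.0
        state = env_reset()
        for _ in range(horizon):
            # Mix expert and learner rollouts according to beta.
            act_with = expert if random.random() < beta else policy
            action = act_with(state)
            # The expert labels every state the rollout visits.
            dataset.append((state, expert(state)))
            state = env_step(state, action)
        policy = train(dataset)  # supervised learning on all data so far
    return policy
```

On a toy linear system with a linear expert, the aggregated regression recovers the expert exactly.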
DAGGER Video See http://videolectures.net/aistats2011_ross_reduction/
Trajectory Optimization – Dynamics Fitting
[see L. et al. NIPS ‘14 for details]
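A minimal sketch of the dynamics-fitting idea: regress next states on (state, action) pairs from rollouts to get x' ≈ A x + B u + c. The cited paper fits time-varying linear-Gaussian dynamics with a prior over samples; this plain least-squares version only shows the basic regression.

```python
import numpy as np

def fit_linear_dynamics(X, U, X_next):
    """Least-squares fit of x' = A x + B u + c from rollout samples.

    X:      (N, n) array of states
    U:      (N, m) array of actions
    X_next: (N, n) array of next states
    """
    n = X.shape[1]
    m = U.shape[1]
    # Stack regressors [x, u, 1] so A, B, and the bias c are fit jointly.
    Z = np.hstack([X, U, np.ones((len(X), 1))])
    W, *_ = np.linalg.lstsq(Z, X_next, rcond=None)
    A, B, c = W[:n].T, W[n:n + m].T, W[-1]
    return A, B, c
```

With enough noiseless samples, the fit recovers the true system matrices; in practice the regression is run per time step on noisy rollout data.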
Learned Motion Skills
More Visuomotor Experiments
Beyond Instrumented Training: training time vs. test time [Finn, Tan, Duan, Darrell, L., Abbeel '15]
Learning Visual State Spaces
Visual State Space Experiments