Learning Dexterity
Peter Welinder, September 9, 2018
Learning
Trends towards learning-based robotics
Reinforcement Learning: Go (AlphaGo Zero), Dota 2 (OpenAI Five)
What about robotics? RL is hard to apply directly because it consumes enormous amounts of experience: AlphaGo Zero played 5 million games, roughly 500 years of Go. Go: ~200 years of experience per day of training; Dota: ~200 years per day.
Simulators
Learning dexterity
24 joints: 20 actuated, 4 underactuated
Rotating a block. Challenges:
- RL in the real world
- high-dimensional control
- noisy and partial observations
- manipulating multiple objects
Approach
Reinforcement Learning + Domain Randomization
Reinforcement Learning: the agent (policy) receives states and rewards from the environment and outputs actions:
action_t = policy(state_t)
score = Σ_t reward(state_t, action_t)
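To make the loop concrete, here is a minimal sketch of one episode, assuming a Gym-style env.reset()/env.step() interface (the environment and policy are not part of the talk):

```python
# Minimal sketch of the agent-environment loop from the slide, assuming a
# Gym-style environment API (env.reset / env.step) and an arbitrary policy.
def run_episode(env, policy):
    state = env.reset()
    score = 0.0
    done = False
    while not done:
        action = policy(state)                 # action_t = policy(state_t)
        state, reward, done, _ = env.step(action)
        score += reward                        # score = sum_t reward(state_t, action_t)
    return score
```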
Reinforcement Learning: find the policy parameters that maximize total reward over episodes,
θ* = argmax_θ Σ_{τ ∈ episodes} reward(policy_θ, τ)
optimized with Proximal Policy Optimization (PPO), Schulman et al. (2017).
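PPO maximizes a clipped surrogate of this objective. A hedged numpy sketch of the clipped term from Schulman et al. (2017); the variable names are illustrative, not from the talk:

```python
import numpy as np

# Sketch of PPO's clipped surrogate objective (Schulman et al., 2017).
# logp_new / logp_old are per-timestep log-probabilities of the taken actions
# under the current and behavior policies; advantages are estimated returns
# minus a baseline.
def ppo_clip_objective(logp_new, logp_old, advantages, clip_eps=0.2):
    ratio = np.exp(logp_new - logp_old)                  # r_t(theta)
    clipped = np.clip(ratio, 1 - clip_eps, 1 + clip_eps)
    # Maximize the per-step minimum of the clipped and unclipped terms.
    return np.mean(np.minimum(ratio * advantages, clipped * advantages))
```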
Policy network: noisy observations (fingertip positions, object pose) and the goal are normalized, passed through a fully-connected ReLU layer and an LSTM, and mapped to an action distribution over finger joint positions.
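A hedged PyTorch sketch of this architecture; the layer sizes and the Gaussian action head are illustrative assumptions, not the talk's actual values:

```python
import torch
import torch.nn as nn

# Sketch of the policy on the slide: normalized noisy observations plus the
# goal -> fully-connected ReLU -> LSTM -> action distribution over finger
# joint position targets. Hidden sizes are guesses.
class Policy(nn.Module):
    def __init__(self, obs_dim, goal_dim, n_joints, hidden=256):
        super().__init__()
        self.norm = nn.LayerNorm(obs_dim + goal_dim)        # "Normalization"
        self.fc = nn.Linear(obs_dim + goal_dim, hidden)     # "Fully-connected ReLU"
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.mean = nn.Linear(hidden, n_joints)             # action distribution
        self.log_std = nn.Parameter(torch.zeros(n_joints))

    def forward(self, obs, goal, state=None):
        # obs, goal: (batch, time, dim) tensors
        x = torch.relu(self.fc(self.norm(torch.cat([obs, goal], dim=-1))))
        x, state = self.lstm(x, state)
        return torch.distributions.Normal(self.mean(x), self.log_std.exp()), state
```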
Distributed training with Rapid: rollout workers (6,000 CPU cores) collect experience and send it to optimizers (8 GPUs), which broadcast updated policy parameters back to the workers.
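A toy, self-contained sketch of this rollout-worker/optimizer pattern; the queues, the random "experience", and the averaging "update" are all stand-ins for Rapid's actual machinery:

```python
import multiprocessing as mp
import numpy as np

# Toy version of the slide's setup: workers pull the latest parameters,
# generate experience, and push it to a central optimizer loop.
def rollout_worker(params_q, exp_q):
    while True:
        params = params_q.get()
        if params is None:                    # shutdown sentinel
            break
        exp_q.put(params + np.random.randn(*params.shape))  # fake rollout data

def main(n_workers=4, n_iters=10):
    params_q, exp_q = mp.Queue(), mp.Queue()
    workers = [mp.Process(target=rollout_worker, args=(params_q, exp_q))
               for _ in range(n_workers)]
    for w in workers:
        w.start()
    params = np.zeros(8)
    for _ in range(n_iters):
        for _ in range(n_workers):
            params_q.put(params)              # broadcast parameters
        batch = [exp_q.get() for _ in range(n_workers)]
        params = np.mean(batch, axis=0)       # stand-in "optimizer" step
    for _ in range(n_workers):
        params_q.put(None)
    for w in workers:
        w.join()

if __name__ == "__main__":
    main()
```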
Domain Randomization: Sadeghi & Levine (2017); Tobin et al. (2017); Peng et al. (2018)
Physics randomizations (a hedged sampling sketch follows the list):
- object dimensions
- object and robot link masses
- surface friction coefficients
- robot joint damping coefficients
- actuator force gains
- joint limits
- gravity vector
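A sketch of sampling one randomized simulation, covering the parameter families above; all ranges and scales are invented for illustration, as the talk does not give the actual values:

```python
import numpy as np

# Hypothetical per-episode randomization of the physics parameters listed
# on the slide. Every range below is an illustrative assumption.
def sample_physics_randomization(rng=np.random):
    return {
        "object_size_scale": rng.uniform(0.95, 1.05),          # object dimensions
        "mass_scale": rng.lognormal(mean=0.0, sigma=0.1),      # object/link masses
        "friction_scale": rng.lognormal(mean=0.0, sigma=0.2),  # surface friction
        "joint_damping_scale": rng.lognormal(mean=0.0, sigma=0.3),
        "actuator_gain_scale": rng.lognormal(mean=0.0, sigma=0.2),
        "joint_limit_offset_rad": rng.normal(0.0, 0.02),       # joint limits
        "gravity_offset": rng.normal(0.0, 0.4, size=3),        # gravity vector
    }
```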
Vision network: each of the three camera images passes through its own Conv → Pool → ResNet → SSM (spatial softmax) branch; the branch outputs are concatenated and dense layers predict object position and object rotation. A hedged sketch follows.
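A PyTorch sketch of this multi-camera architecture; channel counts are guesses and a single conv layer stands in for the ResNet blocks:

```python
import torch
import torch.nn as nn

# Sketch of the pose-estimation network in the diagram: per-camera
# Conv -> Pool -> ResNet -> spatial softmax branches, concatenated,
# then dense heads for object position and rotation.
class SpatialSoftmax(nn.Module):
    # Converts each feature map into expected (x, y) image coordinates.
    def forward(self, x):
        b, c, h, w = x.shape
        probs = torch.softmax(x.view(b, c, h * w), dim=-1).view(b, c, h, w)
        ys = torch.linspace(-1, 1, h, device=x.device).view(1, 1, h, 1)
        xs = torch.linspace(-1, 1, w, device=x.device).view(1, 1, 1, w)
        return torch.cat([(probs * xs).sum((2, 3)), (probs * ys).sum((2, 3))], dim=1)

class PoseNet(nn.Module):
    def __init__(self, n_cameras=3, channels=32):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(3, channels, 5, stride=2), nn.ReLU(),
                nn.MaxPool2d(2),
                nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(),  # ResNet stand-in
                SpatialSoftmax(),
            ) for _ in range(n_cameras)
        ])
        self.position = nn.Linear(n_cameras * 2 * channels, 3)  # x, y, z
        self.rotation = nn.Linear(n_cameras * 2 * channels, 4)  # quaternion

    def forward(self, images):  # images: list of (B, 3, H, W), one per camera
        feats = torch.cat([b(img) for b, img in zip(self.branches, images)], dim=1)
        return self.position(feats), self.rotation(feats)
```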
Train in Simulation
(A) Distributed workers collect experience on randomized environments at large scale.
(B) We train a control policy using reinforcement learning. It chooses the next action based on fingertip positions and the object pose. [Diagram: observed robot states → LSTM → actions]
(C) We train a convolutional neural network to predict the object pose given three simulated camera images. [Diagram: three CONV branches → object pose]
Transfer to the Real World
(D) We combine the pose estimation network and the control policy to transfer to the real world. [Diagram: three CONV branches → object pose; fingertip locations + object pose → LSTM → actions]
Results
Randomizations   Object tracking    Max number of successes   Median number of successes
All              Vision             46                        11.5
All              Motion tracking    50                        13
None             Motion tracking    6                         0
Training time [plot: consecutive goals achieved (0–50) vs. years of simulated experience (1–100, log scale), comparing All Randomizations to No Randomizations]
Grasp types: Tip Pinch, Palmar Pinch, Tripod, Quadpod, Power Grasp, 5-finger Precision Grasp
Thank You Visit openai.com for more information. FOLLOW @OPENAI ON TWITTER