Representing Movement Primitives as Implicit Dynamical Systems - PowerPoint PPT Presentation

Representing Movement Primitives as Implicit Dynamical Systems learned from Multiple Demonstrations Robert Krug and Dimitar Dimitrov Center for Applied Autonomous Sensor Systems (AASS) Örebro University, Sweden robert.krug@oru.se Robert Krug ICAR 2013 1 / 12

Dynamical Movement Primitives (DMP) [Ijspeert et al., 2002] Feedback controllers in joint/task space . . . . . . formulated as one dynamical system per DoF: ˙ x ( t ) = f ( x ( t ) , s ( t )) Common phase variable s ( t ) to synchronize DoF Robert Krug ICAR 2013 2 / 12

Dynamical Movement Primitives (DMP) [Ijspeert et al., 2002] Feedback controllers in joint/task space . . . . . . formulated as one dynamical system per DoF: ˙ x ( t ) = f ( x ( t ) , s ( t )) Common phase variable s ( t ) to synchronize DoF � t “On-the-fly” motion profile generation: x ( t ) = 0 f ( x ( τ ) , s ( τ )) d τ Robert Krug ICAR 2013 2 / 12

Motivation Outline Motivation 1 Concept 2 Results 3 Contributions & Outlook 4 Robert Krug ICAR 2013 2 / 12

Motivation Why use primitive motion controllers? Generate desired motions for a platform with many DoF Shadow Hand & Arm with 24 DoF Robert Krug ICAR 2013 3 / 12

Motivation Why use primitive motion controllers? Generate desired motions for a platform with many DoF Controllers ˙ x = f ( x , s ) are state policies Replaces explicit planning Disturbance compensation Time synchronization of arbitrary many DoF Shadow Hand & Arm with 24 DoF Robert Krug ICAR 2013 3 / 12

Motivation Why use primitive motion controllers? Generate desired motions for a platform with many DoF Controllers ˙ x = f ( x , s ) are state policies Replaces explicit planning Disturbance compensation Time synchronization of arbitrary many DoF Motions resemble demonstrations Simple implementation Shadow Hand & Arm with 24 DoF Robert Krug ICAR 2013 3 / 12

Motivation What’s the problem? DMP [Ijspeert et al., 2002]: Stable spring excited by a learned control input u � q � ∈ R 2 x ( t ) = f ( x , s ) = Ax ( t ) + B u ( s ; p ) , x = ˙ q ˙ � �� spring learned p Robert Krug ICAR 2013 4 / 12

Motivation What’s the problem? DMP [Ijspeert et al., 2002]: Stable spring excited by a learned control input u � q � ∈ R 2 x ( t ) = f ( x , s ) = Ax ( t ) + B u ( s ; p ) , x = ˙ q ˙ � �� spring learned p Problem: One-shot learning → undesirable behavior in regions not covered by the demonstration Robert Krug ICAR 2013 4 / 12

Motivation What’s the problem? DMP [Ijspeert et al., 2002]: Stable spring excited by a learned control input u � q � ∈ R 2 x ( t ) = f ( x , s ) = Ax ( t ) + B u ( s ; p ) , x = ˙ q ˙ � �� spring learned p Problem: One-shot learning → undesirable behavior in regions not covered by the demonstration Solution: Capture different dynamics from multiple demonstrations [Ude et al., 2010][Forte et al., 2012] Robert Krug ICAR 2013 4 / 12

Motivation What’s the problem? DMP [Ijspeert et al., 2002]: Stable spring excited by a learned control input u � q � ∈ R 2 x ( t ) = f ( x , s ) = Ax ( t ) + B u ( s ; p ) , x = ˙ q ˙ � �� spring learned p Problem: One-shot learning → undesirable behavior in regions not covered by the demonstration Solution: Capture different dynamics from multiple demonstrations [Ude et al., 2010][Forte et al., 2012] Presented approach → locally optimal combination: D ∑ x ( t ) = Ax ( t )+ B λ d ( t ) u d ( s ; p d ) ˙ d = 1 Robert Krug ICAR 2013 4 / 12

Concept Outline Motivation 1 Concept 2 Results 3 Contributions & Outlook 4 Robert Krug ICAR 2013 4 / 12

Concept Re-compute the dynamical system online Optimize combination of pre-learned control inputs at each time step k . . . D ∑ x [ k ] = Ax [ k ]+ B λ d [ k ] u d [ k ] ˙ d = 1 . . . by minimizing a distance criterion between current and demonstrated states Robert Krug ICAR 2013 5 / 12

Concept Re-compute the dynamical system online Optimize combination of pre-learned control inputs at each time step k . . . D ∑ x [ k ] = Ax [ k ]+ B λ d [ k ] u d [ k ] ˙ d = 1 . . . by minimizing a distance criterion between current and demonstrated states States evolve “in between” demonstrations . . . . . . or get “pulled” onto them with dynamics governed by A Encodes different dynamics Robert Krug ICAR 2013 5 / 12

Concept Re-compute the dynamical system online Optimize combination of pre-learned control inputs at each time step k . . . D ∑ x [ k ] = Ax [ k ]+ B λ d [ k ] u d [ k ] ˙ d = 1 . . . by minimizing a distance criterion between current and demonstrated states States evolve “in between” demonstrations . . . . . . or get “pulled” onto them with dynamics governed by A Encodes different dynamics First step towards Model Predictive Control with state constraints Robert Krug ICAR 2013 5 / 12

Concept How does it work? Robert Krug ICAR 2013 6 / 12

Results Outline Motivation 1 Concept 2 Results 3 Contributions & Outlook 4 Robert Krug ICAR 2013 6 / 12

Results Generalization in simulation Robert Krug ICAR 2013 7 / 12

Results Disturbance rejection in simulation Robert Krug ICAR 2013 8 / 12

Results Evaluation on the Shadow Robot platform Grasp motions recorded with a sensorized glove . . . . . . and used to learn primitive controllers for the Shadow Hand Robert Krug ICAR 2013 9 / 12

Results Evaluation on the Shadow Robot platform Robert Krug ICAR 2013 10 / 12

Contributions & Outlook Outline Motivation 1 Concept 2 Results 3 Contributions & Outlook 4 Robert Krug ICAR 2013 10 / 12

Contributions & Outlook To sum up . . . Contributions: Learn motion controllers from multiple demonstrations . . . . . . and form a (locally) optimal combination to generate movements Allows to encode fundamentally different dynamics Predictable behavior without explicit costly motion planning! Robert Krug ICAR 2013 11 / 12

Contributions & Outlook To sum up . . . Contributions: Learn motion controllers from multiple demonstrations . . . . . . and form a (locally) optimal combination to generate movements Allows to encode fundamentally different dynamics Predictable behavior without explicit costly motion planning! Future work: Optimize over a time window → Model Predictive Control Incorporate spatial & temporal state space constraints (obstacle avoidance . . . ) Reactive on-line planning & control scheme [Anderson et al., 2012] Robert Krug ICAR 2013 11 / 12

Contributions & Outlook That’s it . . . Robert Krug ICAR 2013 12 / 12

References References Anderson, S., Karumanchi, S., and Iagnemma, K. (2012). Constraint-based planning and control for safe, semi-autonomous operation of vehicles. In IEEE Intelligent Vehicles Symposium, pages 383 – 388. Forte, D., Gams, A., Morimoto, J., and Ude, A. (2012). On-line motion synthesis and adaptation using a trajectory database. Robotics and Autonomous Systems, 60(10):1327 – 1339. Ijspeert, A., Nakanishi, J., and Schaal, S. (2002). Movement imitation with nonlinear dynamical systems in humanoid robots. In Proc. of the IEEE Int. Conf. on Robotics and Automation, volume 2, pages 1398 – 1403. Ude, A., Gams, A., Asfour, T., and Morimoto, J. (2010). Task-specific generalization of discrete and periodic dynamic movement primitives. IEEE Transactions on Robotics, 26(5):800 – 815. Robert Krug ICAR 2013 12 / 12

Representing Movement Primitives as Implicit Dynamical Systems - PowerPoint PPT Presentation

Representing Movement Primitives as Implicit Dynamical Systems learned from Multiple Demonstrations Robert Krug and Dimitar Dimitrov Center for Applied Autonomous Sensor Systems (AASS) rebro University, Sweden robert.krug@oru.se Robert Krug

Implicit Guarantees and Risk Taking: Implicit Guarantees and Risk Taking: Implicit Guarantees and

Implicit Bias Implicit bias Implicit bias refers to attitudes or stereotypes that affect our

Implicit Surfaces Implicit Surfaces An implicit surface is simply an iso-contour CIS 781 of a

RenderMan Primitives RenderMan Primitives CSCD 472? Slide 1 4/5/10 Primitive Attributes

Homotopy theories of dynamical systems Rick Jardine University of Western Ontario July 15, 2013

Continuous orbit equivalence rigidity Xin Li Dynamical systems and operator algebras Dynamical

Implicit Bias: Transcript Inclusive Teaching Series: Implicit Bias Welcome to the third module of

Implicit Extremes and Implicit MaxStable Laws Stilian Stoev ( sstoev@umich.edu ) University of

Multi-core Programming: Implicit Parallelism Tuukka Haapasalo April 16, 2009 Tuukka Haapasalo

Implicit Surfaces CPSC 599.86 / 601.86 Sonny Chan University of Calgary (some board work happened

Verilog HDL:Digital Design and Modeling Chapter 6 User-Defined Primitives Chapter 6

Implementing new Topology Mapping Primitives Guillermo Baltra Prior Work Primitives for

Beyond Block I/O: Rethinking / Traditional Storage Primitives Traditional Storage Primitives

Dynamical analysis of euclidean algorithms Introduction Dynamical analysis of euclidean

ANALYSIS of EUCLIDEAN ALGORITHMS An Arithmetical Instance of Dynamical Analysis Dynamical

ANALYSIS of EUCLIDEAN ALGORITHMS An Arithmetical Instance of Dynamical Analysis Dynamical

Canada Graduate Scholarship Masters (CGSM) 2020-21 Overview of Awards Frederick Banting

D. M. Therrell High School Norms This is a meeting of the GO Team. Only members of the team

GRADUATION CREDIT REQUIREMENTS Minimum Number of Credits for Regents Diploma and Regents Diploma

PASS Placement Advising for Student Success Presenters English/Writing Faculty Adviser David

AutoML for Object Detection Xiangyu Zhang MEGVII Research 1 AutoML for Advances in AutoML

Presentation skills: Grow vertically Generally, Companies dont ask for presentation. But it

Jersey Advertisement Pitch By Alex, Ethan and Charles AIM OF THE ADVERT. Advertise Jersey to

Discussion Paper on Substation - Switchgear Coordination based on IEC by Hermann Koch Chairman

Sambuz

Useful Links

Newsletter

Mail Us