robot learning
play

Robot learning from few demonstrations by exploiting the structure - PowerPoint PPT Presentation

Robot learning from few demonstrations by exploiting the structure and geometry of data Sylvain Calinon Senior Researcher Idiap Research Institute, Martigny, Switzerland Lecturer EPFL, Lausanne, Switzerland External Collaborator IIT,


  1. Robot learning from few demonstrations by exploiting the structure and geometry of data Sylvain Calinon Senior Researcher Idiap Research Institute, Martigny, Switzerland Lecturer EPFL, Lausanne, Switzerland External Collaborator IIT, Genoa, Italy

  2. Artificial Intelligence for Society Research Groups: • Speech & Audio Processing • Perception & Activity Understanding • Computer Vision & Learning • Social Computing • Biometric Person Recognition • Applied Machine Learning • Natural Language Processing MARTIGNY • Robot Learning & Interaction Research • Computational Bioimaging Education • Uncertainty Quantification and Optimal Design Technology transfer

  3. Learning from demonstration as an intuitive interface to transfer skills to robots

  4. Learning from demonstration - Challenges

  5. Finding Priors that are expressive enough to be used in a wide range of tasks

  6. Prior 1: Movements are smooth and continuous Prior 2: Actions often relate to objects, tools or body landmarks Prior 3: Data spaces in robotics have geometries and structures

  7. Movement generation as a mix of clustering, subspace analysis and optimal control Walking Running Walking We look for a compact and modular representation of continuous movements and skills that can learn from few interactions (with user and environment), that can exploit variation and coordination , and that can adapt to new situations in a fast manner.

  8. Learning of motions from few demonstrations center covariance matrix Global sharing of local coordination patterns with: Dictionary of coordination patterns: [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

  9. Learning minimal intervention controllers Use low control commands! Track path! System plant state variable (position+velocity) control command (acceleration) Approach: Using control formalism in task space to tracking weight matrix solve analytically a basic control weight matrix form of model predictive control (MPC) with a double integrator as constant linear system [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

  10. Learning minimal intervention controllers Use low control commands! Track path! System plant [Tanwani and Calinon, IEEE RA-L 1(1), 2016]

  11. Learning minimal intervention controllers  Analytical solution to generate Transition and state duration motion control by following a (HSMM) minimal intervention principle Stepwise reference with: [Calinon , Bruno and Caldwell, ICRA’2014]

  12. Application: Editing motions with variations User interface to edit and generate natural and dynamic motions by considering variation and coordination Compliant controller to retrieve safe and human-like motions Daniel Berio Frederic Fol Leymarie [Berio, Calinon and Leymarie , IROS’2016] [ Berio, Calinon and Leymarie , MOCO’2017]

  13. I-DRESS project Personalized assistance using haptic and visual information, with compliant controllers following a minimal intervention principle Dressing skills require some aspects to be time-independent, while other aspects are time- dependent for the generation of movements. Emmanuel Pignat [Pignat and Calinon, RAS 93, 2017]

  14. Prior 1: Movements are smooth and continuous Prior 2: Actions often relate to objects, tools or body landmarks Prior 3: Data spaces in robotics have geometries and structures

  15. Prior 2: Actions often relate to objects, tools or body landmarks Photo: Basilio Noris

  16. Regression with a Task-parameterized motions context variable c : • Learning of • Retrieval with  Generic approach, but limited generalization capability

  17. Control in multiple coordinate systems Track path in coordinate system j Use low control commands! New position and 2 2 orientation of coordinate 2 2 systems 1 and 2 2 Two candidate 1 1 coordinate systems (P=2) Set of demonstrations Reproduction in new situation [Calinon , HFR’2016]

  18. Control in multiple coordinate systems Control in multiple coordinate systems Track path in coordinate system j Use low control commands! In many robotics problems, the parameters describing the task or situation can be interpreted as coordinate systems 2 1 [Calinon , HFR’2016]

  19. Control in multiple coordinate systems Track path in coordinate system j Use low control commands!  Learning of a controller (instead of learning a trajectory) that adapts to new situations while regulating the gains according to the precision and coordination patterns required by the task [Calinon , HFR’2016]

  20. Control in multiple coordinate systems Track path in coordinate system j Use low control commands!  Retrieval of control commands in the form of trajectory distributions, facilitating exploration and adaptation (in either control or state space) [Calinon , HFR’2016]

  21. I-DRESS project SNSF, CHIST-ERA (2015-2018) [Canal, G., Pignat, E., Alenya, G, Calinon, S. and Torras , C., ICRA’2018]

  22. I-DRESS project SNSF, CHIST-ERA (2015-2018) [Canal, Pignat, Alenya, Calinon and Torras , ICRA’2018]

  23. http://dexrov.eu EC, H2020 (2015-2018)

  24. Exploitation in shared control Teleoperator side Robot side only Gaussian ID is transmitted Dr Andras Kupcsik Dr Ioannis Havoutis [Havoutis and Calinon, Autonomous Robots, 2018]

  25. Adaptation to different object shapes Coordinate system as task parameter [Calinon, Alizadeh and Caldwell, IROS’2013]

  26. Bimanual coordination and co-manipulation [Rozo et al., IROS’2015] [Silvério et al., IROS’2015] [Rozo et al., IEEE T-RO 32(3), 2016] Dr Leonel Rozo Dr João Silvério

  27. Learning & generalizing tasks prioritization Priority on left hand Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

  28. Learning & generalizing tasks prioritization Priority on right hand Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

  29. Learning & generalizing tasks prioritization Equal priority Demonstration Demonstration Reproduction Reproduction Candidate hierarchy Candidate hierarchy [Silvério, Calinon, Rozo and Caldwell (2018), Arxiv 1707.06791] [Calinon , ISRR’15]

  30. Prior 1: Movements are smooth and continuous Prior 2: Actions often relate to objects, tools or body landmarks Prior 3: Data spaces in robotics have geometries and structures

  31. Prior 3: Data spaces in robotics have geometries and structures

  32. Motivation of using Riemannian manifolds

  33. Interpolation on Riemannian manifolds Orientation (unit quaternions) Rigid body motions (position+orientation) Covariance features, inertia and gain matrices, manipulability ellipsoids, trajectory distributions (symmetric positive definite matrices)

  34. Clustering on Riemannian manifolds Covariance features, inertia and gain matrices, manipulability ellipsoids, trajectory distributions (symmetric positive definite matrices) Orientation (unit quaternions) Rigid body motions (position+orientation)

  35. Regression on Riemannian manifolds Gaussian mixture regression (GMR) to compute from the joint distribution encoded as a GMM → Regression for orientation data (unit quaternions on )

  36. Regression with orientation and position data Four demonstrations of coordinated bimanual movement [Zeestraten, Havoutis, Silvério, Calinon and Caldwell, IEEE RA-L 2(3), 2017]

  37. Regression with orientation and position data Four reproductions with perturbations by the user [Zeestraten, Havoutis, Silvério, Calinon and Caldwell, IEEE RA-L 2(3), 2017]

  38. Regression with sEMG sensory data TACT-HAND SNSF, D-A-CH (2016-2019) Noémie Jaquier Surface Transformation in spatial Control of the electromyography covariances corresponding ( sEMG ) measurements (SPD matrices) hand pose [Jaquier and Calinon, IROS 2017]

  39. Comparison: standard GMR vs geometric GMR sEMG data from Ninapro database processed as spatial covariances: 12 Input 4 Output [Jaquier and Calinon, IROS 2017]

  40. Manipulability ellipsoid tracking Noémie Jaquier [N. Jaquier, L. Rozo, D.G. Caldwell and S. Calinon , RSS’2018]

  41. Conclusion Combining statistical learning techniques and model predictive control provides a generative approach to the transfer of skills and movements Statistical learning in multiple coordinate systems can be exploited to learn robot skills and movements from few demonstrations, with adaptation to new situations Robotics is rich in structures and geometries that can be exploited to acquire skills and movements from a small set of interactions (with user or environment)

Recommend


More recommend