orf mosaic
play

ORF -MOSAIC M. Mahdi Ghazaei Ardakani*, Henrik Jrntell**, and Rolf - PowerPoint PPT Presentation

ORF -MOSAIC M. Mahdi Ghazaei Ardakani*, Henrik Jrntell**, and Rolf Johansson* * Dep. of Automatic Control and ** Exp. Medical Science Group Lund University, Sweden How it began! 2 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 3 LCCC Symposium,


  1. ORF -MOSAIC M. Mahdi Ghazaei Ardakani*, Henrik Jörntell**, and Rolf Johansson* * Dep. of Automatic Control and ** Exp. Medical Science Group Lund University, Sweden

  2. How it began! 2 LCCC Symposium, 2012-04-19 Mahdi Ghazaei

  3. 3 LCCC Symposium, 2012-04-19 Mahdi Ghazaei

  4. Outline  Introduction  Cerebellum  Different cerebellar models  MOSAIC  Problem formulation  Modeling  Methods  Experimental Design  Simulations  Results  Discussion  Conclusions LCCC Symposium, 2012-04-19 Mahdi Ghazaei 4

  5. Cerebellum and its Role [1 ] [1 ] Modified from Neuroscie, 3rd Ed. [2] Courtesy of H.Jörntell [2] LCCC Symposium, 2012-04-19 Mahdi Ghazaei 5

  6. Microznoe circuitry [1 ] Figure form Handbook of Robotics, Springer 2008, LCCC Symposium, 2012-04-19 Mahdi Ghazaei 6

  7. Principles in Cerebellum  Feedforward processing  Divergence and convergence  Modularity  Plasticity LCCC Symposium, 2012-04-19 Mahdi Ghazaei 7

  8. Computational Models  Cerebellar Model Articulation Controller (CMAC)  Adjustable Pattern Generator (APG)  Schweighofer-Arbib  Cerebellar feedback-error-learning model (CBFELM)  Multiple paired forward-inverse model (MPFIM) [1] Figure adapted form Handbook of Robotics, Springer 2008 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 8

  9. MOSAIC Structure  Internal Model  Forward Models  Inverse Models  Modularity  Adaptation  Reduction of motor error  Efference copy  Spin-offs  HMM-MOSAIC  HMOSAIC  e-MOSAIC  MMRL  AMA-MOSAICI [1] Figure from D.M. Wolpert and M. Kawato, Neural Net.,11:1325, 1998 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 9

  10. MOSAIC based Models  Original MOSAIC  Tested for switching between 3 objects, and generalizing to a new one  No of modules are manually tuned  Requires careful tuning of parameters  The quality of forward models are critical  HMM MOSAIC  Same experiments as above  Probabilistic model using HMM  heavy computation  Fixed to linear forward models  Originally in batch mode  Improved parameter tuning and resp. estimation by EM LCCC Symposium, 2012-04-19 Mahdi Ghazaei 10

  11. MOSAIC based Model  HMOSAIC Same experiments  Two layers of MOSAIC  Higher layer provides estimate of prior probabilities   eMOSAIC Humanoid robot control   LQG for controllers  Forward models replaced by Kalman filters No adaptation   AMA-MOSAICI  Sit-to-stand control  Clustering algorithm for determining no of modules  Clustering and training Off-line  MMRL  Controllers are replaced by RL agents  Discrete and continuous case  Self-organization of modules  Haunting task in a grid world and controlling an inverted pendulum LCCC Symposium, 2012-04-19 Mahdi Ghazaei 11

  12. MOSAIC based Model  Toy problems  More serious problems  Resorting to classic controllers or RL  Simplification  No dealy  No adaptation LCCC Symposium, 2012-04-19 Mahdi Ghazaei 12

  13. Objectives  Investigation of the applicability of a biologically inspired model of cerebellum to deal with:  More complex embodiments  Less accurate models (delays, noise, …) [1] Figure from F.M.M.O. Camposa, J.M.F. Calado, Ann. Rev. In Control 33 (2009), 70 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 13

  14. Problem Statement ORF-MOSAIC as a biologically inspired cerebellar model to adaptively control a human-like robotic arm with potential delays LCCC Symposium, 2012-04-19 Mahdi Ghazaei 14

  15. Methodology Choose models Validation Simulation Fix assumptions  Models as faithful as possible to biology  Fixes according well-established theories in control engineering LCCC Symposium, 2012-04-19 Mahdi Ghazaei 15

  16. Modeling  Motor Cortex (Trajectory Generation, …)  Cerebellum (MOSAIC )  Sensory System and Lower Motor Control  Arm  Muscle Systems LCCC Symposium, 2012-04-19 Mahdi Ghazaei 16

  17. Lower Motor Control LCCC Symposium, 2012-04-19 Mahdi Ghazaei 17

  18. Musculoskletal Model  Preserves essential features of an arm  Mono-articular and bi-articular  Hill-type muscle model [1] Figure from Handbook of brain theory, MIT press, 1995 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 18

  19. Assumptions  Minimum jerk trajectory by CTX  Planning in task space, control in joint space and transformation to muscle space  Minimum tension principle for muscle control  Internal model LCCC Symposium, 2012-04-19 Mahdi Ghazaei 19

  20. Methods  Linear models representing cellular structure  Low level control represented by a feedback controller and a transformation  Approximation to known adaptive controllers LCCC Symposium, 2012-04-19 Mahdi Ghazaei 20

  21. Customization of MOSAIC  Introduction of receptive fields for modules  Dissociation of adaptation from control in a module  Why different modules?  Taking care of different subtasks which are domains in state space  Plasticity role?  Adapt existing internal models to cope with small changes in plant  To acquire new skills but no retention LCCC Symposium, 2012-04-19 Mahdi Ghazaei 21

  22. Cerebellar Controller n 2 1 τ ff .. . Σ θ d , θ d , θ d Linear Inverse Model × . ( θ , θ ) 0 RF 1 × .. . θ , θ , θ Linear Forward τ copy Prediction Likelihood Model λ Soft max LCCC Symposium, 2012-04-19 Mahdi Ghazaei 22

  23. Model of Arm LCCC Symposium, 2012-04-19 Mahdi Ghazaei 23

  24. Minimum Tension Controller  Convex optimization  With some mathematical tricks reformulated to a quadratic programming problem LCCC Symposium, 2012-04-19 Mahdi Ghazaei 24

  25. Simulation of Arm Constant muscle activation LCCC Symposium, 2012-04-19 Mahdi Ghazaei 25

  26. Simulation of Arm Constant muscle activation States vs. time 0.7 400 u 1 u 2 200 0.6 u 3 0 Normalized Muscle Activation u 4 0.5 u 5 -200 u 6 0.4 -400 0.3 -600 q 1 0.2 q 2 -800 w 1 0.1 w 2 -1000 0 -1200 0 1 2 3 4 5 0 1 2 3 4 5 Time (s) Time (s) LCCC Symposium, 2012-04-19 Mahdi Ghazaei 26

  27. Simulation of Arm Minimum Tension LCCC Symposium, 2012-04-19 Mahdi Ghazaei 27

  28. Simulation of Arm Minimum Tension States vs. time 150 1 u 1 u 2 100 0.8 u 3 Normalized Muscle Activation u 4 50 u 5 0.6 u 6 0 0.4 -50 0.2 q 1 -100 q 2 w 1 0 -150 w 2 -0.2 -200 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Time (s) Time (s) LCCC Symposium, 2012-04-19 Mahdi Ghazaei 28

  29. End-2-end Simulation Trajectory generator .. . X θ , θ , θ 30 Inverse Kinematics 30 Cerebellar .. . Controller θ d , θ d , θ d τ sp u T τ C3/C4 Arm + Muscles A(q) Synergy dynamics Spinal Cord . θ d , θ d Feedback feedback . Controller motor cmd θ , θ . θ , θ LCCC Symposium, 2012-04-19 Mahdi Ghazaei 29

  30. Final Experimental Design  30 [ms] delay in the path to the cerebellar controller  PD controller with stiffness parameter of human arm and no delay  Movement 0.65 [s] , wait for 0.65 [s]  16 modules in a15x15 [cm] workspace  External translation invariant force field in task space  Object : 60 [cm] rod with 2 [Kg], perpendicular to the arm LCCC Symposium, 2012-04-19 Mahdi Ghazaei 30

  31. Mapping of modules to workspace Numbering and color coding of Samples of receptive fields in modules static configuration LCCC Symposium, 2012-04-19 Mahdi Ghazaei 31

  32. Hand Trajectory Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 32

  33. Feedback & Feedforward contributions Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 33

  34. Contributions from modules Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 34

  35. Controlling Modules Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 35

  36. External Field Test Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 36

  37. Handling an Object Adaptation Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 37

  38. Parameters Across Modules No object With an object LCCC Symposium, 2012-04-19 Mahdi Ghazaei 38

  39. Parameters Across Modules w/ External Field LCCC Symposium, 2012-04-19 Mahdi Ghazaei 39

  40. Different trajectory 0.48 0.48 0.46 0.46 0.44 0.44 y(m) y(m) 0.42 0.42 0.4 0.4 0.38 0.38 0.36 0.36 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.16 0.18 0.2 0.22 0.24 0.26 0.28 x(m) x(m) 40 LCCC Symposium, 2012-04-19 Mahdi Ghazaei

  41. Discussion  Learning of non-linearities by the cerbellar controller  Specialization of modules with 7-parameters to different areas of the force field  Trade-offs  Unit complexity vs. the number of modules  Unit adaptation vs. effective switching or combination  How to localize the model in cerebellum and brain?  Microzones and modules  Biologically plausible signals 41 LCCC Symposium, 2012-04-19 Mahdi Ghazaei

  42. Conclusion  Cerebellar Model for Control Inspired by the Microzonal Structure  Arm Model with Musculo-skeletal structure  Adaptation to the changes in the load and external disturbances despite delay  Highly sparse representation with not full knowledge of the model as a priority  Model for distributed control and adaptation 42 LCCC Symposium, 2012-04-19 Mahdi Ghazaei

Recommend


More recommend