ORF -MOSAIC M. Mahdi Ghazaei Ardakani*, Henrik Jörntell**, and Rolf Johansson* * Dep. of Automatic Control and ** Exp. Medical Science Group Lund University, Sweden
How it began! 2 LCCC Symposium, 2012-04-19 Mahdi Ghazaei
3 LCCC Symposium, 2012-04-19 Mahdi Ghazaei
Outline Introduction Cerebellum Different cerebellar models MOSAIC Problem formulation Modeling Methods Experimental Design Simulations Results Discussion Conclusions LCCC Symposium, 2012-04-19 Mahdi Ghazaei 4
Cerebellum and its Role [1 ] [1 ] Modified from Neuroscie, 3rd Ed. [2] Courtesy of H.Jörntell [2] LCCC Symposium, 2012-04-19 Mahdi Ghazaei 5
Microznoe circuitry [1 ] Figure form Handbook of Robotics, Springer 2008, LCCC Symposium, 2012-04-19 Mahdi Ghazaei 6
Principles in Cerebellum Feedforward processing Divergence and convergence Modularity Plasticity LCCC Symposium, 2012-04-19 Mahdi Ghazaei 7
Computational Models Cerebellar Model Articulation Controller (CMAC) Adjustable Pattern Generator (APG) Schweighofer-Arbib Cerebellar feedback-error-learning model (CBFELM) Multiple paired forward-inverse model (MPFIM) [1] Figure adapted form Handbook of Robotics, Springer 2008 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 8
MOSAIC Structure Internal Model Forward Models Inverse Models Modularity Adaptation Reduction of motor error Efference copy Spin-offs HMM-MOSAIC HMOSAIC e-MOSAIC MMRL AMA-MOSAICI [1] Figure from D.M. Wolpert and M. Kawato, Neural Net.,11:1325, 1998 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 9
MOSAIC based Models Original MOSAIC Tested for switching between 3 objects, and generalizing to a new one No of modules are manually tuned Requires careful tuning of parameters The quality of forward models are critical HMM MOSAIC Same experiments as above Probabilistic model using HMM heavy computation Fixed to linear forward models Originally in batch mode Improved parameter tuning and resp. estimation by EM LCCC Symposium, 2012-04-19 Mahdi Ghazaei 10
MOSAIC based Model HMOSAIC Same experiments Two layers of MOSAIC Higher layer provides estimate of prior probabilities eMOSAIC Humanoid robot control LQG for controllers Forward models replaced by Kalman filters No adaptation AMA-MOSAICI Sit-to-stand control Clustering algorithm for determining no of modules Clustering and training Off-line MMRL Controllers are replaced by RL agents Discrete and continuous case Self-organization of modules Haunting task in a grid world and controlling an inverted pendulum LCCC Symposium, 2012-04-19 Mahdi Ghazaei 11
MOSAIC based Model Toy problems More serious problems Resorting to classic controllers or RL Simplification No dealy No adaptation LCCC Symposium, 2012-04-19 Mahdi Ghazaei 12
Objectives Investigation of the applicability of a biologically inspired model of cerebellum to deal with: More complex embodiments Less accurate models (delays, noise, …) [1] Figure from F.M.M.O. Camposa, J.M.F. Calado, Ann. Rev. In Control 33 (2009), 70 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 13
Problem Statement ORF-MOSAIC as a biologically inspired cerebellar model to adaptively control a human-like robotic arm with potential delays LCCC Symposium, 2012-04-19 Mahdi Ghazaei 14
Methodology Choose models Validation Simulation Fix assumptions Models as faithful as possible to biology Fixes according well-established theories in control engineering LCCC Symposium, 2012-04-19 Mahdi Ghazaei 15
Modeling Motor Cortex (Trajectory Generation, …) Cerebellum (MOSAIC ) Sensory System and Lower Motor Control Arm Muscle Systems LCCC Symposium, 2012-04-19 Mahdi Ghazaei 16
Lower Motor Control LCCC Symposium, 2012-04-19 Mahdi Ghazaei 17
Musculoskletal Model Preserves essential features of an arm Mono-articular and bi-articular Hill-type muscle model [1] Figure from Handbook of brain theory, MIT press, 1995 LCCC Symposium, 2012-04-19 Mahdi Ghazaei 18
Assumptions Minimum jerk trajectory by CTX Planning in task space, control in joint space and transformation to muscle space Minimum tension principle for muscle control Internal model LCCC Symposium, 2012-04-19 Mahdi Ghazaei 19
Methods Linear models representing cellular structure Low level control represented by a feedback controller and a transformation Approximation to known adaptive controllers LCCC Symposium, 2012-04-19 Mahdi Ghazaei 20
Customization of MOSAIC Introduction of receptive fields for modules Dissociation of adaptation from control in a module Why different modules? Taking care of different subtasks which are domains in state space Plasticity role? Adapt existing internal models to cope with small changes in plant To acquire new skills but no retention LCCC Symposium, 2012-04-19 Mahdi Ghazaei 21
Cerebellar Controller n 2 1 τ ff .. . Σ θ d , θ d , θ d Linear Inverse Model × . ( θ , θ ) 0 RF 1 × .. . θ , θ , θ Linear Forward τ copy Prediction Likelihood Model λ Soft max LCCC Symposium, 2012-04-19 Mahdi Ghazaei 22
Model of Arm LCCC Symposium, 2012-04-19 Mahdi Ghazaei 23
Minimum Tension Controller Convex optimization With some mathematical tricks reformulated to a quadratic programming problem LCCC Symposium, 2012-04-19 Mahdi Ghazaei 24
Simulation of Arm Constant muscle activation LCCC Symposium, 2012-04-19 Mahdi Ghazaei 25
Simulation of Arm Constant muscle activation States vs. time 0.7 400 u 1 u 2 200 0.6 u 3 0 Normalized Muscle Activation u 4 0.5 u 5 -200 u 6 0.4 -400 0.3 -600 q 1 0.2 q 2 -800 w 1 0.1 w 2 -1000 0 -1200 0 1 2 3 4 5 0 1 2 3 4 5 Time (s) Time (s) LCCC Symposium, 2012-04-19 Mahdi Ghazaei 26
Simulation of Arm Minimum Tension LCCC Symposium, 2012-04-19 Mahdi Ghazaei 27
Simulation of Arm Minimum Tension States vs. time 150 1 u 1 u 2 100 0.8 u 3 Normalized Muscle Activation u 4 50 u 5 0.6 u 6 0 0.4 -50 0.2 q 1 -100 q 2 w 1 0 -150 w 2 -0.2 -200 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Time (s) Time (s) LCCC Symposium, 2012-04-19 Mahdi Ghazaei 28
End-2-end Simulation Trajectory generator .. . X θ , θ , θ 30 Inverse Kinematics 30 Cerebellar .. . Controller θ d , θ d , θ d τ sp u T τ C3/C4 Arm + Muscles A(q) Synergy dynamics Spinal Cord . θ d , θ d Feedback feedback . Controller motor cmd θ , θ . θ , θ LCCC Symposium, 2012-04-19 Mahdi Ghazaei 29
Final Experimental Design 30 [ms] delay in the path to the cerebellar controller PD controller with stiffness parameter of human arm and no delay Movement 0.65 [s] , wait for 0.65 [s] 16 modules in a15x15 [cm] workspace External translation invariant force field in task space Object : 60 [cm] rod with 2 [Kg], perpendicular to the arm LCCC Symposium, 2012-04-19 Mahdi Ghazaei 30
Mapping of modules to workspace Numbering and color coding of Samples of receptive fields in modules static configuration LCCC Symposium, 2012-04-19 Mahdi Ghazaei 31
Hand Trajectory Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 32
Feedback & Feedforward contributions Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 33
Contributions from modules Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 34
Controlling Modules Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 35
External Field Test Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 36
Handling an Object Adaptation Before Training After Training LCCC Symposium, 2012-04-19 Mahdi Ghazaei 37
Parameters Across Modules No object With an object LCCC Symposium, 2012-04-19 Mahdi Ghazaei 38
Parameters Across Modules w/ External Field LCCC Symposium, 2012-04-19 Mahdi Ghazaei 39
Different trajectory 0.48 0.48 0.46 0.46 0.44 0.44 y(m) y(m) 0.42 0.42 0.4 0.4 0.38 0.38 0.36 0.36 0.16 0.18 0.2 0.22 0.24 0.26 0.28 0.16 0.18 0.2 0.22 0.24 0.26 0.28 x(m) x(m) 40 LCCC Symposium, 2012-04-19 Mahdi Ghazaei
Discussion Learning of non-linearities by the cerbellar controller Specialization of modules with 7-parameters to different areas of the force field Trade-offs Unit complexity vs. the number of modules Unit adaptation vs. effective switching or combination How to localize the model in cerebellum and brain? Microzones and modules Biologically plausible signals 41 LCCC Symposium, 2012-04-19 Mahdi Ghazaei
Conclusion Cerebellar Model for Control Inspired by the Microzonal Structure Arm Model with Musculo-skeletal structure Adaptation to the changes in the load and external disturbances despite delay Highly sparse representation with not full knowledge of the model as a priority Model for distributed control and adaptation 42 LCCC Symposium, 2012-04-19 Mahdi Ghazaei
Recommend
More recommend