Categorization by Sensory-Motor Interaction in Artificial Agents - PDF document

Categorization by Sensory-Motor Interaction in Artificial Agents Martin Tak´ aˇ c Dept. of Applied Informatics Faculty of Mathematics, Physics, and Informatics Comenius University, Slovakia takac@ii.fmph.uniba.sk http://www.fmph.uniba.sk/ ∼ takac

Abstract We propose a computational model of categorization, grounding the categories in sensory-motor interaction with a dynamical environment. Simple perceptual categories are represented as discrimination criteria – membership functions based on dis- tributional information about intra-cluster variances of properties of a category, the representation more effec- tive for predictions than prototypes. Complex categories are represented as cross-categorial associations of criteria of objects, actions and changes, hence, they support action-based inferences and can serve as grounded meanings not only for nouns and ad- jectives, but also for at least elementary verbs in models of language evolution and acquisition.

The Goal 1. Propose and test cognitively relevant representation of various types of categories that could serve as grounded meanings in language models. 2. Propose and test mechanisms of the formation of categories by sensory-motor interaction with a (simulated) dynamic environment. Cognitive Relevance The proposed model is consistent with the following findings/hypotheses: • Perceptual symbols, context sensitivity of representation (Barsalou). • Basic-level categories, prototype effects (Rosch). • Geometric conceptual representation (G¨ ardenfors). • Importance of similarity (Tversky). • Representation of affordances (Gibson) and verb- islands (Tomasello). • Neuroscience: simple categories in perceptual sub- systems (Ungerleider and Mishkin, Orban et al., Ri- zolatti et al.), complex categories in association ar- eas.

The model Environment Non-toroidal 2D grid with objects having properties that can change in (discrete) time. The agent senses objects on the grid within some distance from itself. Actions The agent has a repertoire of actions (e.g. touch, lift, move), which can be performed with particular parameters, e.g. pushing with different forces, stretching the arm at a different angle, walking with different sizes of step etc. Actions performed on an object cause changes of it’s attribute values. Perception The sensations of the agent are in the form of perceptual frames of objects, actions and changes. Object frame example: {� weight : 10 � , � size : 3 � , � posX : 2 � , � posY : 6 �} Action frame example: � moveBy, {� x : 0 � , � y : − 8 �}�

The action type (e.g. moveBy) is the abstraction of a non-declarative agent’s knowledge of the action – a motor stereotype of invariant characteristics of the action, while action parameters are varying characteristics of a particular execution of the action. Change frame example: {� ∆ posX : − 2 �} Representation of Categories Categories are represented by discrimination criteria . Each discrimination criterion representing a particular concept is a membership function that takes a perceptual frame as an argument and returns a value from the closed interval [0 , 1], expressing to what extent is the frame an instance of the concept (0 means not at all, 1 means the best, prototypical example). A discrimination criterion records the mean and variance of each attribute common to all instances of the concept seen so far. The membership function r evaluates the similarity of the input percept f with the mean case of the category inversely weighted by the variances of particular attributes a : r ( f ) = sim ( r, f ) = e − k dist ( r,f )

where � ( f.a − r.a ) 2 � 1 � � � dist ( r, f ) = σ 2 ( r.a ) | A r | a ∈ A r Categorization Process Objects and actions are grouped to categories by the change. That is, if an action leads to the same change on several objects, they will all fall in the same category and vice versa. All action categories associated with some object category represent agent’s knowledge of affordances of the object, while all object categories associated with an action category form the precursor of a verb-centered semantic representation – a verb island . For a perceived object, action and change, the most similar of the stored cross-categorial associations is found. If the change category of the association is similar enough to the perceived change (the similarity is bigger than the threshold θ ( t )), the percepts are considered to be the instances of the associated categories and all three categories are updated by the percepts. Otherwise, a new category is created for the less similar percept of either the object, or the action. The prediction threshold θ ( t ) increases in time to model the child’s growing ability to distinguish differences in the environment.

Experiment The 25 × 25 environment contained the agent and 30 randomly placed objects – 10 ”fruits”, 10 ”toys” and 10 ”pieces of furniture”. The initial values of object attributes were randomly generated from respective in- tervals of a predefined pattern, e.g. { weight : [1, 3], size : [1, 49], color : [0, 4], roundness : [0, 9], posX : [0, 24], posY : [0, 24], posZ : 0 } for fruits, and { weight : [20, 49], size : [20, 49], legs : [0, 4], material : [0, 9], posX : [0, 24], posY : [0, 24], posZ : 0 } for pieces of furniture. In each time step, the agent randomly chose one of the objects and performed on it an action randomly generated from the pattern � actionType : liftUp , { armPosIncrease : [1, 9], force : [1, 19] } � or � actionType : putDown , { armPosDecrease : [1, 9] } � . The effects of the action on the chosen object were simulated by the environment: the action liftUp lead to increase of the posZ attribute of the object by the value of armPosIncrease , if the force was greater than the weight of the object, otherwise the action had no effect. In the case of putDown action, the posZ attribute of the object was set to max(0 , posZ – armPosDecrease ).

The Results Threshold Effect (a) 2 30 prediction, generality, threshold 1.8 25 total number of criteria 1.6 1.4 20 1.2 1 15 0.8 10 0.6 prediction 0.4 generality 5 threshold 0.2 criteria 0 0 0 1000 2000 3000 4000 5000 time (a) While the prediction threshold is low, the agent only uses a few basic criteria. Then the number of criteria starts to rapidly increase, which leads to a better accuracy of the prediction. As the threshold stabilizes, the total number of criteria slowly saturates, together with the generality exponentially decaying to a certain value. The prediction value converges to approximately 0.7 corresponding to the average distance σ , which is an average intra-cluster distance of the category, hence the criteria give correct predictions.

Merging (b) 2 25 prediction, generality, threshold 1.8 total number of criteria 1.6 20 1.4 1.2 15 1 0.8 10 0.6 prediction 0.4 5 generality threshold 0.2 criteria 0 0 0 1000 2000 3000 4000 5000 time (b) Merging of similar criteria keeps the number of criteria lower at the cost of lower accuracy (higher generality). In the second experiment, the agent merges similar criteria every 50th time step since the time 1000. This decreases the total number of the criteria at the cost of more general predictions. The prediction value again converges to approximately 0.7, i.e. the criteria give correct predictions.

Comparison to Prototypes (c) 1.6 50 prediction, generality, threshold 45 1.4 total number of criteria 40 1.2 35 1 30 0.8 25 20 0.6 15 0.4 prediction 10 generality 0.2 threshold 5 criteria 0 0 0 1000 2000 3000 4000 5000 time (c) The proposed representation copes with a different importance of attributes (or scaling of different dimen- sions) by recording their intra-category variances. In order to compare it with a standard prototype representation, we ran an experiment, where criteria behaved like prototypes in conceptual spaces (in that the terms of the sum in the formula for computing the distance were not divided by the variances). Despite that the criteria were merged, the number of necessary criteria is almost double and the prediction value is lower than in the case with variances.

Example of resulting categories Object criteria: posX posY posZ weight color C1 13 ± 7 13 ± 8 0 ± 0 37 ± 23 C2 14 ± 7 13 ± 8 4 ± 13 39 ± 22 C3 11 ± 6 10 ± 7 35 ± 28 4 ± 3 2 ± 2 C4 11 ± 6 10 ± 7 25 ± 23 4 ± 3 Associations: Action Category putDown (5 ± 3) liftUp (6 ± 2 , 10 ± 6) C1 no change C2 no change C3 ∆= { posZ : − 6 ± 1 } ∆= { posZ : 7 ± 1 } C4 ∆= { posZ : − 4 ± 2 } ∆= { posZ : 5 ± 2 } Number of objects of each type for a category they are most similar to: Category Object type C1 C2 C3 C4 agent 1 fruit 8 2 toy 1 3 6 furniture 5 5

Categorization by Sensory-Motor Interaction in Artificial Agents - PDF document

Categorization by Sensory-Motor Interaction in Artificial Agents Martin Tak a c Dept. of Applied Informatics Faculty of Mathematics, Physics, and Informatics Comenius University, Slovakia takac@ii.fmph.uniba.sk http://www.fmph.uniba.sk/

Sensory Processing, Self- Self-Regulation and Sensory Processing Sensory Motor Preferences

Sensory Replacement What is it? Related Devices Can it be applied to the art world? Sensory

Categorization Categorization is the basis of structure and meaning in our world. We

Lab 8. Speed Control of a Dc motor The Motor Drive Motor Speed Control Project 1. Generate PWM

Introduction to Sensory Processing Sensory Integration (SI) is the automatic ability to; What is

Sensory Processing Disorder Sensory Modulation Disorder - (Hyper / Hypo/ Seeking) Sensory

Lab 9. Speed Control of a D.C. motor Sensing Motor Speed (Tachometer Frequency Method) Motor

Lab 11. Speed Control of a D.C. motor Motor Characterization Motor Speed Control Project

Lab 11. Speed Control of a D.C. motor Motor Characterization Motor Speed Control Project

System Introduction to Sensory Physiology: Sensory- Motor System General Properties of

Text Categorization (I) Luo Si Department of Computer Science Purdue University Text

SENSORY EVALUATION .. Basics of Sensory evaluation, Tools, Techniques, Methods and

PHGY 212 - Physiology SENSORY PHYSIOLOGY Sensory Neural Pathways Martin Par Assistant

Motor Skills What are motor skills? A motor skill is a learned sequence of movements that

Motor Diagnostic and Motor Motor Diagnostic and Motor Health Study Health Study Dr. Howard W.

Motor/Prop Matching Lecture 14 ME EN 415 Andrew Ning aning@byu.edu Motor/Prop Matching prop

Extracting Structured Semantic Spaces from Corpora Marco Baroni Center for Mind/Brain Sciences

Affordances SWEN-445 What is an Affordance? Psychologist James Gibson, Theory of

unpacking the buyer decision process Products have 3 time zones 1. The purchase decision -

Natural Image Statistics and Neural Representation Eero P Simoncelli Bruno A Olshusen Center for

Kalman Filtering Pieter Abbeel UC Berkeley EECS Many slides adapted from Thrun, Burgard and Fox,

Lecture 3 Interaction Fundamentals Terry Winograd CS147 - Introduction to Human-Computer

Col ollaborative laborative In Info formation mation Seeki king: ng: On tra raca cabi

WORKSHOP: REAL LIVE DATA FOR CS COURSES CCSC:NE 2018 @ UNIVERSITY OF NEW HAMPSHIRE NADEEM