Translating and Evolving: Towards a model of language change in - PowerPoint PPT Presentation

Translating and Evolving: Towards a model of language change in DisCoCat Tai-Danae Bradley, Martha Lewis, Jade Master, and Brad Theilman SYCO3, March 2019 Oxford, UK Bradley & al Translating & Evolving 1/20

Background www.appliedcategorytheory.org Bradley & al Translating & Evolving 2/20

How can we incorporate language change and language learning into DisCo? Translation - construed in a broad sense Translation between two different languages - French to English and back Translation between different levels of complexity of the same language Translation between two users of one language - updating each other’s language models The aim is to provide a categorical description of translation that encompasses these three different notions. Bradley & al Translating & Evolving 3/20

Preliminary concepts We introduce the notion of a language model that unifies the product space representation of [Coecke et al., 2010] and the functorial representation of [Kartsaklis et al., 2013] This allows us to formalize the notion of lexicon which had previously been only loosely defined in the DisCoCat framework We then describe how to build a dictionary between two lexicons Bradley & al Translating & Evolving 4/20

Categorical Language Models and Translations Definition Let J be a category which is freely monoidal on some set of grammatical types. A distributional categorical language model or language model for short is a strong monoidal functor F : ( J , · ) → (FVect , ⊗ ) Bradley & al Translating & Evolving 5/20

� � � Categorical Language Models and Translations Definition A translation T = ( j , α ) from a language model F : J → FVect to a language model F ′ : J ′ → FVect is a monoidal functor j : J → J ′ and a monoidal natural transformation α : F ⇒ F ′ ◦ j. Pictorially, ( j , α ) is the following 2-cell F J FVect ⇒ α j F ′ J ′ Bradley & al Translating & Evolving 6/20

Categorical Language Models and Translations Definition Let DisCoCat be the category with distributional categorical language models as objects, translations as morphisms. Composition of morphisms runs as follows: Given translations ( T = j , α ) and T ′ = ( j ′ , α ′ ) , the composite translation is computed pointwise. That is, T ′ ◦ T is the translation ( j ′ ◦ j , α ′ ◦ α ) where α ′ ◦ α is the vertical composite of the natural transformations α and α ′ . Bradley & al Translating & Evolving 7/20

Product Space Representation Definition Let F : J → FVect be a language model and let K : FVect → Cat be a faithful functor. The product space representation of F with respect to K, denoted PS K ( F ) , is the Grothendieck construction of K ◦ F. Explicitly, PS K ( F ) is a category where an object is a pair ( g , − → u ) where g is an object of J and − → u is an object of K ◦ F ( g ) a morphism from ( g , − → u ) to ( h , − → v ) is a tuple ( r , f ) where r : g → h is a morphism in J and f : K ◦ F ( r )( − → u ) → − → v is a morphism in K ◦ F ( h ) the composite of ( r ′ , f ′ ): ( g , − → u ) → ( h , − → v ) and ( r , f ): ( h , − → v ) → ( i , − → x ) is defined by ( r , f ) ◦ ( r ′ , f ′ ) = ( r ◦ r ′ , f ◦ ( K ◦ F )( r )( f ′ )) Bradley & al Translating & Evolving 8/20

Product Space Representation What should we use for the functor K ? Definition Let V be a finite dimensional real vector space. Then, the free chaotic category on V , denoted C ( V ) , is a category where objects are elements of V and, for all − → u , − → v in V we include a unique arrow d ( − → u , − → v ): − → u → − → v labeled by the Euclidean distance d ( − → u , − → v ) between − → u and − → v . This construction extends to a functor C : FVect → Cat . For M : V → W , define C ( M ): C ( V ) → C ( W ) to be the functor which agrees with M on objects and sends arrows d ( − → u , − → v ) to d ( M − → u , M − → v ) . The morphisms in C ( V ) for a vector space V allow us to keep track of the relationships between different words in V . Bradley & al Translating & Evolving 9/20

Product Space Representation What should we use for the functor K ? When K = C as in Definition 5 the product space representation is as follows: objects are pairs ( g , − → u ) where g is a grammatical type and − → u is a vector in F ( g ). a morphism ( r , d ): ( g , − → u ) → ( h , − → v ) is: a type reduction r : g → h a positive real number d : C ◦ F ( r )( − → u ) → − → v Bradley & al Translating & Evolving 10/20

Product Space Representation Proposition ( PS K ( F ) is monoidal) For K = C and K = D, PS K ( F ) is a monoidal category with monoidal product given on objects by ( g , − → u ) ⊗ ( h , − → v ) = ( g · h , Φ g , h ( − → u ⊗ − → v )) and on morphisms by ( r , f ) ⊗ ( r ′ , f ′ ) = ( r · r ′ , Φ g , h ( f ⊗ f ′ )) where Φ g , h : K ◦ F ( g ) − → ⊗ K ◦ F ( h ) → K ◦ F ( g · h ) is the natural isomorphism included in the data of the monoidal functor K ◦ F. Bradley & al Translating & Evolving 11/20

Product Space Representation Proposition (Translations are monoidal) Let K : FVect → Cat be a fully faithful functor. Then there is a functor PS K : DisCoCat → MonCat , where MonCat is the category where objects are monoidal categories and morphisms are strong monoidal functors, that sends language models F : J → Cat to the monoidal category PS K ( F ) translations T = ( j , α ) to the strong monoidal functor PS K ( T ): PS K ( F ) → PS K ( F ′ ) where the functor PS K ( T ) acts as follows: On objects, PS K ( T ) sends ( g , − → u ) to ( j ( g ) , α g − → u ) . Suppose ( r , f ): ( g , − → u ) → ( h , − → v ) is a morphism in PS K ( F ) so that r : g → h is a reduction in J and f : F ( r )( − → u ) → − → v is a morphism in F ( h ) . Then PS K ( T ) sends ( r , f ) to the pair ( j ( r ) , α h ◦ f ) . Bradley & al Translating & Evolving 12/20

Defining the lexicon Definition Let F be a categorical language model and let W be a finite set of words, viewed as discrete category. Then a lexicon for F is a functor ℓ : W → PS ( F ) . This corresponds to a function from W into the objects of PS ( F ) . NB: We have now fixed K = C and dropped the subscript in PS ( F ) We extend this to phrases, i.e. finite sequences of words v 1 . . . v n ∈ W ∗ where W ∗ is the free monoid on W . This defines a unique object in PS ( F ): ( g , − → i =1 ℓ ( v i ) = ( g 1 , − → v 1 ) ⊗ . . . ⊗ ( g n , − → v ) := ⊗ n v n ) = ( g 1 · · · g n , − → v 1 ⊗ . . . ⊗ − → v n ) The extension of ℓ to W ∗ will be denoted by ℓ ∗ : W ∗ → PS ( F ) . Bradley & al Translating & Evolving 13/20

Defining a dictionary Definition Let ℓ : W → PS ( F ) and m : V → PS ( G ) be lexicons and let T be a translation from F to G. Then, the F - G dictionary with respect to T is the comma category ( PS ( T ) ◦ ℓ ∗ ) ↓ m ∗ denoted by Dict T . Since W and V are discrete categories, ( PS ( T ) ◦ ℓ ∗ ) ↓ m ∗ is a set of triples ( p , ( r , d ) , q ) where p ∈ W ∗ , q ∈ V ∗ and ( r , d ): ( PS ( T ) ◦ ℓ )( p ) → m ( q ) is a morphism in PS ( G ) . Bradley & al Translating & Evolving 14/20

Defining a dictionary Explicitly, let ℓ ( p ) = ( g , − → m ( q ) = ( g ′ , − → p ) and q ) then ( r , d ) is a type reduction r : j ( g ) → g ′ in the grammar category J a morphism d in C ◦ G ( g ′ ). Recall from definition 5 that this corresponds to a real number d ( − → p ′ , − → q ) denoting the distance between − → p ′ and − → q in G ( g ′ ). Bradley & al Translating & Evolving 15/20

Example Example (Translation at the phrase level) J En = C ( { n E , s E } ), J S = C { n S , s S } Consider distributional categorical language models F En : J En → Cat and F Sp : J Sp → Cat We take the fragment consisting just of nouns and intransitive verbs. Let F En ( n ) = N En , F En ( s ) = S En , F Sp ( n ) = N Sp and F Sp ( s ) = S Sp . To specify PS ( T ) we set: j ( n E ) = n S and j ( s E ) = j ( s S ) α n E , α s E to be linear transformations that ‘behave in the right way’ Bradley & al Translating & Evolving 16/20

Example What does ‘behave in the right way’ mean? Backpedal: Suppose we are interested only in nouns. Then we can learn a linear transformation from lists of noun vectors and their translations. However, α is a monoidal natural transformation, so α gh = α g ⊗ α h for every product type gh . This holds if α n E , α s E are unitary In general, α n E , α s E won’t be unitary Bradley & al Translating & Evolving 17/20

Future Work Our current model doesn’t deal with changing the order of words, like adj-noun/noun-adj. This is a matter of priority Use the metric structure of vector spaces to form dictionaries Work with the diagrammatic calculus Investigate meaning change and negotiated meaning Investigate the implementation of compositional translation matrices. Bradley & al Translating & Evolving 18/20

If you liked this... ... try this! https://sites.google.com/view/semspace2019/home Bradley & al Translating & Evolving 19/20

Translating and Evolving: Towards a model of language change in - PowerPoint PPT Presentation

Translating and Evolving: Towards a model of language change in DisCoCat Tai-Danae Bradley, Martha Lewis, Jade Master, and Brad Theilman SYCO3, March 2019 Oxford, UK Bradley & al Translating & Evolving 1/20 Background

Evolving Data Access Evolving Data Access Evolving Data Access Evolving Data Access

UI Evolving Platform Evolving Architecture Evolving About Me Xianning ( Pronunciation

Evolving Neural Networks This lecture is based on Xin Yaos tutorial slides From Evolving

Welcome To The ResQIPS Research Symposium Keynote Discussion - Translating Health Policy Into

Translating Henri Poincar Bruce D. Popp, Ph.D. ATA Certified Translator, Fr>En ATA 57th

A New Two- -Scale Mix Model: Towards Scale Mix Model: Towards a Multi a Multi- - A New Two A

Employee Wellbeing CONVENTIONAL THE EVOLVING NORMAL Employee Wellbeing CONVENTIONAL THE

Translating Executable Software Models with micca Andrew Mangogna Model Realization 24th Annual

Translating Neuralese Jacob Andreas, Anca Dragan, and Dan Klein Learning to Communicate [Wagner

TRANSLATING TECHNOLOGY DISRUPTION INTO OPPORTUNITY FOR PROCUREMENT Paula Martinez COE CLUB

Compositional correctness of IP-based system design: Translating C/C++ Models into SIGNAL

CS449/649: Human-Computer Interaction Spring 2017 Lecture VI Anastasia Kuzminykh Translating

Translating BNGL models into Kappa - Our experience Kim Quyn L DI - NS August 29, 2017

Measuring the Fitness of Evolving Networks Section 6.3 TYLER SHEPHERD Overview 1. Recap of

Translating OCL to Natural Language David Burke and Kristofer Johannisson Background Existing

Outline Language learning Computers Computers Computers Topic 6: CALL Topic 6: CALL Topic 6:

(On Goles Universal Machines) B. Martin University Nice-Sophia Antipolis, I3S DISCO 2011,

Modern Session Encryption David Wong outline 3. NOISE 2. STROBE 4. ??? 1. KECCAK Sponge

100% Big Data 0% Hadoop 0% Java Pavlo Baron, codecentric Wednesday, November 7, 12

Using Disco and MapReduce to study mRNA complexity Dan Williams SciPy 2011 Lightning Talk

Cooperative Distributed Energy Scheduling for Smart Homes Applying Stochastic Model Predictive

SCALING UBERS REALTIME MARKET PLATFORM QCON LONDON 2015 partners riders dispatch post

Covariance Matrices & All-pairs Similarity Reza Zadeh Introduction Reza Zadeh First Pass

Cheap Labor Can Be Expensive Ning Chen, Anna R. Karlin Michael Knig The Problem 5 4.5 4 2

Sambuz

Useful Links

Newsletter

Mail Us

Translating and Evolving: Towards a model of language change in - PowerPoint PPT Presentation

Translating and Evolving: Towards a model of language change in DisCoCat Tai-Danae Bradley, Martha Lewis, Jade Master, and Brad Theilman SYCO3, March 2019 Oxford, UK Bradley & al Translating & Evolving 1/20 Background

Evolving Data Access Evolving Data Access Evolving Data Access Evolving Data Access

UI Evolving Platform Evolving Architecture Evolving About Me Xianning ( Pronunciation

Evolving Neural Networks This lecture is based on Xin Yaos tutorial slides From Evolving

Welcome To The ResQIPS Research Symposium Keynote Discussion - Translating Health Policy Into

Translating Henri Poincar Bruce D. Popp, Ph.D. ATA Certified Translator, Fr&gt;En ATA 57th

A New Two- -Scale Mix Model: Towards Scale Mix Model: Towards a Multi a Multi- - A New Two A

Employee Wellbeing CONVENTIONAL THE EVOLVING NORMAL Employee Wellbeing CONVENTIONAL THE

Translating Executable Software Models with micca Andrew Mangogna Model Realization 24th Annual

Translating Neuralese Jacob Andreas, Anca Dragan, and Dan Klein Learning to Communicate [Wagner

TRANSLATING TECHNOLOGY DISRUPTION INTO OPPORTUNITY FOR PROCUREMENT Paula Martinez COE CLUB

Compositional correctness of IP-based system design: Translating C/C++ Models into SIGNAL

CS449/649: Human-Computer Interaction Spring 2017 Lecture VI Anastasia Kuzminykh Translating

Translating BNGL models into Kappa - Our experience Kim Quyn L DI - NS August 29, 2017

Measuring the Fitness of Evolving Networks Section 6.3 TYLER SHEPHERD Overview 1. Recap of

Translating OCL to Natural Language David Burke and Kristofer Johannisson Background Existing

Outline Language learning Computers Computers Computers Topic 6: CALL Topic 6: CALL Topic 6:

(On Goles Universal Machines) B. Martin University Nice-Sophia Antipolis, I3S DISCO 2011,

Modern Session Encryption David Wong outline 3. NOISE 2. STROBE 4. ??? 1. KECCAK Sponge

100% Big Data 0% Hadoop 0% Java Pavlo Baron, codecentric Wednesday, November 7, 12

Using Disco and MapReduce to study mRNA complexity Dan Williams SciPy 2011 Lightning Talk

Cooperative Distributed Energy Scheduling for Smart Homes Applying Stochastic Model Predictive

SCALING UBERS REALTIME MARKET PLATFORM QCON LONDON 2015 partners riders dispatch post

Covariance Matrices &amp; All-pairs Similarity Reza Zadeh Introduction Reza Zadeh First Pass

Cheap Labor Can Be Expensive Ning Chen, Anna R. Karlin Michael Knig The Problem 5 4.5 4 2

Sambuz

Useful Links

Newsletter

Mail Us

Translating Henri Poincar Bruce D. Popp, Ph.D. ATA Certified Translator, Fr>En ATA 57th

Covariance Matrices & All-pairs Similarity Reza Zadeh Introduction Reza Zadeh First Pass